Skip to main content

2 docs tagged with "data_processing"

View all tags

Amazon EMR

Amazon EMR is a big data platform for processing vast amounts of data using open-source frameworks like Apache Spark, Hadoop, and Hive with managed infrastructure and automatic scaling.

AWS Glue

AWS Glue is a serverless ETL (extract, transform, and load) service that simplifies data preparation, transformation, and loading for analytics, using the Glue Data Catalog for metadata.