Skip to main content

10 docs tagged with "data_pipeline"

View all tags

Amazon Athena

Amazon Athena is a serverless, interactive query service that enables you to analyze data directly in Amazon S3 and other sources using standard SQL, paying only for the queries you run.

Amazon Athena

Amazon Athena is a serverless, interactive query service that enables you to analyze data directly in Amazon S3 and other sources using standard SQL, paying only for the queries you run.

Amazon Data Firehose

Amazon Data Firehose is a fully managed service for loading streaming data into data lakes, warehouses, and analytics services in near real-time with automatic scaling and data transformation.

Amazon EMR

Amazon EMR is a big data platform for processing vast amounts of data using open-source frameworks like Apache Spark, Hadoop, and Hive with managed infrastructure and automatic scaling.

Amazon Kinesis Data Streams

Amazon Kinesis Data Streams is a serverless streaming data service for real-time ingestion of terabytes of data from applications, streams, and sensors with automatic scaling.

Amazon OpenSearch Service

Amazon OpenSearch Service is a managed service for running and scaling OpenSearch clusters, used for log analytics, real-time application monitoring, and full-text search.

Amazon QuickSight

Amazon QuickSight is a scalable, serverless, cloud-native business intelligence (BI) service that allows you to create and publish interactive dashboards and reports.

Amazon Redshift

Amazon Redshift is a fully managed, petabyte-scale data warehouse service optimized for high-performance analysis and business intelligence on large structured and semi-structured datasets.

AWS Glue

AWS Glue is a serverless ETL (extract, transform, and load) service that simplifies data preparation, transformation, and loading for analytics, using the Glue Data Catalog for metadata.

AWS Glue Data Catalog

The AWS Glue Data Catalog is a centralized, managed metadata repository that enhances data discovery and provides a unified schema for data across various AWS services.