Linkedin

Modern Data Analytics Reference Architecture on AWS Diagram

Project Overview

Project Detail

  1. Data is collected from multiple data sources across the enterprise, SaaS applications, edge devices, logs, streaming media, flat files, and social networks.

  2. Based on the type of the data source, AWS Database Migration Service (AWS DMS), AWS DataSyncAmazon KinesisAmazon Managed Streaming for Apache KafkaAWS IoT CoreAmazon AppFlow, and AWS Transfer Family ingest the data into a data lake in AWS.

  3. AWS Data Exchange integrates third-party data into the data lake.

  4. AWS Lake Formation builds the scalable data lake, and Amazon S3 is used as the data lake storage. AWS Glue Data Catalog is a centralized metadata repository.

  5. AWS Lake Formation also enables unified governance to centrally manage the security, access control, and audit trails.

  6. AWS Glue and AWS Glue DataBrew catalog, transform, enrich, move, and replicate data across multiple data stores and the data lake.

  7. Amazon Managed Service for Apache Flink is used to transform and analyze streaming data in real time.

  8. Amazon QuickSight provides machine learning (ML)-powered business intelligence.

  9. Amazon OpenSearch Service offers operational analytics.

  10. Amazon Redshift is a cloud data warehouse. With federated queries, you can query and analyze data across operational databases, data warehouses, and data lakes.

  11. Amazon EMR provides the cloud big data platform for processing vast amounts of data using open-source tools.

  12. Amazon SageMaker and AWS AI services can build, train and deploy ML models and add intelligence to your applications.

  13. Amazon Redshift Spectrum and Amazon Athena enable interactive querying, analyzing, and processing capabilities. Athena supports Apache Iceberg for data and AWS Glue data catalog.

  14. Amazon Aurora offers high performance and availability at global scale. Aurora supports zero-ETL integration with Amazon Redshift.

https://docs.aws.amazon.com/architecture-diagrams/latest/modern-data-analytics-on-aws/modern-data-analytics-on-aws.html?did=wp_card&trk=wp_card

To know more about this project connect with us

Modern Data Analytics Reference Architecture on AWS Diagram