About This Architecture
Medallion architecture on AWS implements a three-tier data lake pattern using S3 buckets for the Bronze (raw), Silver (cleaned), and Gold (curated) layers. Streaming events from Kinesis Data Streams and replicated database changes from DMS land in the Bronze S3 bucket, where Glue Crawlers catalog schemas and Glue ETL Jobs progressively refine the data from Bronze through Silver to Gold. Lake Formation governs access across all layers, IAM controls permissions, and KMS handles encryption; Athena, QuickSight, Redshift, and SageMaker consume the curated Gold data for analytics and ML. The architecture addresses the challenge of managing data quality and governance at scale, and because each layer is kept separate, data teams can trace lineage from raw ingestion to production-ready datasets. Fork this diagram on Diagrams.so to customize bucket naming conventions, add AWS Glue DataBrew for no-code transformations, or integrate EventBridge for orchestration triggers.
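The Bronze to Silver to Gold refinement can be sketched in plain Python to show what each layer is responsible for. This is a minimal illustration only, not a Glue ETL Job: the record fields (`order_id`, `customer`, `amount`), the sample rows, and the function names `to_silver` and `to_gold` are all assumptions made up for this example, and a real pipeline would read from and write to the S3 buckets described above.

```python
from collections import defaultdict

# Hypothetical raw records as they might land in the Bronze layer:
# untyped strings, a duplicate, and a row with a missing value.
BRONZE = [
    {"order_id": "1", "customer": "a", "amount": "10.50"},
    {"order_id": "1", "customer": "a", "amount": "10.50"},  # duplicate
    {"order_id": "2", "customer": "b", "amount": None},     # missing amount
    {"order_id": "3", "customer": "a", "amount": "4.50"},
]


def to_silver(bronze_rows):
    """Bronze -> Silver: drop incomplete rows, dedupe by order_id, cast types."""
    seen = set()
    silver = []
    for row in bronze_rows:
        if row["amount"] is None or row["order_id"] in seen:
            continue
        seen.add(row["order_id"])
        silver.append(
            {
                "order_id": int(row["order_id"]),
                "customer": row["customer"],
                "amount": float(row["amount"]),
            }
        )
    return silver


def to_gold(silver_rows):
    """Silver -> Gold: aggregate cleaned rows into a per-customer total."""
    totals = defaultdict(float)
    for row in silver_rows:
        totals[row["customer"]] += row["amount"]
    return dict(totals)


if __name__ == "__main__":
    silver = to_silver(BRONZE)
    gold = to_gold(silver)
    print(silver)
    print(gold)
```

Keeping the cleaning step and the aggregation step as separate functions mirrors the layer boundaries: the Silver output can be inspected or reprocessed on its own, which is what makes lineage from raw to curated data traceable.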