AWS Financial Services Cloud Symposium 2019 – New York: Data Lakes and Analytics Design Patterns

607
20.2
Опубликовано 17 апреля 2019, 17:36
Learn more about AWS at – amzn.to/2DifWIz 
Dow Jones has created a new data platform that collects data from all lines of business and consolidates it into a single data lake. This data lake harnesses Amazon S3 for data ingestion; Amazon EMR/Spark for data transformation; Amazon Redshift to define external tables (utilizing AWS Glue as the data catalog); and Airflow for scheduling ETL jobs. A critical advantage of this approach is that the data is efficiently cleansed, prepared, and keyed to allow more time for data analysis, including the application of machine learning using Amazon SageMaker. Learn how Dow Jones built this solution and took advantage of an AWS Well Architected Review to maximize performance, reliability, security, cost optimization, and operational excellence.
автотехномузыкадетское