AWS Financial Services Cloud Symposium 2019 – New York: Data Lakes and Analytics Design Patterns
607
20.2
Amazon Web Services776 тыс
Следующее
Популярные
Опубликовано 17 апреля 2019, 17:36
Learn more about AWS at – amzn.to/2DifWIz
Dow Jones has created a new data platform that collects data from all lines of business and consolidates it into a single data lake. This data lake harnesses Amazon S3 for data ingestion; Amazon EMR/Spark for data transformation; Amazon Redshift to define external tables (utilizing AWS Glue as the data catalog); and Airflow for scheduling ETL jobs. A critical advantage of this approach is that the data is efficiently cleansed, prepared, and keyed to allow more time for data analysis, including the application of machine learning using Amazon SageMaker. Learn how Dow Jones built this solution and took advantage of an AWS Well Architected Review to maximize performance, reliability, security, cost optimization, and operational excellence.
Dow Jones has created a new data platform that collects data from all lines of business and consolidates it into a single data lake. This data lake harnesses Amazon S3 for data ingestion; Amazon EMR/Spark for data transformation; Amazon Redshift to define external tables (utilizing AWS Glue as the data catalog); and Airflow for scheduling ETL jobs. A critical advantage of this approach is that the data is efficiently cleansed, prepared, and keyed to allow more time for data analysis, including the application of machine learning using Amazon SageMaker. Learn how Dow Jones built this solution and took advantage of an AWS Well Architected Review to maximize performance, reliability, security, cost optimization, and operational excellence.
Свежие видео