AWS re:Invent 2016: Best Practices for Apache Spark on Amazon EMR (BDM301)


Amazon Web Services

Published December 1, 2016, 20:49

Organizations need to perform increasingly complex analysis on data — streaming analytics, ad-hoc querying, and predictive analytics — in order to get better customer insights and actionable business intelligence. Apache Spark has recently emerged as the framework of choice to address many of these challenges. In this session, we show you how to use Apache Spark on AWS to implement and scale common big data use cases such as real-time data processing, interactive data science, predictive analytics, and more. We talk about common architectures, best practices to quickly create Spark clusters using Amazon EMR, and ways to integrate Spark with other big data services in AWS. This session will feature DataXu, a provider of programmatic marketing and analytics software. DataXu will share how they architected their petabyte-scale ETL processing pipeline and data science workflows using Spark.
