Advanced Data Cleanup Techniques using Cloud Dataprep (Cloud Next '19)

19 419
29.4
Опубликовано 10 апреля 2019, 22:37
In this session, we’ll take a deep dive into Cloud Dataprep by Trifacta and how its advanced capabilities address the complex data manipulations required by customers for common use cases like sales analytics and category management. Challenges include working with third-party data with different formats and standards needed to assess and transform to be combined into a single consistent view. After structuring and assessing data quality with Cloud Dataprep, joining (fuzzy matching), and unioning data, you need to pivot and aggregate the data into various logical time sessions to provide meaningful insights and useful pattern trends. Based on this use case, we will demonstrate the advanced features of Cloud Dataprep to master data preparation and generate an easy-to-manage, self-documented logic that can be scheduled with dynamic parameters for repeatable outcomes.

Build with Google Cloud → bit.ly/2K8Pdnu

Watch more:
Next '19 Data Analytics Sessions here → bit.ly/Next19DataAnalytics
Next ‘19 All Sessions playlist → bit.ly/Next19AllSessions

Subscribe to the GCP Channel → bit.ly/GCloudPlatform


Speaker(s): Cindy Sood, Sean Ma

Session ID: DA309
product:BigQuery,Cloud for Marketing; event: Google Cloud Next 2019; re_ty: Publish; product: Cloud - Data Analytics - Dataprep; fullname: Cindy Sood, Sean Ma;
автотехномузыкадетское