Process Web Logs with AWS Data Pipeline, Amazon EMR, and Hive

23 024
232.6
Опубликовано 25 января 2013, 17:10
In this video, you will learn how to use AWS Data Pipeline and a console template to create a functional pipeline.

The pipeline uses an Amazon EMR cluster and a Hive script to read Apache web access logs, select certain columns, and write the reformatted output to an Amazon S3 bucket.

Learn more about AWS Data Pipeline at aws.amazon.com/datapipeline
автотехномузыкадетское