Interactive Genomic Data Analysis Using Amazon Athena

824
22.9
Опубликовано 10 июля 2018, 21:34
Learn more about AWS at amzn.to/2ukXIRd

London Innovation Series 2017
The session kicks off with an introduction to Big Data in the healthcare realm, along with some Big Data use cases leveraging AWS Big Data platform. This is followed by an overview of Amazon Athena – an interactive query service to analyse data on Amazon S3 – that can query different file types straight from S3 including Parquet files. Pratim then goes on to introduce ADAM – a Spark Genomics Library and analysis platform with specialized file formats. The latter part of the presentation demonstrates preparation and analysis of genomic data in ADAM parquet files using Spark and conducting quality control of genomic sequencing by analysing ‘variations’.

Speaker: Pratim Das, Specialist Solutions Architect, AWS
автотехномузыкадетское