Big Data Discount: How UC Santa Cruz Uses Mesos & Amazon EC2 Spot to Enable Low Cost Cancer Research

Published May 1, 2017, 17:09
On this episode of This is My Architecture, Mary Goldman, Design and Outreach Engineer at the UC Santa Cruz Genomics Institute, explains how they process genomic sequencing data on AWS. With a need to crunch data measured in petabytes, they designed a low-cost solution using a combination of Docker containers and EC2 Spot instances. TOIL, the pipeline management system they built, is open source (link: github.com/BD2KGenomics/toil) and was recently published (link: dx.doi.org/10.1038/nbt.3772) in Nature Biotechnology.

Learn more about This Is My Architecture at amzn.to/2qfaOQc.

Subscribe:
More AWS videos: bit.ly/2O3zS75
More AWS events videos: bit.ly/316g9t4

#AWS