The Coming Genomics Software Revolution?

204
Следующее
08.08.16 – 6751:31:16
Big Data Platforms
Популярные
Опубликовано 8 августа 2016, 19:12
With the cost to sequence a full human genome soon to fall below US$1,000, most people will have their DNA sequenced and stored in a database along with their medical records. The hardware revolution has largely occurred; this session will focus on the software issues that remain. If there were a database of a million or more fully sequenced genomes together with phenotype information such as disease and treatment codes, what software would be necessary to mine it for biological insights that might eventually lead to drug discovery and personalized medicine? Cancer is a fundamentally genomic disease, and drugs such as Gleevec have shown the effectiveness of combatting the genomic pathways of disease. Could a large genomic database enable progress like this for other cancers, or for diseases other than cancer? What are the medical and systems implications of such a database? Could genomics engender a new software industry? We will discuss the existing TCGA (The Cancer Genome Atlas) database currently consisting of 5,000 cancer genomes and the open algorithmic problems that arise in making use of the data. We will describe parallel infrastructure and computing paradigms for genome computation, and new systems issues that must be solved. We will discuss issues in scaling genomic inference in the face of confounding factors. We will also describe challenges that must be solved in order to query a vast genome database interactively in a few seconds.
автотехномузыкадетское