Managing Large-scale Probabilistic Databases

247
Следующее
06.09.16 – 2121:13:06
Towards Contextual Text Mining
Популярные
Опубликовано 6 сентября 2016, 18:40
For the next generation of data-management applications, such as sensor-based monitoring, data integration, and information extraction, data processing is the dominant cost. Often, the data driving these applications are uncertain, for example, due to missed, inconsistent, or imprecise sensor readings. Unfortunately, traditional data-management systems provide little or no support for managing uncertainty. To remedy this, my dissertation advocates an approach for data management in which uncertainty is modeled using probabilities. The cost of modeling imprecision using probabilities is that basic data-management tasks, such as querying, become theoretically and practically more difficult. Thus, the key challenge in managing large-scale probabilistic data is efficiency. In this talk, I will discuss the fundamental techniques that I developed in my dissertation to build a probabilistic database capable of handling large, imprecise datasets: these techniques include top-k processing with probabilities, materialized views, approximate lineage, and extensional processing for complex analytic queries. This work resulted in two systems: Mystiq, the first system to support complex queries on gigabytes of probabilistic relational data, and Lahar, the first system to support rich event-style queries on large, probabilistic streams.
Случайные видео
231 день – 251 2180:54
Waymo…The Real Driverless Car
28.06.23 – 193 3418:00
Your new favorite! Shokz OpenFit
25.04.22 – 1 2297:30
Customer success Q1 2022
04.02.21 – 3 7515:58
Global names and replication
автотехномузыкадетское