DeepDB

Modern machine learning techniques can significantly improve classical database components such as the query optimizer to improve the overall performance. To train these learned DBMS components a representative set of queries has to be executed and used to train a machine learning model. This workload-driven approach, however, has two major downsides. First, collecting the training data can be very expensive, since all queries need to be executed on potentially large databases. Second, training data has to be recollected when the workload or the database changes.

In this project, we instead propose a data-driven approach for learned DBMS components which directly supports changes of the workload and data without the need of retraining. To achieve this, we learn deep probabilistic models over different parts of the database schema and show how to combine them efficiently. Indeed, one may now expect that this comes at a price of lower accuracy since workload-driven approaches can make use of more information. However, we demonstrated that our data-driven approach not only provides better accuracy than state-of-the-art learned components for cardinality estimation but also generalizes better to unseen queries.

Researchers

  Name Contact
Dr. rer. nat. Benjamin Hilprecht

Publications

Loading...
Loading data from TUbiblio…

Error on loading data

An error has occured when loading publications data from TUbiblio. Please try again later.

  • {{ year }}

    • ({{ publication.date.toString().substring(0,4) }}):
      {{ publication.title }}.
      In: {{ publication.series }}, {{ publication.volume }}, In: {{ publication.book_title }}, In: {{ publication.publication }}, {{ publication.journal_volume}} ({{ publication.number }}), ppp. {{ publication.pagerange }}, {{ publication.place_of_pub }}, {{ publication.publisher }}, {{ publication.institution }}, {{ publication.event_location }}, {{ publication.event_dates }}, ISSN {{ publication.issn }}, e-ISSN {{ publication.eissn }}, ISBN {{ publication.isbn }}, {{ labels[publication.type]?labels[publication.type]:publication.type }}
    • […]

Number of items in this list: {{ publicationsList.length }}
Only the {{publicationsList.length}} latest publications are displayed here.

View complete list at TUbiblio View this list at TUbiblio