Deep Neural Networks (DNNs) have successfully been used to replace classical DBMS components such as indexes or query optimizers with learned counterparts. However, commercial vendors are still hesitating to put DNNs into their DBMS stack since these models not only lack explainability but also have other significant downsides such as the requirement for high amounts of training data resulting from the need to learn all behavior from data.
In this project, we investigate alternative approaches that incorporate domain knowledge to obtain more reliable learned DBMS components requiring less training data. While the high-level design of the DBMS component is still specified by code, we optimize it for a particular workload and hardware using differentiable programming. Differentiable programming is a recent shift in machine learning away from the direction taken by DNNs towards simpler models that take advantage of the problem structure. We successfully applied this technique to learned indexing and cost modeling for query optimization.
|Benjamin Hilprecht M.Sc.|
|S2|02 D110||+49 6151 email@example.com-...|
|Tiemo Bang M.Sc.|
|Muhammad El-Hindi M.Sc.|
|S2|02 E115||+49 6151 firstname.lastname@example.org-...|
|Benjamin Hättasch M.Sc.|
|S2|02 D110||+49 6151 email@example.com-...|
|Robin Rehrmann M.Sc.|
External Ph.D. Student
|Lasse Thostrup M.Sc.|
|S2|02 D111||+49 6151 firstname.lastname@example.org-...|
|Tobias Ziegler M.Sc.|
|S2|02 D111||+49 6151 email@example.com-...|
Hilprecht, Benjamin ; Binnig, Carsten ; Bang, Tiemo ; El-Hindi, Muhammad ; Hättasch, Benjamin ; Khanna, Aditya ; Rehrmann, Robin ; Röhm, Uwe ; Schmidt, Andreas ; Thostrup, Lasse ; Ziegler, Tobias (2020):
DBMS Fitting: Why should we learn what we already know?
In: CIDR 2020, 10th Conference on Innovative Data Systems Research, Amsterdam, January 12-15, 2020, [Online-Edition: http://cidrdb.org/cidr2020/papers/p34-hilprecht-cidr20.pdf],