Dr. rer. nat. Benjamin Hättasch

Postdoctoral Researcher

Working area(s)

Machine Learning for Data Engineering, DFKI

Contact

work +49 631 205752900
fax +49 6151 16-25602

Work S2|02 E112
Hochschulstr. 10
64289 Darmstadt

Despite its importance, accessing information in storage systems or raw data is challenging or impossible for most people due to the sheer amount and heterogeneity of data as well as the overheads and complexities of existing systems. Data-driven and AI-based approaches make it possible to provide the necessary information access for many tasks at scale, but can often only be used by experts and are expensive to use.

In my research, I therefore develop low-overhead approaches that allow data scientists, domain experts, and end users to access and manipulate information quickly and easily. These approaches should work without large amounts of training data, manual annotation or extraction of information, or extensive computation, but still adapt to the special terminology of different domains and the individual information needs of the users. Moreover, they should be usable without extensive training; we thus aim to create ready-to-use systems that provide intuitive or familiar ways of interaction, e.g., chatbot-like natural language input or graphical user interfaces.

Beyond that, I'm interested in Ethical Aspects of AI and Machine Learning, in particular Explainability, Transparency, and Fairness. Additionally, I try to incorporate relevant aspects of Human-Computer Interaction.

I supervise or have supervised the following students or interns:

  • Robin Harth (2025): Evaluating Preprocessing in AutoML approaches (bachelor thesis)
  • Noel Goldschmitdt (2024/25): Sloppy SQL (bachelor thesis)
  • Leon Krüger (2024): Schemaless Insertions for Self-Organizing Databases (bachelor thesis)
  • Jonathan Markgraf (2023/24): Towards Ad-hoc Multi-relational Table Retrieval from Unstructured Texts (master thesis)
  • Jannis Bauer (2023): Towards Interactive Generalized Entity Extraction from Text Collections (bachelor thesis)
  • Samar Syed (2021): DOC2DB: Turning Text into Relational Data Structures Automatically (master thesis)
  • Sebastian Bremser (2021): Towards Cluster-based sampling for Interactive Multi-Document Summarization using Transformers (bachelor thesis)
  • Jan-Micha Bodensohn (2020): Interactive Structured Data Extraction from Document Collections (bachelor thesis)
  • Michael Troung-Ngoc (2020): Automatisches Schema Matching mit modernen NLP Ansätzen (master thesis)
  • Arslan Yasin (2019/20): Between the lines of ArXiv (master thesis)
  • Nadja Geisler (2018/19): Enhancing Natural Language Interfaces to Databases (master thesis)
  • Ajay Sheoran (2019): research intern
  • Orkan Özyurt (2018): Interactive Text Summarization for Large Corpora (master thesis)

Beyond that, I supervised over twenty groups in the (extended) systems lab course and the software engineering lab course for undergraduates (Bachelorpraktikum).

I'm the research manager of the DFKI Site Darmstadt and a postdoctoral researcher at the Systems Group. In December 2023 I defended my PhD thesis titled “Democratizing Information Access Through Low Overhead Systems” under the supervision of Carsten Binnig.

I started working for the Data Management Lab as a PhD student in January 2018 and was part of the research training group AIPHES from July 2018 to June 2021. From July 2021 to March 2023, I was affiliated with the National High Performance Computing Center for Computational Engineering Sciences (NHR4CES).

From March 2021 to February 2023, I led the BMBF-funded project INTEXPLORE – Interactive Structured Text Exploration, which I acquired as part of the Software Campus.

In 2019, I co-organized the Conference of Aspiring Students in Tech Rhein-Main.

I obtained a Bachelor of Science in Computer Science in 2014, and a Master of Science in Computer Science as well as a Master of Science in Internet- and Web-Based Systems in 2017, all from the Technical University of Darmstadt.

Between 04/2014 and 09/2017, I worked as a student assistant for the Fraunhofer Institute for Computer Graphics (IGD). Additionally, I was a student assistant for several different courses at the Technical University of Darmstadt between 10/2012 and 12/2017.

During my studies, I was a member of the student council and part of different commissions and groups, and I was responsible for organizing the welcome week for first-year students five times.

You expect me to put private information about me on my professional homepage? Well -- since you could find all that information by using your favorite search engine anyways -- here we go:

I enjoy dancing, traveling, and taking pictures, am interested in railway systems, and am chairman of the friends' association of the conference of German-speaking student councils in computer science (KIF e.V.). Furthermore, I was part of the organization team of Antenne Bergstraße, a non-commercial radio station.

  • 36c3: Der Deep Learning Hype – Wie lange kann es so weitergehen? [More Information, Recording, German, English and French translation available]
  • AIDB 2020: It’s AI Match: A Two-Step Approach for Schema Matching Using Embeddings [Recording, Publication]
  • DESIRES 2021: Netted?! How to Improve the Usefulness of Spider & Co. [Recording, Publication]
  • DESIRES 2021: WannaDB: Ad-hoc Structured Exploration of Text Collections Using Queries. [Recording, Publication]
  • MRMCD 2018: Kreative KI – wenn der Computer das Setup hackt. How machine learning systems exploit bugs and bad reward functions [Recording, German, targeted at a broad audience]
  • MRMCD 2019: Cheating AI – Wenn Menschen die KI hacken. How humans trick AI systems. [Recording, German, targeted at a broad audience]
