Sherlock

A growing number of data-centric systems let data scientists of varying skill levels interactively manipulate, analyze, and explore large structured data sets. Few systems, however, let data scientists and novice users interactively explore large collections of unstructured text documents from heterogeneous sources.

We therefore present Sherlock, a new system for interactive text summarization. Automatically producing textual summaries is an important step toward understanding a collection of topic-related documents, with real-world applications in journalism, medicine, and many other fields. However, none of the existing summarization systems allow users to provide feedback at interactive speed. Sherlock therefore integrates a new approximate summarization model that can guarantee interactive speeds even for large text collections, keeping the user engaged in the process.

Researchers

Name: Benjamin Hättasch, M.Sc. (Doctoral Researcher)
Office: S2|02 D110
Phone: -25603

Publications

Hättasch, Benjamin ; Meyer, Christian M. ; Binnig, Carsten (2019):
Interactive Summarization of Large Document Collections.
In: HILDA'19: Proceedings of the ..., Workshop on Human-In-the-Loop Data Analytics, Amsterdam, Netherlands, 5 July 2019, pp. 1-4, ISBN 978-1-4503-6791-2,
DOI: 10.1145/3328519.3329129,
[Online edition: https://hilda.io/2019/proceedings/HILDA2019_paper_4.pdf],
[Conference publication]

Hättasch, Benjamin ; Alonso, Omar ; Silvello, Gianmaria (eds.) (2018):
Towards Interactive Summarization of Large Document Collections.
In: First Biennial Conference on Design of Experimental Search & Information Retrieval Systems, Bertinoro, Italy, 28-31 August 2018, CEUR Workshop Proceedings, p. 103,
[Online edition: http://ceur-ws.org/Vol-2167/short6.pdf],
[Conference publication]

P. V. S., Avinesh ; Hättasch, Benjamin ; Özyurt, Orkan ; Binnig, Carsten ; Meyer, Christian M. (2018):
Sherlock: A System for Interactive Summarization of Large Text Collections.
In: Proceedings of the VLDB Endowment, vol. 11, The 44th International Conference on Very Large Data Bases (VLDB 2018), Rio de Janeiro, Brazil, 27-31 August 2018, pp. 1902-1905, DOI: 10.14778/3229863.3236220,
[Online edition: http://www.vldb.org/pvldb/vol11/p1902-p.v.s..pdf],
[Conference publication]