Dr.-Ing. Richard Eckart de Castilho

Postdoctoral researcher

Contact

work +49 6151 16-25299
fax +49 6151 16-25295

Work S2|02 B117
Hochschulstraße 10
64289 Darmstadt

I earned my Diplom in Computer Science at the TU Darmstadt in 2006. From that time until September 2008, I worked for the Linguistische Profile interdisziplinärer Register (LingPro) DFG project at the TU Darmstadt. I gathered more experience with the UIMA framework during a three-month stay at the IBM Watson Research Center in Hawthorne, New York.

Darmstadt Knowledge Processing Repository (DKPro)

I currently work as a technical lead on the Darmstadt Knowledge Processing Software Repository Project (DKPro). DKPro covers many projects. Some that I work more intensively on are listed below separately. My main responsibilities are development process optimization, release management, documentation and dissemination activities.

DKPro Core

DKPro Core is a collection of interoperable NLP components building up on the Apache UIMA framework. The project integrates many proven tools and resources from the NLP community into a common processing framework and to provide a common abstraction layer. Through the use of Apache UIMA, uimaFIT and Apache Maven, DKPro Core offers convenient access to NLP components and implementation of NLP pipelines. I am one of the main developers of DKPro Core.

DKPro Lab

DKPro Lab is a lightweight framework for parameter sweeping experiments. It allows to set up experiments consisting of multiple interdependent tasks in a declarative manner with minimal overhead. Data produced by a task for any particular parameter configuration is stored and re-used whenever possible to avoid the needless recalculation of results. Reports can be attached to each task to post-process the experimental results and present them in a convenient manner, e.g. as tables or charts. I am the main developer.

Apache uimaFIT™

Apache uimaFIT, an open source library that provides factories, injection, and testing utilities for Apache UIMA™. uimaFIT is a core technology at UKP because it greatly simplifies development of UIMA components. Thus it allows us to concentrate on implementing actual NLP functionality. I am a committer on the Apache UIMA project, working primarily on uimaFIT.

TT4J – TreeTagger for Java

TreeTagger for Java (TT4J) is an open source library which provides a Java API to Helmut Schmid's TreeTagger. TT4J is used by the DKPro TreeTagger UIMA component. I am the main developer.

JWPL – Java-based Wikipedia Library

JWPL (Java Wikipedia Library) is a free, Java-based application programming interface that allows to access all information contained in Wikipedia. I am a consulting developer.

I have co-organized the following events:

  • Summer term 2011 – Unstructured Information Management (Software project)
  • Winter term 2010/11 – Unstructured Information Management (Software project)
  • Summer term 2010 – Unstructured Information Management (Software project)
Jump to: 2020 | 2019 | 2018 | 2017 | 2016 | 2015 | 2014 | 2013 | 2012 | 2011 | 2009 | 2008
Number of items: 32.

2020

Klie, Jan-Christoph and Eckart de Castilho, Richard and Gurevych, Iryna (2020):
From Zero to Hero: Human-In-The-Loop Entity Linking in Low Resource Domains.
pp. 6982-6993, The 58th annual meeting of the Association for Computational Linguistics (ACL 2020), virtual Conference, 05.-10.07.2020, [Conference or Workshop Item]

2019

Eckart de Castilho, Richard and Ide, Nancy and Kim, Jin-Dong and Klie, Jan-Christoph and Suderman, Keith (2019):
Towards cross-platform interoperability for machine-assisted annotation.
In: Genomics & Informatics, 17 (2), pp. e19.. Genomics Inform, DOI: 10.5808/GI.2019.17.2.e19,
[Article]

2018

Eckart de Castilho, Richard and Klie, Jan-Christoph and Kumar, Naveen and Boullosa, Beto and Gurevych, Iryna (2018):
Linking Text and Knowledge using the INCEpTION annotation platform.
In: Proceedings of the 14th eScience IEEE International Conference, pp. 327-328,
The 14th eScience IEEE International Conference, Amsterdam, Netherlands, 29.10.2018--01.11.2018, DOI: 10.1109/eScience.2018.00077,
[Conference or Workshop Item]

Eckart de Castilho, Richard and Klie, Jan-Christoph and Kumar, Naveen and Boullosa, Beto and Gurevych, Iryna (2018):
INCEpTION - Corpus-based Data Science from Scratch.
Digital Infrastructures for Research (DI4R) 2018, Lisbon, Portugal, 9-11 October 2018, [Conference or Workshop Item]

Boullosa, Beto and Eckart de Castilho, Richard and Kumar, Naveen and Klie, Jan-Christoph and Gurevych, Iryna (2018):
Integrating Knowledge-Supported Search into the INCEpTION Annotation Platform.
Demo Papers, In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 127-132,
The 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31.10.2018--04.11.2018, [Conference or Workshop Item]

Labropoulou, Penny and Galanis, Dimitris and Lempesis, Antonis and Greenwood, Mark and Knoth, Petr and Eckart de Castilho, Richard and Sachtouris, Stavros and Georgantopoulos, Byron and Martziou, Stefania and Anastasiou, Lucas and Gkirtzou, Katerina and Manola, Natalia and Piperidis, Stelios (2018):
OpenMinTeD: A Platform Facilitating Text Mining of Scholarly Content.
In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018),
Proceedings of the 7th International Workshop on Mining Scientific Publications (WOSP 2018) at LREC 2018, Miyazaki, Japan, 07.05.2018--12.05.2018, [Conference or Workshop Item]

Klie, Jan-Christoph and Bugert, Michael and Boullosa, Beto and Eckart de Castilho, Richard and Gurevych, Iryna (2018):
The INCEpTION Platform: Machine-Assisted and Knowledge-Oriented Interactive Annotation.
In: Proceedings of the 27th International Conference on Computational Linguistics: System Demonstrations, pp. 5-9,
Association for Computational Linguistics, The 27th International Conference on Computational Linguistics (COLING 2018), Santa Fe, USA, 20.08.2018--26.08.2018, [Conference or Workshop Item]

Eckart de Castilho, Richard and Dore, Giulia and Labropoulou, Penny and Margoni, Thomas and Gurevych, Iryna (2018):
A Legal Perspective on Training Models for Natural Language Processing.
In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 1267-1274,
European Language Resources Association (ELRA), Miyazaki, Japan, [Conference or Workshop Item]

2017

Kouylekov, Milen and Lapponi, Emanuele and Oepen, Stephan and Eckart de Castilho, Richard (2017):
An Arranged Marriage: Integrating DKPro Core in the Language Analysis Portal.
In: Proceedings of the CLARIN Annual Conference 2017, pp. online,
CLARIN ERIC, Budapest, Hungary, [Conference or Workshop Item]

Biemann, Chris and Bontcheva, Kalina and Eckart de Castilho, Richard and Gurevych, Iryna and Yimam, Seid Muhie Ide, Nancy and Pustejovsky, James (eds.) (2017):
Collaborative Web-based Tools for Multi-layer Text Annotation.
In: Text, Speech, and Technology book series, In: The Handbook of Linguistic Annotation, pp. 229-256, Springer Netherlands, ISBN 978-94-024-0879-9,
DOI: 10.1007/978-94-024-0881-2,
[Book Section]

Boullosa, Beto and Eckart de Castilho, Richard and Geyken, Alexander and Lemnitzer, Lothar and Gurevych, Iryna (2017):
A tool for extracting sense-disambiguated example sentences through user feedback.
In: Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pp. 69-72,
Association for Computational Linguistics, Valencia, Spain, [Conference or Workshop Item]

Eckart de Castilho, Richard and Ide, Nancy and Lapponi, Emanuele and Oepen, Stephan and Suderman, Keith and Velldal, Erik and Verhagen, Marc (2017):
Representation and Interchange of Linguistic Annotation. An In-Depth, Side-by-Side Comparison of Three Designs.
In: Proceedings of the 11th Linguistics Annotation Workshop (LAW XI) at EACL 2017, pp. 67-75,
Association for Computational Linguistics, [Conference or Workshop Item]

2016

Eckart de Castilho, Richard (2016):
Automatic Analysis of Flaws in Pre-Trained NLP Models.
In: Proceedings of the Third International Workshop on Worldwide Language Service Infrastructure and Second Workshop on Open Infrastructures and Analysis Frameworks for Human Language Technologies (WLSI3nOIAF2) at COLING 2016, pp. 19-27,
Osaka, Japan, ISBN 978-4-87974-720-4,
[Conference or Workshop Item]

Eckart de Castilho, Richard and Mújdricza-Maydt, Éva and Yimam, Seid Muhie and Hartmann, Silvana and Gurevych, Iryna and Frank, Anette and Biemann, Chris (2016):
A Web-based Tool for the Integrated Annotation of Semantic and Syntactic Structures.
In: Proceedings of the workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH) at COLING 2016, pp. 76-84,
Osaka, Japan, [Conference or Workshop Item]

Przybyła, Piotr and Shardlow, Matthew and Aubin, Sophie and Bossy, Robert and Eckart de Castilho, Richard and Piperidis, Stelios and McNaught, John and Ananiadou, Sophia (2016):
Text mining resources for the life sciences.
In: Database, 2016, pp. 1-30. Oxford Academic, DOI: 10.1093/database/baw145,
[Article]

Miller, Tristan and Khemakhem, Mohamed and Eckart de Castilho, Richard and Gurevych, Iryna (2016):
Sense-annotating a lexical substitution data set with Ubyline.
In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 828-835,
European Language Resources Association (ELRA), ISBN 978-2-9517408-9-1,
[Conference or Workshop Item]

Eckart de Castilho, Richard (2016):
Interoperability = f(community, division of labour).
In: Proceedings of the Workshop on Cross-Platform Text Mining and Natural Language Processing Interoperability collocated with LREC 2016, pp. 24-28,
Portoroz, Slovenia, DOI: 10.5281/zenodo.161848,
[Conference or Workshop Item]

2015

Do Dinh, Erik-Lân and Eckart de Castilho, Richard and Gurevych, Iryna (2015):
In-tool Learning for Selective Manual Annotation in Large Corpora.
In: Proceedings of ACL-IJCNLP 2015 System Demonstrations, pp. 13-18,
Association for Computational Linguistics and The Asian Federation of Natural Language Processing, Beijing, China, [Conference or Workshop Item]

Moulin, Claudine and Gurevych, Iryna and Filatkina, Natalia and Eckart de Castilho, Richard Gippert, Jost and Gehrke, Ralf (eds.) (2015):
Analyzing Formulaic Patterns in Historical Corpora.
In: Corpus Linguistics and Interdisciplinary Perspectives on Language (CLIP), In: Historical Corpora. Challenges and Perspectives., pp. 51-64, Narr Publishing House, ISBN 978-3-8233-6922-6,
[Book Section]

2014

Schnober, Carsten and Heuwing, Ben and Weiß, Andreas and Eckart de Castilho, Richard and Strötgen, Robert and Gurevych, Iryna and Lässig, Simone and Womser-Hacker, Christa (2014):
Welt der Kinder: Knowledge and Interpretation of the World as Portrayed in Textbooks and Children Books between 1850 and 1918.
The Hague, CLARIN ERIC, Exploring Historical Sources with Language Technology: Results and Perspectives, The Hague, Netherlands, December, 8-9, 2014, [Conference or Workshop Item]

Eckart de Castilho, Richard and Gurevych, Iryna Ide, Nancy and Grivolla, Jens (eds.) (2014):
A broad-coverage collection of portable NLP components for building shareable analysis pipelines.
In: Proceedings of the Workshop on Open Infrastructures and Analysis Frameworks for HLT (OIAF4HLT) at COLING 2014, pp. 1-11,
Association for Computational Linguistics and Dublin City University, Dublin, Ireland, [Conference or Workshop Item]

Yimam, Seid Muhie and Eckart de Castilho, Richard and Gurevych, Iryna and Biemann, Chris Bontcheva, Kalina and Jingbo, Zhu (eds.) (2014):
Automatic Annotation Suggestions and Custom Annotation Layers in WebAnno.
In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. System Demonstrations, pp. 91-96,
Association for Computational Linguistics, Baltimore, MD, USA, [Conference or Workshop Item]

Eckart de Castilho, Richard (2014):
Natural Language Processing: Integration of Automatic and Manual Analysis.
Darmstadt, TU Darmstadt,
[Ph.D. Thesis]

Eckart de Castilho, Richard and Biemann, Chris and Gurevych, Iryna and Yimam, Seid Muhie (2014):
WebAnno: a flexible, web-based annotation tool for CLARIN.
In: Proceedings of the CLARIN Annual Conference (CAC) 2014, pp. online,
CLARIN ERIC, Soesterberg, Netherlands, [Conference or Workshop Item]

2013

Yimam, Seid Muhie and Gurevych, Iryna and Eckart de Castilho, Richard and Biemann, Chris (2013):
WebAnno: A Flexible,Web-based and Visually Supported System for Distributed Annotations.
In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (System Demonstrations) (ACL 2013), pp. 1-6,
Association for Computational Linguistics, Sofia, Bulgaria, [Conference or Workshop Item]

2012

Eckart de Castilho, Richard and Bartsch, Sabine and Gurevych, Iryna (2012):
CSniper - Annotation-by-query for non-canonical constructions in large corpora.
In: Proceedings of the 50th Meeting of the Association for Computational Linguistics (ACL) 2012 (Demo section), pp. 85-90,
Association for Computational Linguistics, Jeju Island, Korea, [Conference or Workshop Item]

2011

Eckart de Castilho, Richard and Gurevych, Iryna Agosti, Maristella and Ferro, Nicola and Thanos, Costantino (eds.) (2011):
A Lightweight Framework for Reproducible Parameter Sweeping in Information Retrieval.
In: DESIRE '11, In: Proceedings of the 2011 workshop on Data infrastructurEs for supporting information retrieval evaluation, pp. 7-10,
ACM, Glasgow, Scotland, UK, ISBN 978-1-4503-0952-3,
DOI: http://doi.acm.org/10.1145/2064227.2064248,
[Conference or Workshop Item]

Eckart de Castilho, Richard and Gurevych, Iryna (2011):
Semantic Service Retrieval based on Natural Language Querying and Semantic Similarity.
In: Proceedings of the 5th IEEE International Conference on Semantic Computing (IEEE-ICSC), pp. 173-176,
Stanford, CA, USA, [Conference or Workshop Item]

2009

Eckart de Castilho, Richard and Holtz, Mônica and Teich, Elke (2009):
Computational support for corpus analysis work flows: The case of integrating automatic and manual annotations.
In: Linguistic Processing Pipelines Workshop at GSCL 2009 - Book of Abstracts (electronic proceedings),
[Conference or Workshop Item]

Eckart de Castilho, Richard and Gurevych, Iryna (2009):
DKPro-UGD: A Flexible Data-Cleansing Approach to Processing User-Generated Discourse.
In: Online-proceedings of the First French-speaking meeting around the framework Apache UIMA,
Nantes, France, [Conference or Workshop Item]

Chiarcos, Christian and Eckart de Castilho, Richard and Stede, Manfred (eds.) (2009):
Von der Form zur Bedeutung: Texte automatisch verarbeiten/From Form to Meaning: Processing Texts Automatically.
Gunter Narr Verlag, Potsdam, Germany, ISBN 978-3823365112,
[Conference or Workshop Item]

2008

Schwarz, Lara and Bartsch, Sabine and Eckart de Castilho, Richard and Teich, Elke (2008):
Exploring Automatic Theme Identification: A Rule-Based Approach.
In: Text Resources and Lexical Knowledge. Selected Papers from the 9th Conference on Natural Language Processing KONVENS 2008, pp. 15-26,
Mouton de Gruyter, [Conference or Workshop Item]

This list was generated on Fri Apr 16 04:04:27 2021 CEST.