UBY 0.6.0 released


A Large-Scale Unified Lexical-Semantic Resource Based on LMF

We are pleased to announce the release of UBY 0.6.0.

Project homepage: http://code.google.com/p/uby/


Highlights of UBY 0.6.0

  • Integration of new UBY lexicon OntoWiktionary (Meyer/Gurevych, 2012).
  • Major revision of the lexicon converters including.
    • Standardization of syntactic labels.
    • Standardization of external references.
  • Support for embedded H2 databases.
  • Support for using Maven-packaged H2 databases in a UIMA pipeline.
  • Numerous fixes and performance improvements.


Christian M. Meyer and Iryna Gurevych: OntoWiktionary — Constructing an Ontology from the Collaborative Online Dictionary Wiktionary, chapter 6 in M. T. Pazienza and A. Stellato (Eds.): Semi-Automatic Ontology Development: Processes and Resources, p. 131-161, Hershey, PA: IGI Global, February 2012.

Project summary

UBY is a linked lexical-semantic resource for natural language processing based on the ISO standard Lexical Markup Framework (LMF). It combines a wide range of information from ten expert-constructed and collaboratively constructed resources for English and German:

English WordNet, Wiktionary, Wikipedia, FrameNet, and VerbNet,

German Wikipedia, Wiktionary, GermaNet, and IMSLex-Subcat, as well as

multilingual OmegaWiki.

A subset of these resources is inter-linked at the word sense level. UBY is a valuable resources for many knowledge-based NLP applications. Get more information on the UBY website.

UBY is a research project at the Ubiquitous Knowledge Processing (UKP) Lab of the Technische Universität Darmstadt, Germany. The following people have mainly contributed to this project (in alphabetical order): Richard Eckart de Castilho, Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek, Christian M. Meyer, Tristan Miller, and Tri-Duc Nghiem.


A more detailed description of UBY is available in our scientific articles:

  • Iryna Gurevych, Judith Eckle-Kohler, Silvana Hartmann, Michael Matuschek, Christian M. Meyer, and Christian Wirth: UBY – A Large-Scale Unified Lexical-Semantic Resource Based on LMF, in: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL), p. 580-590, April 2012. Avignon, France.
  • Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek, and Christian M. Meyer: UBY-LMF – A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF, in: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC), p. 275-282, May 2012. Istanbul, Turkey.

Please cite our EACL paper if you use UBY in your work. Please cite the LREC paper if you use the lexicon model UBY-LMF.

License and Availability

The latest version of UBY is available from our Google code page:


Besides the lexicon access and creation code, you can find tutorials and precompiled UBY database dumps on this homepage.

UBY is available as open source software under the Apache License 2.0 (ASL) and the GNU General Public License (GPL). Please see LICENSE.txt for details.


Please direct any questions or suggestions to: uby-developers(a-t)googlegroups.com