GLASS is a German-language data set annotated with lexical substitutions and with word senses from GermaNet 9.0. It is an extended and corrected version of the lexical substitution data set used in the GermEval 2015: LexSub shared task.


The data set is released under the Creative Commons Attribution-ShareAlike 3.0 Unported licence.


Tristan Miller, Mohamed Khemakhem, Richard Eckart de Castilho, and Iryna Gurevych. Sense-annotating a lexical substitution data set with Ubyline. In Nicoletta Calzolari et al., editors, Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pages 828–835. European Language Resources Association, May 2016. ISBN 978-2-9517408-9-1.


Download the GLASS data set