DKPro Core 1.7.0 Released
2014/11/28 by Pedro Santos, for the DKPro Core development team
We are pleased to announce the release of
DKPro Core, version 1.7.0 (ASL & GPL)
a collection of interoperable software components for natural language processing (NLP) based on the Apache UIMA framework.
code.google.com/p/dkpro-core-asl
code.google.com/p/dkpro-core-gpl
Analysis components
- hunpos – wrapper for , a HMM pos tagger including models for many languages; hunpos
- langdetect – wrapper for a language detection tool for java; language-detection,
- mallet – wrapper for topic modelling using ; MALLET
- textnormalizer – original components for text normalization, e.g. spelling correction, umlaut normalization, expressive lengthening normalization.
Data formats
- io.conll – support for CoNLL , 2000, 2002 and 2009 formats; 2012
- io.ditop – support for topic model visualization format; DiTop
- io.penntree – support for combined and chunked formats;
- io.tueppdz – support for format. TüPP-D/Z
Further highlights in this release include:
- Upgrade to Apache UIMA 2.6.0;
- Upgrade to version 2.7; LanguageTools
- Upgrade to version 1.8; MaltParser
- Upgrade to version 3.4.1; Stanford CoreNLP
- Support additional models: MaltParser, Bengali, Farsi; Polish
- Support additional models: MSTParser; Croatian
- Support additional models: OpenNLP; Spanish
- Support additional models: Spanish, English caseless, shift-reduce parser models. Stanford CoreNLP
A more detailed overview of the changes in this release can be found . here
When upgrading, please mind that you should not mix different versions of DKPro Core components in your projects – they may not be compatible with each other.