UKPConvArg1 Corpus from ACL 2016
The UKPConvArg1 Corpus is introduced in the following paper:
Habernal, I. & Gurevych, I. (2016). Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidirectional LSTM. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Pages: 1589-1599. Berlin, Germany. Association for Computational Linguistics.
- Data download: UKPConvArg1.tar.bz2 (13 MB)
- See readme.txt in the archive for data format description
- See the related source codes at GitHub: https://github.com/UKPLab/acl2016-convincing-arguments
- This is the full corpus as referred in the article (Table 2, UKPConvArgAll). It contains 32 xml files, each file corresponding to one debate/side. Total number of argument pairs is 16,081.
- Exported tab-delimited file with 1,052 arguments with their ID, rank score, and text (Table 2, UKPConvArgRank)
- Cleaned version used for experiments in the article (Table 2, UKPConvArgSctrict). It contains 11,650 argument pairs in 32 XML files.
- The same as UKPConvArg1Strict-XML but exported into tab-delimited CSV with ID, more convincing argument label (a1 or a2) and both arguments (a1, tab, a2)
- The data are licensed under CC-BY (Creative Commons Attribution 4.0 International License)
- The source arguments originate from
Ivan Habernal, firstname.lastname@example.org