Argument-annotated Product Reviews
This corpus consists of argument annotations (argument components and argumentative relations between them) for 982 Amazon product reviews from Julian McAuley's Amazon product data.
Details regarding the annotation scheme and the creation of the corpus can be found in the following paper, which we ask that you cite in the event that you use the data:
- Tristan Miller, Maria Sukhareva, and Iryna Gurevych.
A streamlined method for sourcing discourse-level argumentation annotations from the crowd.
In Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2019), volume 1, pages 1790–1796, June 2019. ISBN 978-1-950737-13-0.
The data is distributed as a set of XML Metadata Interchange (XMI) files containing stand-off UIMA annotations that cross-reference the original texts from McAuley. Because the original review texts are not available under a free licence, we do not include them in our distribution, but we provide a script for extracting them from the original corpus and merging them into our XMIs.
To obtain the data, the extract/merge script, associated software, the documentation, and the copyright/licensing information, please visit the project's GitHub repository or download the latest pre-bundled release archive.
Questions regarding the data, annotation guidelines, or software may be directed to Tristan Miller.