Cross-genre argument mining: Can language models automatically fill in missing discourse markers?

Argument and Computation:1-41 (forthcoming)

Abstract

Available corpora for Argument Mining differ along several axes, and one of the key differences is the presence (or absence) of discourse markers to signal argumentative content. Effective ways of using discourse markers have received wide attention in various discourse parsing tasks, from which it is well known that discourse markers are strong indicators of discourse relations. To improve the robustness of Argument Mining systems across different genres, we propose to automatically augment a given text with discourse markers such that all relations are explicitly signaled. Our analysis reveals that popular language models used out of the box fail on this task; however, when fine-tuned on a new heterogeneous dataset that we construct (including synthetic and real examples), they perform considerably better. We evaluate the impact of our approach on a downstream Argument Mining task across different corpora. The results show that language models can be trained to automatically fill in discourse markers across corpora, improving the performance of a downstream model in some, but not all, cases. Our proposed approach can further be employed as an assistive tool for better discourse understanding.

Similar books and articles

Discourse Markers in Different Types of Reporting. Péter Furkó, András Kertész & Ágnes Abuczki - 2018 - In Alessandro Capone, Una Stojnic, Ernie Lepore, Denis Delfitto, Anne Reboul, Gaetano Fiorin, Kenneth A. Taylor, Jonathan Berg, Herbert L. Colston, Sanford C. Goldberg, Edoardo Lombardi Vallauri, Cliff Goddard, Anna Wierzbicka, Magdalena Sztencel, Sarah E. Duffy, Alessandra Falzone, Paola Pennisi, Péter Furkó, András Kertész, Ágnes Abuczki, Alessandra Giorgi, Sona Haroutyunian, Marina Folescu, Hiroko Itakura, John C. Wakefield, Hung Yuk Lee, Sumiyo Nishiguchi, Brian E. Butler, Douglas Robinson, Kobie van Krieken, José Sanders, Grazia Basile, Antonino Bucca, Edoardo Lombardi Vallauri & Kobie van Krieken (eds.), Indirect Reports and Pragmatics in the World Languages. Springer Verlag. pp. 243-276.
Genre analysis of the letters of appeal. Moses Samuel & Vahid Sadeghi - 2013 - Discourse Studies 15 (2):229-245.
Discourse markers in writing. Jean E. Fox Tree - 2015 - Discourse Studies 17 (1):64-82.
Epistemic Modality Constructions as Stable Idiolectal Features: A Cross-genre Study of Spanish. Andrea Mojedano Batel, Amparo Soler Bonafont & Krzysztof Kredens - 2024 - International Journal for the Semiotics of Law - Revue Internationale de Sémiotique Juridique 37 (2):595-621.

Analytics

Added to PP
2024-03-24
