Unfair clause detection in terms of service across multiple languages

Artificial Intelligence and Law (forthcoming)
  Copy   BIBTEX

Abstract

Most of the existing natural language processing systems for legal texts are developed for the English language. Nevertheless, there are several application domains where multiple versions of the same documents are provided in different languages, especially inside the European Union. One notable example is given by Terms of Service (ToS). In this paper, we compare different approaches to the task of detecting potential unfair clauses in ToS across multiple languages. In particular, after developing an annotated corpus and a machine learning classifier for English, we consider and compare several strategies to extend the system to other languages: building a novel corpus and training a novel machine learning system for each language, from scratch; projecting annotations across documents in different languages, to avoid the creation of novel corpora; translating training documents while keeping the original annotations; translating queries at prediction time and relying on the English system only. An extended experimental evaluation conducted on a large, original dataset indicates that the time-consuming task of re-building a novel annotated corpus for each language can often be avoided with no significant degradation in terms of performance.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 93,296

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Rights Metaphors Across Hybrid Legal Languages, Such as Euro English and Legal Chinese.Michele Mannoni - 2021 - International Journal for the Semiotics of Law - Revue Internationale de Sémiotique Juridique 34 (5):1375-1399.
Querying linguistic trees.Catherine Lai & Steven Bird - 2010 - Journal of Logic, Language and Information 19 (1):53-73.

Analytics

Added to PP
2024-04-05

Downloads
6 (#1,485,580)

6 months
6 (#587,658)

Historical graph of downloads
How can I increase my downloads?