A novel network-based paragraph filtering technique for legal document similarity analysis

Artificial Intelligence and Law:1-23 (forthcoming)

Abstract

The common law system values precedent: previous court decisions guide the resolution of current cases. As more legal documents have become available in digital form, the sheer volume of data has made it harder for legal professionals to identify relevant past cases manually. To address this, researchers have developed automated systems that measure the similarity between legal documents. Our research explores various representations of a legal document and presents a novel paragraph filtering process that uses legal citation information to identify key paragraphs and remove unnecessary ones without distorting the core content of the document. We evaluate the proposed approach on document comparison using state-of-the-art techniques: TF-IDF, BERT, Legal-BERT, Doc2Vec, and Legal-Longformer. A model trained on the filtered paragraphs can achieve better results than a model trained on the complete text, while the filtering shortens the document by over 40%. The filtering strategy could therefore be helpful for models such as BERT, which have a fixed maximum token length.
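The general idea of filtering paragraphs before comparing documents can be illustrated with a minimal sketch. It is not the authors' network-based method, whose details the abstract does not give. As a stand-in it keeps any paragraph that contains a citation-like string and then compares the filtered documents with a plain TF-IDF cosine similarity. The pattern CITATION_RE and the helper names filter_paragraphs, tfidf_vectors, and cosine are assumptions introduced here for illustration only.

```python
import math
import re
from collections import Counter

# Hypothetical citation pattern (an assumption, not from the paper):
# matches "Party v. Party" style names or a bracketed/parenthesised year.
CITATION_RE = re.compile(r"\b\w+ v\.? \w+|\[\d{4}\]|\(\d{4}\)")


def filter_paragraphs(paragraphs):
    """Keep paragraphs containing at least one citation-like string.

    Falls back to the full list when nothing matches, so a document
    is never reduced to empty text.
    """
    kept = [p for p in paragraphs if CITATION_RE.search(p)]
    return kept or list(paragraphs)


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        vectors.append({
            term: (count / len(toks)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


if __name__ == "__main__":
    doc_a = [
        "The court in Smith v. Jones held that negligence requires a duty of care.",
        "The hearing was adjourned until Tuesday.",
    ]
    doc_b = [
        "Following Smith v. Jones, a duty of care was found to exist.",
        "Counsel thanked the registry staff.",
    ]
    # Join each document's kept paragraphs back into one string, then compare.
    filtered = [" ".join(filter_paragraphs(d)) for d in (doc_a, doc_b)]
    vec_a, vec_b = tfidf_vectors(filtered)
    print(f"similarity on filtered text: {cosine(vec_a, vec_b):.3f}")
```

In this toy input each document loses its procedural paragraph before comparison, which mirrors the abstract's point that shorter, filtered text fits more easily within a fixed token limit. A real pipeline would replace the regular expression with the citation-based selection described in the paper, and could swap the TF-IDF step for any of the embedding models the abstract names.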
