NORMA eResearch @NCI Library

Enhancing Risk Assessment in Legal Documents through Advanced Machine Learning

Pusarla, Sai Teja (2024) Enhancing Risk Assessment in Legal Documents through Advanced Machine Learning. Masters thesis, Dublin, National College of Ireland.

[thumbnail of Master of Science]
Preview
PDF (Master of Science)
Download (1MB) | Preview
[thumbnail of Configuration Manual]
Preview
PDF (Configuration Manual)
Download (1MB) | Preview

Abstract

In this thesis, we will conduct research to examine the use of BERT, Legal-BERT and Isolation Forest models to strengthen the risk assessment and anomaly detection of these legal documents. It shows that Legal-BERT, a BERT model pretrained on legal texts can outperform a general BERT model well beyond chance level with a test accuracy of 76.47%, and excellent specificity in precision, recall and F1 score measures. In addition, the Isolation Forest Algorithm, an unsupervised learning model, was able to find 975 anomalies among 19,501 clauses, and we can say that it is able to find deviants from normal threads of law. Most notably, real practice was the proving ground for these models, as they were tested in real-time on new legal texts and their applicability was confirmed. These findings support the opportunity advanced machine learning models provide for automation, increased accuracy, and scale in the analysis of legal documents and constitute a meaningful step toward automated risk assessments applied to real-world legal scenarios within the field of legal informatics.

Item Type: Thesis (Masters)
Supervisors:
Name
Email
Basilio, Jorge
UNSPECIFIED
Uncontrolled Keywords: BERT; Legal BERT; Clause; Risky; Not Risky; Isolation Forest; Anomaly
Subjects: H Social Sciences > HD Industries. Land use. Labor > HD61 Risk Management
K Law > K Law (General)
Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Machine learning
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Ciara O'Brien
Date Deposited: 04 Sep 2025 10:46
Last Modified: 04 Sep 2025 10:46
URI: https://norma.ncirl.ie/id/eprint/8778

Actions (login required)

View Item View Item