NORMA eResearch @NCI Library

Hybrid Machine Learning Approach Towards Anomaly Detection and Data Quality Assessment Of IoT Weather Data

Abhaykumar Kulkarni, Saket (2024) Hybrid Machine Learning Approach Towards Anomaly Detection and Data Quality Assessment Of IoT Weather Data. Masters thesis, Dublin, National College of Ireland.

[thumbnail of Master of Science]
Preview
PDF (Master of Science)
Download (1MB) | Preview
[thumbnail of Configuration Manual]
Preview
PDF (Configuration Manual)
Download (939kB) | Preview

Abstract

The growing rate of climate change requires high standards of analysis of weather data. This research focuses on climate science and its data quality assessment though clustering and anomaly detection using traditional machine learning with a combination of PCA for dimensional reduction. By using IoT weather data from almost 1200 cities around the world, the hybrid model is built to detect anomalies and classify data according to its quality. The final percentage of anomalies found in the data are between 14-15 % of the entire data with an ROC of 0.70 and the anomalies are visualised which shows clear separation between the values. The hybrid model performed better compared to individual models with evaluations metrics of Homogeneity: 0.702, Completeness: 0.527, V-Measure: 0.602, Adjusted Rand Index: 0.713. The following model is a strong base to prove that there is space for traditional machine learning in this field rather that heavy deep learning models which require much higher computational power. The pre-trained models perform well and are a the base for future development in clustering analysis of weather data.

Item Type: Thesis (Masters)
Supervisors:
Name
Email
Singh, Jaswinder
UNSPECIFIED
Uncontrolled Keywords: Machine Learning; Climate Science; Clustering; Anomalies; Data Quality
Subjects: G Geography. Anthropology. Recreation > GE Environmental Sciences
Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
T Technology > TK Electrical engineering. Electronics. Nuclear engineering > Telecommunications > Computer networks > Internet of things
Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Machine learning
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Ciara O'Brien
Date Deposited: 06 Aug 2025 14:40
Last Modified: 06 Aug 2025 14:40
URI: https://norma.ncirl.ie/id/eprint/8448

Actions (login required)

View Item View Item