NORMA eResearch @NCI Library

Benchmarking Hive on Spark and SQL Server with the Real Time Data Warehousing Chain

Bassett, Ian (2016) Benchmarking Hive on Spark and SQL Server with the Real Time Data Warehousing Chain. Masters thesis, Dublin, National College of Ireland.

[thumbnail of Master of Science]
Preview
PDF (Master of Science)
Download (759kB) | Preview
[thumbnail of Configuration File]
Preview
PDF (Configuration File)
Download (1MB) | Preview

Abstract

The following paper focuses on the field of Data Warehousing in two aspects. The first aspect will review Big Data performance comparing the emerging Hive on Apache Spark with SQL Server to determine when it would be appropriate to switch to a big data platform. The other aspect will investigate current software in the industry and how the continuous support of communities are creating to solve current and future barriers in the profession. A current issue in Data Warehousing and Business Intelligence is the development of Real Time Data Warehousing. This paper documents the research and progress of tools in the automation process of Real Time Data Warehousing.

Item Type: Thesis (Masters)
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
Q Science > QA Mathematics > Computer software
T Technology > T Technology (General) > Information Technology > Computer software
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Caoimhe Ní Mhaicín
Date Deposited: 03 Dec 2016 12:00
Last Modified: 03 Dec 2016 12:00
URI: https://norma.ncirl.ie/id/eprint/2491

Actions (login required)

View Item View Item