NORMA eResearch @NCI Library

Medical Assistant Utilizing Large Language Models with Retrieval Augmented Generation and Vector Search

Harikrishnan, Divyasree (2025) Medical Assistant Utilizing Large Language Models with Retrieval Augmented Generation and Vector Search. Masters thesis, Dublin, National College of Ireland.

[thumbnail of Master of Science]
Preview
PDF (Master of Science)
Download (948kB) | Preview
[thumbnail of Configuration Manual]
Preview
PDF (Configuration Manual)
Download (512kB) | Preview

Abstract

A trending strategy in the development of domain-specific assistants is to integrate the concepts of the Retrieval-Augmented Generation (RAG) with Large Language Models (LLMs) to build efficient high-quality assistants. The given research will develop a chatbot medical assistant offering credible data on health conditions, diseases, treatments, and preventative measures. The knowledge base is a pre-processed knowledge base based on “The Gale Encyclopedia of Medicine (Second Edition).” A custom set of domain-specific embeddings is produced with the PubMedBERT model and embedded representations are stored in the Qdrant cloud vector database, serving to allow rapid retrieval. The use of cosine similarity means that it uses the vector search which is provided to a RAG pipeline supported by two LLMs, LLaMA-3 70B through Groq API and Google MedGemma 4B through Hugging Face API. I compared the PubMedQA datasets to conduct an evaluation on their performance with metrics being ROUGE-L, BLEU scores, and response latency. The preliminary results show that the LLaMA-3 with the help of PubMedBERT embeds is considerably ahead of MedGemma, reaching the ROUGE-L score of 0.1741 and BLEU score of 0.0229 with lower latency. The chatbot was implemented on Streamlit framework which supports session-based memory to create continuity in conversation. The proposed research proves that the combination of LLMs, vector search, and RAG pipelines is effective to create medicalspecific assistants.

Item Type: Thesis (Masters)
Supervisors:
Name
Email
Thomas, Lavish
UNSPECIFIED
Uncontrolled Keywords: Medical Assistant; Large Language Models; Retrieval-Augmented Generation; Vector Search; PubMedBERT; LLaMA-3; MedGemma
Subjects: Q Science > QH Natural history > QH301 Biology > Methods of research. Technique. Experimental biology > Data processing. Bioinformatics > Artificial intelligence
Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Artificial intelligence
P Language and Literature > P Philology. Linguistics > Computational linguistics. Natural language processing
R Medicine > Healthcare Industry
Divisions: School of Computing > Master of Science in Artificial Intelligence
Depositing User: Ciara O'Brien
Date Deposited: 28 May 2026 14:08
Last Modified: 28 May 2026 14:08
URI: https://norma.ncirl.ie/id/eprint/9322

Actions (login required)

View Item View Item