Harikrishnan, Divyasree (2025) Medical Assistant Utilizing Large Language Models with Retrieval Augmented Generation and Vector Search. Masters thesis, Dublin, National College of Ireland.
Preview |
PDF (Master of Science)
Download (948kB) | Preview |
Preview |
PDF (Configuration Manual)
Download (512kB) | Preview |
Abstract
A trending strategy in the development of domain-specific assistants is to integrate the concepts of the Retrieval-Augmented Generation (RAG) with Large Language Models (LLMs) to build efficient high-quality assistants. The given research will develop a chatbot medical assistant offering credible data on health conditions, diseases, treatments, and preventative measures. The knowledge base is a pre-processed knowledge base based on “The Gale Encyclopedia of Medicine (Second Edition).” A custom set of domain-specific embeddings is produced with the PubMedBERT model and embedded representations are stored in the Qdrant cloud vector database, serving to allow rapid retrieval. The use of cosine similarity means that it uses the vector search which is provided to a RAG pipeline supported by two LLMs, LLaMA-3 70B through Groq API and Google MedGemma 4B through Hugging Face API. I compared the PubMedQA datasets to conduct an evaluation on their performance with metrics being ROUGE-L, BLEU scores, and response latency. The preliminary results show that the LLaMA-3 with the help of PubMedBERT embeds is considerably ahead of MedGemma, reaching the ROUGE-L score of 0.1741 and BLEU score of 0.0229 with lower latency. The chatbot was implemented on Streamlit framework which supports session-based memory to create continuity in conversation. The proposed research proves that the combination of LLMs, vector search, and RAG pipelines is effective to create medicalspecific assistants.
| Item Type: | Thesis (Masters) |
|---|---|
| Supervisors: | Name Email Thomas, Lavish UNSPECIFIED |
| Uncontrolled Keywords: | Medical Assistant; Large Language Models; Retrieval-Augmented Generation; Vector Search; PubMedBERT; LLaMA-3; MedGemma |
| Subjects: | Q Science > QH Natural history > QH301 Biology > Methods of research. Technique. Experimental biology > Data processing. Bioinformatics > Artificial intelligence Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Artificial intelligence P Language and Literature > P Philology. Linguistics > Computational linguistics. Natural language processing R Medicine > Healthcare Industry |
| Divisions: | School of Computing > Master of Science in Artificial Intelligence |
| Depositing User: | Ciara O'Brien |
| Date Deposited: | 28 May 2026 14:08 |
| Last Modified: | 28 May 2026 14:08 |
| URI: | https://norma.ncirl.ie/id/eprint/9322 |
Actions (login required)
![]() |
View Item |
Tools
Tools