Keecheril George Mathew, Nirmal (2024) Medical Visual Question Answering using Bootstrapping Language Image Pre-train model. Masters thesis, Dublin, National College of Ireland.
Preview |
PDF (Master of Science)
Download (2MB) | Preview |
Preview |
PDF (Configuration Manual)
Download (2MB) | Preview |
Abstract
Medical Visual Question Answering, or mVQA, is slowly revealing its applicability to the medical field, particularly to the enhancement of the prognosis and diagnostic features. As medical imaging is one of the diagnostic processes, it is necessary to consider how it is possible to decrease the time of analysis of images and give the results in a short time. Hence the current issue is on the particular processing of noisy medical data and the actual diagnostic output of the technology. Regarding these difficulties in mVQA, applied in this study will be the Bootstrapping Language Image Pre-Trained (BLIP) model. The study involved two key case studies: the first compared the ability of BLIP in identifying noisy medical data, for which the model achieved a validation accuracy of 51.68%. Stil moderate, this result shows that BLIP is quite proficient in dealing with complex data. The second case study was to enhance the training of the model by the track of loss values, and the validation loss decreased to 0. 0930 the final epoch. Each of the above periods can further be divided into smaller sub-periods based on general classifications of technological evolutions still used today, such as the following: Another such conclusion runs that BLIP could be beneficial, particularly in the context of medical diagnostics, for the next instances with the key channels of the image analysis enhanced as far as main steps, as well as with the greater general efficiency and accuracy of the final diagnostic conclusions. This work also shows the successful implementation of the proposed technique, BLIP, in mVQA and will be helpful for the future advancement of medical AI to contribute to the improvement of health care services.
Actions (login required)
![]() |
View Item |