NORMA eResearch @NCI Library

Deep learning apporach for image captioning in Hindi language

Rathi, Ankit (2020) Deep learning apporach for image captioning in Hindi language. In: 2020 International Conference on Computer, Electrical & Communication Engineering (ICCECE). IEEE, Kolkata, India, pp. 1-8. ISBN 978-172814475-7

Full text not available from this repository.
Official URL: https://doi.org/10.1109/ICCECE48148.2020.9223087

Abstract

Generating image description automatically from the content of an image is one of the fundamental problems in artificial intelligence. This task involves the knowledge of both computer vision and natural language processing, called “Image caption generation” Many researches have been carried out in this field, but it was mainly focused on generating image descriptions in English, as existing image caption datasets are mostly in English. However, the image captioning should not be restricted by language. The lack of image captioning dataset other than English is a problem, especially for a morphologically rich language such as Hindi. Thus, to tackle this problem, this research constructed Hindi image caption dataset based on images from Flickr8k dataset using Google cloud translator, which is called Flickr8k-Hindi Datasets The Flickr8k-Hindi Datasets consist of four datasets based on a number of descriptions per image and clean or unclean descriptions. This study also finds the best effective method to generate image description using the encoder-decoder neural network model. The experiments showed that training the model with a single clean description per image generates higher-quality caption than a model trained with five uncleaned descriptions per image. Although model trained with five uncleaned descriptions per image achieved BLEU-I score of 0.585, which is the current state of the art.

Item Type: Book Section
Uncontrolled Keywords: Training; Deep learning; Computer vision; Art; Neural networks; Predictive models; Internet; Image captioning; Hindi language; Monolingual learning; encoder-decoder framework
Subjects: P Language and Literature > PK Indo-Iranian
Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
Q Science > QH Natural history > QH301 Biology > Methods of research. Technique. Experimental biology > Data processing. Bioinformatics > Artificial intelligence
Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Artificial intelligence
P Language and Literature > P Philology. Linguistics > Computational linguistics. Natural language processing
Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Machine learning
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Tamara Malone
Date Deposited: 11 Aug 2023 08:09
Last Modified: 11 Aug 2023 08:09
URI: https://norma.ncirl.ie/id/eprint/6773

Actions (login required)

View Item View Item