Automated Image Captioning: Harnessing Machine Learning for Image Description Generation

Authors

  • Y RAVIKUMAR Associate Professor, Dept. of Computer Science Engineering,, A.M Reddy Memorial College of Engineering and Technology, Andhra Pradesh Author
  • A RAVI KIRAN, D. SUBBA RAO Assistant Professor, Dept. of Computer Science Engineering. A.M Reddy Memorial College of Engineering and Technology, Andhra Pradesh Author

Keywords:

Computer Vision, Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), Xception, Flicker 8K, LSTM, Preprocessing

Abstract

Advancements in computer vision have led to its widespread application across various domains.
This project focuses on a specific aspect of computer vision: image captioning. While generating descriptive
language for images remains a challenging task, recent research has made significant progress, particularly in
the realm of still images. Although earlier efforts primarily concentrated on video content, there has been a shift
towards enhancing image descriptions using natural language understandable to humans. Our project aims to
leverage convolutional neural networks (CNNs) and explore various hyperparameters using extensive datasets
such as Flickr8k and ResNet. By combining the outputs of these image classifiers with recurrent neural
networks (RNNs), we seek to generate accurate captions for images. This paper provides a comprehensive
overview of the architecture and methodology employed in our image captioning model.

Downloads

Published

2021-12-08

How to Cite

Automated Image Captioning: Harnessing Machine Learning for Image Description Generation. (2021). INTERNATIONAL JOURNAL OF MANAGEMENT RESEARCH AND REVIEW, 11(4). https://ijmrr.com/index.php/ijmrr/article/view/468