Automated Image Captioning: Harnessing Machine Learning for Image Description Generation

Y RAVIKUMAR; A RAVI KIRAN, D. SUBBA RAO

Authors

Y RAVIKUMAR Associate Professor, Dept. of Computer Science Engineering,, A.M Reddy Memorial College of Engineering and Technology, Andhra Pradesh Author
A RAVI KIRAN, D. SUBBA RAO Assistant Professor, Dept. of Computer Science Engineering. A.M Reddy Memorial College of Engineering and Technology, Andhra Pradesh Author

Keywords:

Computer Vision, Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), Xception, Flicker 8K, LSTM, Preprocessing

Abstract

Advancements in computer vision have led to its widespread application across various domains.
This project focuses on a specific aspect of computer vision: image captioning. While generating descriptive
language for images remains a challenging task, recent research has made significant progress, particularly in
the realm of still images. Although earlier efforts primarily concentrated on video content, there has been a shift
towards enhancing image descriptions using natural language understandable to humans. Our project aims to
leverage convolutional neural networks (CNNs) and explore various hyperparameters using extensive datasets
such as Flickr8k and ResNet. By combining the outputs of these image classifiers with recurrent neural
networks (RNNs), we seek to generate accurate captions for images. This paper provides a comprehensive
overview of the architecture and methodology employed in our image captioning model.

Automated Image Captioning: Harnessing Machine Learning for Image Description Generation

Authors

Keywords:

Abstract

Downloads

Published

Issue

Section

How to Cite

Make a Submission

Call For Paper

Menu

International Journal of Management Research and Review

IJMRR Visitors

Information