Skip to content

RichieOnData/Image-Captioning-Using-CNN-and-LSTM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

Image Captioning Using CNN and LSTM

Caption generation is a challenging artificial intelligence problem where a textual description must be generated for a given photograph. It requires:

  • Computer Vision techniques to understand the content of the image.
  • A Language Model from the field of Natural Language Processing (NLP) to turn the understanding into words in the correct order.

Recently, deep learning methods have achieved state-of-the-art results in caption generation problems. The most impressive part of these methods is that a single end-to-end model can predict a caption for a given photo without requiring sophisticated data preparation or multiple specifically designed models.


Dataset


Model Architecture

Model Diagram

Model Diagram

Training Diagram

Training Process

Final Results

Successful Examples

Successful Example 1 Successful Example 2

Failure Examples

Failure Example 1 Failure Example 2

Image-Captioning-Using-CNN-and-LSTM

This repository implements a deep learning approach combining CNN and LSTM for generating image captions. The method leverages both visual and textual data to provide accurate, context-aware captions for images.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published