This project demonstrates how to build an image captioning model using TensorFlow. The model combines a pre-trained Convolutional Neural Network (CNN) for image feature extraction and a Long Short-Term Memory (LSTM) network for generating captions.
Image captioning is the task of generating a descriptive sentence for a given image. This project uses a CNN to extract features from an image and an LSTM to generate a corresponding caption.
- Python 3.6 or higher
- TensorFlow 2.x
- NumPy
- Pillow (PIL)
-
Clone this repository:
https://github.com/IcodeG00D/Image-Captioning-Using-CNN-and-LSTM.git
-
Install the required packages:
pip install tensorflow numpy pillow
- Place your image in the
images
folder. - Update the
image_path
andcaption
in the script as needed. - Run the script to train the model and generate captions.