项目作者: TanyaChutani

项目描述 :
Image Captioning Generator Keras
高级语言: Jupyter Notebook
项目地址: git://github.com/TanyaChutani/Image-Captioning-Generator.git
创建时间: 2020-04-12T23:35:52Z
项目社区:https://github.com/TanyaChutani/Image-Captioning-Generator

开源协议:

下载


ImageCaptionGenerator

Image Captioning Generator Keras

Data

Dataset - Flickr 8k Dataset



Flicker8k_Dataset

Flickr8k_text



Flicker8k_Dataset - Contains 8092 images in jpeg format.

Flickr8k_text - Each image contains 5 description.

Model

Built and trained a deep learning model for captioning real world image.

  • Used pre-trained InceptionV3 to extract feature from image.
  • Used pre-trained fasttext embedding, these were feed into a Stacked Bi-directional GRU layer.
  • They both were combined and predicted the next word till the end of caption using greedy search (during testing).

Result

Weights

To Do

  • Add attention
  • Use beam search instead of greddy seaech