Abstract
Automatic caption generation of images has gained significant interest. It gives rise to a lot of interesting image-related applications.
For example, it could help in image/video retrieval and management of vast amount of multimedia data available on the Internet. It
could also help in development of tools that can aid visually impaired individuals in accessing multimedia content. In this paper, we
particularly focus on news images and propose a methodology for automatically generating captions for news paper articles consisting
of a text paragraph and an image. We propose several deep neural network architectures built upon Recurrent Neural Networks. Results
on a BBC News dataset show that our proposed approach outperforms a traditional method based on Latent Dirichlet Allocation using
both automatic evaluation based on BLEU scores and human evaluation.
For example, it could help in image/video retrieval and management of vast amount of multimedia data available on the Internet. It
could also help in development of tools that can aid visually impaired individuals in accessing multimedia content. In this paper, we
particularly focus on news images and propose a methodology for automatically generating captions for news paper articles consisting
of a text paragraph and an image. We propose several deep neural network architectures built upon Recurrent Neural Networks. Results
on a BBC News dataset show that our proposed approach outperforms a traditional method based on Latent Dirichlet Allocation using
both automatic evaluation based on BLEU scores and human evaluation.
Original language | English |
---|---|
Title of host publication | The 11th International Conference on Language Resources and Evaluation (LREC) |
Publication status | Published - 1 Jan 2019 |
Event | The 11th International Conference on Language Resources and Evaluation (LREC) - Miyazaki, Japan Duration: 7 May 2018 → 12 May 2018 http://lrec2018.lrec-conf.org/en/ |
Conference
Conference | The 11th International Conference on Language Resources and Evaluation (LREC) |
---|---|
Country/Territory | Japan |
City | Miyazaki |
Period | 7/05/18 → 12/05/18 |
Internet address |
Bibliographical note
The LREC 2018 Proceedings are licensed under a Creative Commons Attribution-NonCommercial 4.0 International LicenseKeywords
- Recurrent Neural Networks
- Image caption generation
- Deep learning
- Order Embedding