Deep Learning Models for Generating Text Summaries from Videos
In recent years, there has been a growing interest in bridging different modalities in the deep learning community. This project aims at developing deep learning models for automatically generating text summaries from videos. Through this project, we expect to find better ways to integrate textual and visual modalities into an end-to-end model. Apart from that, text summarization techniques, including abstractive text summarization and extractive text summarization, are explored. To better evaluate our models, alternative ways for the evaluation of text summaries are considered as well.