Chitta, Subrahmanyasarma, Shashi Thota, Sai Manoj Yellepeddi, Amit Kumar Reddy, and Ashok Kumar Pamidi Venkata. 2020. “Multimodal Deep Learning: Integrating Vision and Language for Real-World Applications”. Asian Journal of Multidisciplinary Research & Review 1 (2): 262-82. https://ajmrr.org/journal/article/view/211.