EXPLOITING IMAGE–TEXT SYNERGY FOR CONTEXTUAL IMAGE CAPTIONING
Sreyasi Nag Chowdhury, Rajarshi Bhowmik, Hareesh Ravi, Gerard de Melo, Simon Razniewski and Gerhard Weikum
LARGE-SCALE ZERO-SHOT IMAGE CLASSIFICATION FROM RICH AND DIVERSE TEXTUAL DESCRIPTIONS
Sebastian Bujwid and Josephine Sullivan
REASONING OVER VISION AND LANGUAGE: EXPLORING THE BENEFITS OF SUPPLEMENTAL KNOWLEDGE
Violetta Shevchenko, Damien Teney, Anthony Dick and Anton van den Hengel
VISUAL GROUNDING STRATEGIES FOR TEXT-ONLY NATURAL LANGUAGE PROCESSING
Damien Sileo
WHAT DID THIS CASTLE LOOK LIKE BEFORE? EXPLORING REFERENTIAL RELATIONS IN NATURALLY OCCURRING MULTIMODAL TEXTS
Ronja Utescher and Sina Zarrieß