SUT at SemEval-2023 Task 1: Prompt Generation for Visual Word Sense Disambiguation

Omid Ghahroodi, Seyed Arshan Dalili, Sahel Mesforoush, Ehsaneddin Asgari

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

Visual Word Sense Disambiguation (V-WSD) identifies the correct visual sense of a multisense word in a specific context. This can be challenging as images may need to provide additional context and words may have multiple senses. A proper V-WSD system can benefit applications like image retrieval and captioning. This paper proposes a Prompt Generation approach to solve this challenge. This approach improves the robustness of language-image models like CLIP to contextual ambiguities and helps them better correlate between textual and visual contexts of different senses of words.

Original languageEnglish
Title of host publication17th International Workshop on Semantic Evaluation, SemEval 2023 - Proceedings of the Workshop
EditorsAtul Kr. Ojha, A. Seza Dogruoz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
PublisherAssociation for Computational Linguistics
Pages2160-2163
Number of pages4
ISBN (Electronic)9781959429999
Publication statusPublished - 2023
Externally publishedYes
Event17th International Workshop on Semantic Evaluation, SemEval 2023, co-located with the 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 - Hybrid, Toronto, Canada
Duration: 13 Jul 202314 Jul 2023

Publication series

Name17th International Workshop on Semantic Evaluation, SemEval 2023 - Proceedings of the Workshop

Conference

Conference17th International Workshop on Semantic Evaluation, SemEval 2023, co-located with the 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
Country/TerritoryCanada
CityHybrid, Toronto
Period13/07/2314/07/23

Fingerprint

Dive into the research topics of 'SUT at SemEval-2023 Task 1: Prompt Generation for Visual Word Sense Disambiguation'. Together they form a unique fingerprint.

Cite this