HBKU at TREC 2020: Conversational Multi-Stage Retrieval with Pseudo-Relevance Feedback

Haya Al-Thani, Bernard J. Jansen, Tamer Elsayed

Research output: Contribution to conferencePaperpeer-review

1 Citation (Scopus)

Abstract

Passage retrieval in a conversational context is extremely challenging due to limited data resources. Information seeking in a conversational setting may contain omissions, implied context, and topic shifts. TREC CAsT promotes research in this field by aiming to create a reusable dataset for open-domain conversational information seeking (CIS). The track achieves this goal by defining a passage retrieval task in a multi-turn conversation setting. Understanding conversation context and history is a key factor in this challenge. This solution addresses this challenge by implementing a multi-stage retrieval pipeline inspired by last year's winning algorithm. The first stage in this retrieval process is a historical query expansion step from last year's winning algorithm where context is extracted from historical queries in the conversation. The second stage is the addition of a pseudo-relevance feedback step where the query is expanded using top-k retrieved passages. Finally, a pre-trained BERT passage re-ranker is used. The solution performed better than the median results of other submitted runs with an NDCG@3 of 0.3127 for the best performing run.

Original languageEnglish
Publication statusPublished - 2020
Event29th Text REtrieval Conference, TREC 2020 - Virtual, Online, United States
Duration: 16 Nov 202020 Nov 2020

Conference

Conference29th Text REtrieval Conference, TREC 2020
Country/TerritoryUnited States
CityVirtual, Online
Period16/11/2020/11/20

Keywords

  • Conversational Information Seeking
  • Conversational Search Systems
  • Multi-Stage Retrieval Systems
  • Open-Domain

Fingerprint

Dive into the research topics of 'HBKU at TREC 2020: Conversational Multi-Stage Retrieval with Pseudo-Relevance Feedback'. Together they form a unique fingerprint.

Cite this