Abstract
Passage retrieval in a conversational context is extremely challenging due to limited data resources. Information seeking in a conversational setting may contain omissions, implied context, and topic shifts. TREC CAsT promotes research in this field by aiming to create a reusable dataset for open-domain conversational information seeking (CIS). The track achieves this goal by defining a passage retrieval task in a multi-turn conversation setting. Understanding conversation context and history is a key factor in this challenge. This solution addresses this challenge by implementing a multi-stage retrieval pipeline inspired by last year's winning algorithm. The first stage in this retrieval process is a historical query expansion step from last year's winning algorithm where context is extracted from historical queries in the conversation. The second stage is the addition of a pseudo-relevance feedback step where the query is expanded using top-k retrieved passages. Finally, a pre-trained BERT passage re-ranker is used. The solution performed better than the median results of other submitted runs with an NDCG@3 of 0.3127 for the best performing run.
Original language | English |
---|---|
Publication status | Published - 2020 |
Event | 29th Text REtrieval Conference, TREC 2020 - Virtual, Online, United States Duration: 16 Nov 2020 → 20 Nov 2020 |
Conference
Conference | 29th Text REtrieval Conference, TREC 2020 |
---|---|
Country/Territory | United States |
City | Virtual, Online |
Period | 16/11/20 → 20/11/20 |
Keywords
- Conversational Information Seeking
- Conversational Search Systems
- Multi-Stage Retrieval Systems
- Open-Domain