Document detail
ID

oai:arXiv.org:2406.03411

Topic
Computer Science - Computer Vision...
Author
Lee, Saehyung Yu, Sangwon Park, Junsung Yi, Jihun Yoon, Sungroh
Category

Computer Science

Year

2024

listing date

7/31/2024

Keywords
context interactive retrieval
Metrics

Abstract

In this paper, we primarily address the issue of dialogue-form context query within the interactive text-to-image retrieval task.

Our methodology, PlugIR, actively utilizes the general instruction-following capability of LLMs in two ways.

First, by reformulating the dialogue-form context, we eliminate the necessity of fine-tuning a retrieval model on existing visual dialogue data, thereby enabling the use of any arbitrary black-box model.

Second, we construct the LLM questioner to generate non-redundant questions about the attributes of the target image, based on the information of retrieval candidate images in the current context.

This approach mitigates the issues of noisiness and redundancy in the generated questions.

Beyond our methodology, we propose a novel evaluation metric, Best log Rank Integral (BRI), for a comprehensive assessment of the interactive retrieval system.

PlugIR demonstrates superior performance compared to both zero-shot and fine-tuned baselines in various benchmarks.

Additionally, the two methodologies comprising PlugIR can be flexibly applied together or separately in various situations.

Our codes are available at https://github.com/Saehyung-Lee/PlugIR.

;Comment: ACL 2024 Oral

Lee, Saehyung,Yu, Sangwon,Park, Junsung,Yi, Jihun,Yoon, Sungroh, 2024, Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach

Document

Open

Share

Source

Articles recommended by ES/IODE AI

High-Frequency Repetitive Magnetic Stimulation at the Sacrum Alleviates Chronic Constipation in Parkinson’s Patients
magnetic stimulation parkinson’s significant patients scale sacrum pd hf-rms chronic constipation scores
The mechanism of PFK-1 in the occurrence and development of bladder cancer by regulating ZEB1 lactylation
bladder cancer pfk-1 zeb1 lactylation glycolysis inhibits lactate glucose bc pfk-1 cancer lactylation cells bladder