How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning?

Published in EMNLP, 2024