1.
A Robust Two-Stage Retrieval-Augmented Vision-Language Framework for Knowledge-Intensive Multimodal Reasoning and Alignment. CDIS [Internet]. 2026 Feb. 5 [cited 2026 Feb. 22];2(2):42-5. Available from: https://pub.scientificirg.com/index.php/CDIS/article/view/40