1.
A Robust Two-Stage Retrieval-Augmented Vision-Language Framework for Knowledge-Intensive Multimodal Reasoning and Alignment. CDIS. 2026;2(2):42-52. Accessed February 22, 2026. https://pub.scientificirg.com/index.php/CDIS/article/view/40