A Robust Two-Stage Retrieval-Augmented Vision-Language Framework for Knowledge-Intensive Multimodal Reasoning and Alignment. (2026). Computational Discovery and Intelligent Systems (CDIS), 2(2), 42-52. https://doi.org/10.66279/2da0zk02