Topic

RAG

Retrieval-augmented generation systems, evaluation, and retrieval-heavy workflows.

14 papers · latest 2026-04-23

Most active fields for this topic

NLP · 12 Reasoning & Agents · 2

HaS: Accelerating RAG through Homology-Aware Speculative Retrieval

Peng Peng, Weiwei Lin, Wentai Wu et al.

significant🔴 AdvancedNLP RAG

cs.IRcs.CLcs.IR

Proposes HaS, a speculative retrieval method that accelerates RAG systems by leveraging homology-aware caching, reducing latency without accuracy loss in large-scale knowledge retrieval.

Details → arXiv →

Beyond Explicit Refusals: Soft-Failure Attacks on Retrieval-Augmented Generation

Wentao Zhang, Yan Zhuang, ZhuHang Zheng et al.

breakthrough🔴 AdvancedNLP RAG

cs.CRcs.AIcs.CR

DEJA exposes stealthy RAG failures that mimic valid responses, forcing a paradigm shift in security evaluation—essential for deploying reliable RAG systems that must detect subtle, non-obvious degradation.

Details → arXiv →

CHOP: Chunkwise Context-Preserving Framework for RAG on Multi Documents

Hyunseok Park, Jihyeon Kim, Jongeun Kim et al.

breakthrough🟡 IntermediateNLP RAG

cs.CLcs.CL

CHOP reduces RAG hallucinations by iteratively chunking and reassembling documents with LLMs—directly improving factual accuracy in production systems without requiring retraining or new embeddings.

Details → arXiv →

SLQ: Bridging Modalities via Shared Latent Queries for Retrieval with Frozen MLLMs

Haoran Lou, Ziyan Liu, Chunxiao Fan et al.

breakthrough🔴 AdvancedNLP RAG LLM Reasoning

cs.CVcs.CV

SLQ enables retrieval with frozen MLLMs via shared latent queries—preserving pre-trained knowledge while avoiding costly fine-tuning, a game-changer for scalable, stable multimodal retrieval systems.

Details → arXiv →

MM-Doc-R1: Training Agents for Long Document Visual Question Answering through Multi-turn Reinforcement Learning

Jiahang Lin, Kai Hu, Binghai Wang et al.

breakthrough🔴 AdvancedReasoning & Agents AI Agents RAG

cs.CLcs.CL

Introduces a multi-turn RL agent for visual QA over long documents, enabling iterative retrieval and synthesis—transforming RAG from static lookup to dynamic reasoning for complex document systems.

Details → arXiv →

From Relevance to Authority: Authority-aware Generative Retrieval in Web Search Engines

Sunkyung Lee, Jihye Back, Donghyeon Jeon et al.

breakthrough🟡 IntermediateNLP RAG

cs.IRcs.CLcs.IR

Introduces authority-aware generation in retrieval, directly improving trustworthiness in high-stakes domains by biasing LLMs toward credible sources—not just relevance—enabling safer deployment in healthcare and finance.

Details → arXiv →

FRESCO: Benchmarking and Optimizing Re-rankers for Evolving Semantic Conflict in Retrieval-Augmented Generation

Sohyun An, Hayeon Lee, Shuibenyang Yuan et al.

breakthrough🔴 AdvancedNLP RAG

cs.IRcs.AIcs.IR

FRESCO introduces dynamic evaluation for RAG re-rankers under evolving data, exposing severe performance drops in static benchmarks. Builders must test re-rankers with temporal drift to ensure real-world reliability.

Details → arXiv →

MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents

Joongmin Shin, Chanjun Park, Jeongbae Park et al.

breakthrough🟡 IntermediateNLP RAG Multimodal Understanding

cs.AIcs.CLcs.AI

MultiDocFusion integrates vision and text to preserve structural context in long industrial documents, dramatically improving RAG accuracy—essential for enterprises relying on precise QA from complex PDFs, manuals, and reports.

Details → arXiv →

Retrieval as Generation: A Unified Framework with Self-Triggered Information Planning

Bo Li, Mingda Wang, Gexiang Fang et al.

significant🔴 AdvancedNLP RAG

cs.CLcs.AIcs.CL

GRIP turns retrieval into a native decoding action so the model can decide when to search, rewrite queries, and stop inside one reasoning trace instead of bolting on a controller.

Details → arXiv →

Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo

Artem Gadzhiev, Andrew Kislov

significant🟡 IntermediateNLP RAG

cs.CLcs.AIcs.LG

Synthius-Mem replaces retrieval-heavy agent memory with structured persona memory, improving both long-term recall and adversarial robustness against invented facts.

Details → arXiv →

RecaLLM: Addressing the Lost-in-Thought Phenomenon with Explicit In-Context Retrieval

Kyle Whitecross, Negin Rahimi

significant🔴 AdvancedNLP RAG

cs.CLcs.AIcs.IR

RecaLLM tackles the lost-in-thought problem by interleaving reasoning with explicit in-context retrieval, giving long-context models a practical way to stay grounded at up to 128K tokens.

Details → arXiv →

VISOR: Agentic Visual Retrieval-Augmented Generation via Iterative Search and Over-horizon Reasoning

Yucheng Shen, Jiulong Wu, Jizhou Huang et al.

significant🔴 AdvancedReasoning & Agents RAG AI Agents

cs.CVcs.AIcs.CV

VISOR pushes visual RAG toward real agent behavior with iterative search, evidence-space tracking, and drift control for long-horizon multimodal question answering over documents.

Details → arXiv →

BRIDGE: Multimodal-to-Text Retrieval via Reinforcement-Learned Query Alignment

Mohamed Darwish Mounis, Mohamed Mahmoud, Shaimaa Sedek et al.

significant🟡 IntermediateNLP RAG Alignment & Safety

cs.IRcs.CVcs.IR

Shows multimodal retrieval is often a query-alignment problem, not an encoder problem, and beats strong baselines by rewriting image-text queries into retrieval-optimized text.

Details → arXiv →

A Systematic Study of Retrieval Pipeline Design for Retrieval-Augmented Medical Question Answering

Nusrat Sultana, Abdullah Muhammad Moosa, Kazi Afzalur Rahman et al.

incremental🟡 IntermediateNLP RAG

cs.CLcs.AIcs.LG

A careful 40-setting RAG study shows dense retrieval, query reformulation, and reranking matter more than many heavyweight choices, offering practical tuning guidance that extends beyond medical QA.

Details → arXiv →