Skip to main content

Research

Discover our latest research

Selected Papers

Representative research work from our team in the RAG domain.

2025.04ICLR 2025

VisRAG: Vision-based Retrieval-Augmented Generation on Multi-modality Documents

Shi Yu, Chaoyue Tang, et al.

Proposes a "vision-first" retrieval-augmented generation paradigm that fundamentally solves the information degradation problem of complex layout documents in traditional text parsing by converting documents directly into visual vectors for matching and generation.

Read More
2025.04ICLR 2025

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards

Xinze Li, Sen Mei, et al.

Proposes a new RAG optimization paradigm based on "differentiable data rewards", significantly improving the model's ability to extract core information from external knowledge and resolve knowledge conflicts through end-to-end reward alignment between retriever and generator.

Read More
2025.07ACL 2025

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework

Kunlun Zhu, Yifan Luo, et al.

Proposes a new paradigm for automated RAG evaluation benchmark construction, enabling efficient customization of evaluation datasets for specific vertical scenarios (such as finance, law, healthcare) through Schema-based knowledge distillation and document generation.

Read More
2025.11EMNLP 2025

DeepNote: Note-Centric Deep Retrieval-Augmented Generation

Ruobing Wang, et al.

Proposes a "note-centric" adaptive retrieval-augmented generation paradigm that significantly improves the model's depth and robustness in handling complex open-domain QA tasks by introducing an iterative knowledge accumulation mechanism.

Read More