My research explores the frontiers of Large Language Models (LLMs), with a focus on building robust Retrieval-Augmented Generation (RAG) pipelines and intelligent Agents that can act like human-experts. I am passionate about applying this work as "AI for X," creating systems that serve as expert assistants to accelerate discovery in science and industry.
CRAG - Comprehensive RAG Benchmark
Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, Wen-tau Yih, Xin Luna Dong
KDD Cup, 2024
The CRAG benchmark laid the groundwork for a KDD Cup 2024 challenge and submitted to NeurIPS 2024. The homepage of the competition is here.