I'm a second-year MPhil student in Data Science and Analytics at HKUST(GZ) in Guangzhou, China. My supervisor is Prof. Nan Tang and Prof. Xuming Hu. Before that, I got my Bachelor's degree at Huazhong University of Science and Technology(华中科技大学).
I am interested in Data Analytics over Data Lakes (especially tabular and text) using Large Language Models(LLMs). My research goal is to explore and develop novel retrieval methods for data lakes like textual and tabular data, and to evaluate and improve the performance of retrieval-augmented generation for data analytics tasks.
CRAG - Comprehensive RAG Benchmark
Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, Wen-tau Yih, Xin Luna Dong
KDD Cup, 2024
The CRAG benchmark laid the groundwork for a KDD Cup 2024 challenge and submitted to NeurIPS 2024. The homepage of the competition is here.