Xiangsen CHEN

I'm an MPhil student in Data Science and Analytics at HKUST(GZ) in Guangzhou, China. My supervisors are Prof. Nan Tang and Prof. Xuming Hu. I was also fortunate to work with Dr. Shuo Chen and Dr. Xuan Feng during my internship at Microsoft Research Asia. Before that, I received my Bachelor's degree from Huazhong University of Science and Technology (华中科技大学).

Email / CV / Bio / Scholar / Twitter / GitHub

Research

My research explores the frontiers of Large Language Models (LLMs), with a focus on building robust Retrieval-Augmented Generation (RAG) pipelines and intelligent agents that can act like human experts, such as threat researchers. I am passionate about applying this work as "AI for X," creating systems that serve as expert assistants to accelerate discovery in science and industry.

CyberThreat-Eval: Can Large Language Models Automate Real-World Threat Research?
Xiangsen Chen, Xuan Feng, Shuo Chen, Matthieu Maitre, Sudipto Rakshit, Diana Duvieilh, Ashley Picone, Nan Tang
TMLR, 2025

CRAG - Comprehensive RAG Benchmark
Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, Wen-tau Yih, Xin Luna Dong
KDD Cup, 2024

The CRAG benchmark laid the groundwork for a KDD Cup 2024 challenge and was submitted to NeurIPS 2024. The homepage of the competition is here.

Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability
Xiangsen Chen, Xuming Hu, Nan Tang
arXiv, 2024

You can get the source code of this page here. Thanks to Jon Barron for the wonderful template, and to Leonid Keselman for his Jekyll fork of this page.