Qingyi Si (佀庆一)

About Me

I am currently an LLM researcher at JD Explore Academy (京东探索研究院), through the TGT Program, working with Jiaqi Wang, and Nan Duan, focusing on Multimodal Understanding, Post-training (RL & SFT) and Inference Efficiency.

Previously, I was with Huawei ICT BG as a member of TopMinds program (天才少年), and received my Ph.D. from the Institute of Information Engineering, Chinese Academy of Sciences in 2024, advised by Prof. Zheng Lin and Prof. Weiping Wang. I have published 30+ peer-reviewed papers in top-tier AI conferences including ACL, EMNLP, ICLR, NeurIPS, ICRA, IROS, AAAI, IJCAI, MM. My open-source projects have accumulated 3000+ stars on GitHub.

We're hiring interns for the TGT program! If you have top-tier publications and are passionate about video understanding and VLMs, we'd love to hear from you.

Selected Publications

* indicates co-first author, † indicates corresponding author. Full list available on Google Scholar.

LLM Test-time Optimization

Preprint System 1&2 Synergy via Dynamic Model Interpolation

Chenxu Yang*, Qingyi Si*, Chong Tian, Xiyu Liu, Dingyu Yao, Chuanyu Qin, Zheng Lin, Weiping Wang, Jiaqi Wang

Preprint

[PDF]

AAAI'26 Test-time Prompt Intervention

Chenxu Yang*, Qingyi Si*, Mz Dai*, Dingyu Yao, Mingyu Zheng, Minghui Chen, Zheng Lin, Weiping Wang

AAAI 2026

[PDF]

ICLR'26 Dynamic Early Exit in Reasoning Models

Chenxu Yang*, Qingyi Si*, Yongjie Duan, Zheliang Zhu, Chenyu Zhu, Qiaowei Li, Minghui Chen, Zheng Lin, Weiping Wang

ICLR 2026

[PDF] [Code]

Reinforcement Learning and Post-training

Preprint Outcome-Grounded Advantage Reshaping for Fine-Grained Credit Assignment in Mathematical Reasoning

Ziheng Li, Liu Kang, Feng Xiao, Luxi Xing, Qingyi Si, Zhuoran Li, Weikang Gong, Deqing Yang, Yanghua Xiao, Hongcheng Guo

Preprint

[PDF]

NeurIPS'25 S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models

Muzhi Dai*, Chenxu Yang*, Qingyi Si†

NeurIPS 2025

[PDF]

NeurIPS'25 workshop Stable Reinforcement Learning for Efficient Reasoning

Muzhi Dai*, Shixuan Liu*, Qingyi Si†

NeurIPS workshop 2025

[PDF]

ACL'24 Multimodal Table Understanding

Mingyu Zheng*, Xinwei Feng*, Qingyi Si*, Qiaoqiao She, Zheng Lin, Wenbin Jiang, Weiping Wang

ACL 2024

[PDF] [Code]

EMNLP'23 An Empirical Study of Instruction-tuning Large Language Models in Chinese

Qingyi Si*, Tong Wang*, Zheng Lin, Xu Zhang, Yanan Cao, Weiping Wang

EMNLP 2023

[PDF] [Code]

ACL'23 Combo of Thinking and Observing for Outside-Knowledge VQA

Qingyi Si, Yuchen Mo, Zheng Lin, Huishan Ji, Weiping Wang

ACL 2023

[PDF] [Code]

Long/Streaming Context and Sparse Attention

ICLR'26 LouisKV: Efficient KV Cache Retrieval for Long Input-Output Sequences

Wenbo Wu*, Qingyi Si*, Xiurui Pan, Ye Wang, Jie Zhang

ICLR 2026

[PDF]

AAAI'26 Sparse Attention across Multiple-context KV Cache

Ziyi Cao*, Qingyi Si*, Jingbin Zhang, Bingquan Liu

AAAI 2026

[PDF]

ACL'25 AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding

Xiao Wang*, Qingyi Si* Shiyu Zhu, Jianlong Wu, Li Cao, Liqiang Nie

ACL 2025

[PDF] [Code]

Open Source Projects

Unified Cache Manager (UCM) - UCM addresses the challenges of low efficiency and high costs in long-context inference through a multi-tier KV Cache and inference memory management system. [huawei]
Alpaca-CoT (2.8k+ stars) - A unified instruction-tuning platform for LLMs with chain-of-thought data integration. Used by 60+ models including Qwen. [GitHub]
DEER - Training-free efficient reasoning algorithms for early exit or intervention in reasoning models. [GitHub]
AdaReTaKe - Sparse attention for long video understanding, achieving 8x length extension. Ranked #1 on VideoMME, MLVU, LongVideoBench, LVBench leaderboards. [GitHub]

Qingyi Si (佀庆一)

About Me

Recent News

Selected Publications

LLM Test-time Optimization

Reinforcement Learning and Post-training

Long/Streaming Context and Sparse Attention

Open Source Projects

Academic Services

Honors & Awards