I am an undergraduate student starting from September 2022. My research focuses on Transfer learning and Large Language Model, especially in llm evaluation and post-training. Previously, I was a research intern at Tsinghua Shenzhen International Graduate School supervised by Zhi Wang.I’m also a research intern at M-A-P. I have served as a reviewer for top-tier conferences including ICLR and CVPR.

🔥 News

  • 2025.05:  🎉 1 paper accepted to ACL (main) 2025.
  • 2025.04:  🎉 1 paper accepted to IJCAI 2025.
  • 2025.03:  🎉 1 paper accepted to IJCNN 2025.
  • 2025.02:  🎉 1 paper accepted to ICASSP 2025.
  • 2025.02:  🎉 SuperGPQA is out! I devoted 8 mounths for this project and it’s promoted by 量子位 and 字节跳动Seed. Several models used SuperGPQA including Qwen-v3, seed-thinking-v1.5, Hunyuan-TurboS, MiMo, MiMo-VL, dots.llm1, pangu-MoE

📝 Publications

Arxiv 2025
KORGym

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
Jiajun Shi, Jian Yang, Jiaheng Liu, Xingyuan Bu, Jiangjie Chen, Junting Zhou, Kaijing Ma, Zhoufutu Wen, Bingli Wang, Yancheng He, Liang Song, Hualei Zhu, Shilong Li, Xingjian Wang, Wei Zhang, Ruibin Yuan, Yifan Yao, Wenjun Yang, Yunli Wang, Siyuan Fang, Siyu Yuan, Qianyu He, Xiangru Tang, Yingshui Tan, Wangchunshu Zhou, Zhaoxiang Zhang, Zhoujun Li, Wenhao Huang, Ge Zhang
under review
arXiv  |  Code  |  Project Page

Arxiv 2025
SuperGPQA

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
I’m one of the leading authors:).
M-A-P, ByteDance.Inc and 2077AI.
under review arXiv  |  Code  |  Project Page

ICASSP 2025
VBSA

Towards Fully Test-Time Adaptation via Variance Balancing and Semantic Augmentation
Houcheng Su*, Bingli Wang*, Daixian Liu, Jiao Li, Chen-Bin Feng, Chi Man Vong
ICASSP 2025
paper

arXiv
SMS

Singular Value Maximization and Suppression: Addressing Imbalanced & Indistinct Classes for Domain Generalization
Bingli Wang, Jiao Li, Daixian Liu, Houcheng Su …
IJCNN 2025
paper coming soon!

arXiv
CII-Bench

CII-Bench: Can MLLMs Understand the Deep Implication Behind Chinese Images?
Chenhao Zhang∗, Xi Feng∗, Yuelin Bai∗, Xinrun Du∗, Jinchang Hou, Kaixin Deng, Guangzeng Han, Qinrui Li, Bingli Wang, Jiaheng Liu, Xingwei Qu, Yifei Zhang, Qixuan Zhao, Yiming Liang, Ziqiang Liu, Feiteng Fang, Min Yang, Wenhao Huang, Chenghua Lin, Ge Zhang, Shiwen Ni
ACL 2025 main
arXiv  |  Code  |  Project Page

ICASSP 2025
ESBN

ESBN: Estimation Shift of Batch Normalization for Source-free Universal Domain Adaptation
Jiao Li, Houcheng Su, Bingli Wang, Yuandong Min, Mengzhu Wang, Nan Yin, Jincai Guo, Shanshan Wang
IJCAI 2025
paper coming soon!

Arxiv 2024
DiM

DiM: $f$-Divergence Minimization Guided Sharpness-Aware Optimization for Semi-supervised Medical Image Segmentation
Bingli Wang, Houcheng Su, Nan Yin, Mengzhu Wang, Li Shen
Arxiv 2024
arXiv

💼 Internships

  • 2024.4 - Present: Tsinghua University(Shenzhen), Transfer Learning Research Intern
  • 2024.8 - Present: M-A-P, LLM Research Intern

🎖 Honors and Awards

  • 2023.11 National Encouragement Scholarship