About Me
I am a PhD student at the National University of Singapore, fortunate to be supervised by Prof. HUANG Zhiyong. My research is primarily focused on multimodality, autonomous agents, and code generation.
Previously, I was a research intern at Qwen, supervised by Junyang Lin and Binyuan Hui, where I contributed to building the Qwen model series, including Qwen-VL, Qwen-3.5, and Qwen-Coder. I have also had the great opportunity to work with Prof. Tat-Seng Chua, Prof. Junxian He, and Prof. Michael Qizhe Shieh.
Recent News
- Apr 2026: One paper accepted to ICML 2026.
- Feb 2026: Released Qwen-3.5 and Qwen 3 Coder Next.
- Jan 2026: Three papers accepted to ICLR 2026.
- Sep 2025: Released Qwen3-VL.
- Sep 2025: One paper (SE-GUI) accepted to NeurIPS 2025.
- Aug 2025: Two papers accepted to EMNLP 2025.
- Jul 2025: ScreenSpot-Pro accepted to ACMMM 2025 as an Oral Presentation.
- May 2025: Three papers accepted to ACL 2025.
Selected Publications
Agents & GUI Interaction
ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use (Oral Presentation)
ACMMM 2025 · ICLR 2025 Workshop on Reasoning and Planning for LLMs
Used by Google Gemini, GPT-5.2, Qwen-VL Series, Microsoft OmniParser, and Meta's Muse Spark.
65,000+ dataset downloads.
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning
NeurIPS 2025
HackWorld: Evaluating Computer-Use Agents on Exploiting Web Application Vulnerabilities
ICLR 2026
Robi Butler: Remote Multimodal Interactions with Household Robot Assistant
ICRA 2025
Coding & Code Intelligence
MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems
EMNLP 2024 Findings · ICLR 2025 Workshop on Reasoning and Planning for LLMs
Used by Qwen2.5-VL and Qwen3-VL.
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
arXiv
Tree-of-Evolution: Tree-Structured Instruction Evolution for Code Generation in Large Language Models
ACL 2025
MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation
arXiv preprint (2502.12468)
Open Source
Services
Peer Reviewer for:
- ICLR (2024, 2025, 2026)
- CVPR (2025)
- ECCV (2026)
- ACL (2025)
- EMNLP (2025)
- AAAI (2025)
- ACMMM (2025)