Publications

Selected publications and research outputs.

VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning

Jingkun Ma, Runzhe Zhan, Yang Li, Di Sun, Hou Pong Chan, Lidia S. Chao, Derek F. Wong

ACL 2026NeurIPS 2025 Workshop@VLM4RWD (Best Paper Award)

A benchmark for visual-aided mathematical reasoning, accepted by ACL 2026 and also recognized with the Best Paper Award at the 1st VLM4RWD Workshop at NeurIPS 2025.

FIPO-Prover: Formalization-Oriented Informal Proof Optimization for Efficient Formal Theorem Proving

Jingkun Ma^*, Yuchao Wang^*, Yujia Huo, Derek F. Wong

ICML 2026 Workshop@AI4Math (Spotlight Award)

FIPO-Prover bridges the gap between informal reasoning and formal verification by optimizing an intermediate proof representation that is diagnosable, repairable, and guided by verifier feedback.

Idioms Understood, Yet Translated Literally: Diagnosing Literal Translation Bias in Multilingual LLMs

Yanming Sun, Runzhe Zhan, Shuo Wang, Jingkun Ma, Yu Chao, Lidia S. Chao, Derek F. Wong

arxiv

LLMs can recognize and interpret idioms yet still translate them literally, and we diagnose this literal translation bias through a cascaded framework and representation-level intervention.

Lean2Isabelle: Factorized Cross-Assistant Proof Translation

Guanyu Liu^*, Jingkun Ma^*, Derek F. Wong

ICML 2026 Workshop@AI4Math

Lean2Isabelle factorizes Lean-to-Isabelle proof translation into statement prediction and proof generation, showing that verifier-guided RL and Lean proof inputs improve semantically accepted cross-assistant proof migration.

From Local Success to Composite Failure: Diagnosing Local-to-Composite Transfer in Mathematical Reasoning

Jingkun Ma^*, Haoran Wang^*, Yang Li^‡, Rongsheng Zhang^‡, Derek F. Wong

arxiv

We introduce a paired diagnostic framework showing that mathematical reasoning models can solve local components yet fail when those components are bound into composed dependency structures.

Activate Integrated Controllable Generation with Soft Prompt

Jingkun Ma, Runzhe Zhan, Derek F. Wong, Lidia S. Chao

NLPCC 2024

A parameter-efficient soft-prompt method for controllable generation under complex multi-attribute settings.

DOI

Yu Sheng: Human-in-Loop Classical Chinese Poetry Generation System

Jingkun Ma, Runzhe Zhan, Derek F. Wong

EACL 2023 Demo

A human-in-the-loop system for Classical Chinese poetry generation with interactive customization and polishing.

DOI Code