Pengfei Cao - HomePage(曹鹏飞@CASIA)

Pengfei Cao / 曹鹏飞

Assistant Professor at Institute of Automation, Chinese Academy of Sciences

Intelligence Building, Zhongguancun East Road 95#, Beijing, China.

pengfei.cao@nlpr.ia.ac.cn

Biography

I am currently an Assistant Professor at Institute of Automation, Chinese Academy of Sciences. Before that, I received my Ph.D. degree from Institute of Automation, Chinese Academy of Sciences (CASIA) in 2023, under the supervision of Prof. Jun Zhao. My research interests include: 1) Interpretability of LLMs: Analysis of the Activation and Emergence Mechanism of Knowledge and Ability. 2) Enhancement of LLMs: Enhancement of Knowledge, Reasoning and Planning Abilities. I have published over 50 papers in top-tier conferences such as NeurIPS, AAAI, ICLR, ACL, EMNLP, NAACL, COLING, CIKM and so on.
Looking for collaborators to do meaningful research. If interested please feel free to contact with me!

News

课题组招收实习生，欢迎有意向的同学与我联系（长期有效）.
[2026/02] One paper was accepted to Artificial Intelligence.
[2026/01] Four papers were accepted to ICLR 2026.
The evaluation on Abductive Event Reasoning we are organizing at SemEval-2026 is currently in full swing！
[2025/11] One paper was accepted to AAAI 2026.
[2025/09] One paper was accepted to Scientific Data.
[2025/09] One paper was accepted to EMNLP 2025.
[2025/05] Nine paper were accepted to ACL 2025.
[2025/01] Two paper were accepted to ICLR 2025.
[2025/01] Two paper were accepted to NAACL 2025.
[2024/12] Three paper were accepted to AAAI 2025.
[2024/12] One paper was accepted to NeurIPS 2024.
[2024/09] Four papers were accepted to EMNLP 2024.
[2024/05] Four papers were accepted to ACL 2024.
[2023/12] Four papers were accepted to LREC-COLING 2024.
[2023/12] One paper was accepted to AAAI 2024.
[2023/11] Seven papers were accepted to EMNLP 2023.
[2023/07] Accepted to serve as ACL ARR Area Chair.
[2023/07] Accepted to serve as PC member for AAAI 2024.
[2022/11] One long paper was accepted to AAAI 2023.
[2022/10] One main conference paper and one demo paper were accepted to EMNLP 2022.
[2021/08] Two main conference papers were accepted to EMNLP 2021.
[2021/05] Three main conference papers and one findings paper were accepted to ACL 2021.

Publications

# denotes equal contribution

2026

One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Pengfei Cao, Yuheng Chen, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao.
Artificial Intelligence, 2026

Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences
Zhuoran Jin, Hongbang Yuan, Kejian Zhu, Jiachun Li, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The Fourteenth International Conference on Learning Representations (ICLR 2026, Oral)

Fixing the Broken Compass: Diagnosing and Improving Inference-Time Reward Modeling
Jiachun Li, Pengfei Cao, Zhuoran Jin, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao.
The Fourteenth International Conference on Learning Representations (ICLR 2026)

MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning
Jiachun Li, Shaoping Huang, Zhuoran Jin, Chenlong Zhang, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The Fourteenth International Conference on Learning Representations (ICLR 2026)

MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos
Kejian Zhu, Zhuoran Jin, Hongbang Yuan, Jiachun Li, Shangqing Tu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The Fourteenth International Conference on Learning Representations (ICLR 2026)

Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning
Sirui Liang, Pengfei Cao, Jian Zhao, Cong Huang, Jun Zhao, Kang Liu.
The 40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026)

2025

A Multimodal Depression Consultation Dataset of Speech and Text with HAMD-17 Assessments
Pengfei Cao, Yuanzhe Zhang, Chenxiang Zhang, Wei Chen, Yan Liu, Shuang Xu, Miao Xu, Wenqing Jin, Jinjie Xu, Dan Wang, Wei Wang, Xue Wang, Wen Wang, Yanping Ren, Jun Zhao, Rena Li, Kang Liu.
Scientific Data

M2Edit: Locate and Edit Multi-Granularity Knowledge in Multimodal Large Language Model
Yang Zhou, Pengfei Cao, Yubo Chen, Qingbin Liu, Dianbo Sui, Xi Chen, Kang Liu, Jun Zhao.
The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025)

Know-MRI: A Knowledge Mechanisms Revealer and Interpreter for Large Language Models
Jiaxiang Liu, Boxuan Xing, Chenhao Yuan, ChenxiangZhang, Di Wu, Xiusheng Huang, Haida Yu, Chuhan Lang, Pengfei Cao, Jun Zhao, Kang Liu.
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025): system demonstration

The Knowledge Microscope: Features as Better Analytical Lenses than Neurons
Yuheng Chen, Pengfei Cao, Kang Liu, Jun Zhao.
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

Cracking Factual Knowledge: A Comprehensive Analysis of Degenerate Knowledge Neurons in Large Language Models
Yuheng Chen, Pengfei Cao, Yubo Chen, Yining Wang, Shengping Liu, Kang Liu, Jun Zhao.
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

Revealing the Deceptiveness of Knowledge Editing: A Mechanistic Analysis of Superficial Editing
Jiakuan Xie, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

Evaluating Personalized Tool-Augmented LLMs from the Perspectives of Personalization and Proactivity
Yupu Hao, Pengfei Cao, Zhuoran Jin, Huanxuan Liao, Yubo Chen, Kang Liu, Jun Zhao.
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns
Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao.
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025, SAC Highlights)

Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents
Tianyi Men, Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

Towards Better Chain-of-Thought: A Reflection on Effectiveness and Faithfulness
Jiachun Li, Pengfei Cao, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao.
The Findings of 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Zhuoran Jin, Hongbang Yuan, Tianyi Men, Pengfei Cao, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao.
The Findings of 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

Knowledge Localization: Mission Not Accomplished? Enter Query Localization!
Yuheng Chen, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The Thirteenth International Conference on Learning Representations (ICLR 2025, Spotlight)

MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Jiachun Li, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao.
The Thirteenth International Conference on Learning Representations (ICLR 2025)

Beyond Under-Alignment: Atomic Preference Enhanced Factuality Tuning for Large Language Models
Hongbang Yuan, Yubo Chen, Pengfei Cao, Zhuoran Jin, Kang Liu.
The Findings of 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025)

DTELS: Towards Dynamic Granularity of Timeline Summarization
Chenlong Zhang, Tong Zhou, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao.
The 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025)

Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models
Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025, Oral)

CITI: Enhancing Tool Utilizing Ability in Large Language Models without Sacrificing General Performance
Yupu Hao, Pengfei Cao, Zhuoran Jin, Huanxuan Liao, Yubo Chen, Kang Liu, Jun Zhao.
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025)

Towards Robust Knowledge Unlearning: An Adversarial Framework for Assessing and Improving Unlearning Robustness in Large Language Models
Hongbang Yuan, Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025)

2024

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models
Zhuoran Jin, Pengfei Cao, Chenhao Wang, Zhitao He, Hongbang Yuan, Jiachun Li, Yubo Chen, Kang Liu, Jun Zhao.
The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS 2024)

Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models
Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao.
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models
Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao.
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning
Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Jun Zhao.
The Findings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation
Zhitao He, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Jiexin Xu, Huaijun Li, Kang Liu, Jun Zhao.
The Findings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning
Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao.
The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

MULFE: A Multi-Level Benchmark for Free Text Model Editing
Chenhao Wang, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao.
The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models
Zhuoran Jin, Pengfei Cao, Hongbang Yuan, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao.
The Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge Editing
Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

Oasis: Data Curation and Assessment System for Pretraining of Large Language Models
Tong Zhou, Yubo Chen, Pengfei Cao, Kang Liu, Jun Zhao, Shengping Liu.
The 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024): system demonstrations

ZhuJiu-Knowledge: A Fairer Platform for Evaluating Multiple Knowledge Types in Large Language Models
Pengfan Du, Sirui Liang, Baoli Zhang, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024): system demonstrations

Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Qiuxia Li, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering
Chenhao Wang, Pengfei Cao, Jiachun Li, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Qiuxia Li, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning
Zhitao He, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Continual Few-shot Event Detection via Hierarchical Augmentation Networks
Chenlong Zhang#, Pengfei Cao#, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Information Bottleneck based Knowledge Selection for Commonsense Reasoning
Zhao Yang, Yuanzhe Zhang, Pengfei Cao, Cao Liu, Jiansong Chen, Jun Zhao, Kang Liu.
Information Sciences, 2024

Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons
Yuheng Chen#, Pengfei Cao#, Yubo Chen, Kang Liu, Jun Zhao.
The 2024 AAAI Conference on Artifcial Intelligence (AAAI 2024)

2023

Event Ontology Completion with Hierarchical Structure Evolution Networks
Pengfei Cao, Yupu Hao, Yubo Chen, Kang Liu, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Jun Zhao.
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

Complex Event Schema Induction with Knowledge-Enriched Diffusion Model
Yupu Hao, Pengfei Cao, Yubo Chen, Kang Liu, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation
Zhitao He, Pengfei Cao, Yubo Chen, Kang Liu, Ruopeng Li, Mengshu Sun, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

InstructoR: Instructing Unsupervised Conversational Dense Retrieval with Large Language Models
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

Alignment Precedes Fusion: Open-Vocabulary Named Entity Recognition as Context-Type Semantic Matching
Zhuoran Jin, Pengfei Cao, Zhitao He, Yubo Chen, Kang Liu, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

DiffusionSL: Sequence Labeling via Tag Diffusion Process
Ziyang Huang, Pengfei Cao, Jun Zhao, Kang Liu.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

ZhuJiu: A Multi-dimensional, Multi-faceted Chinese Benchmark for Large Language Models
Baoli Zhang, Haining Xie, Pengfan Du, Junhao Chen, Pengfei Cao, Yubo Chen, Shengping Liu, Kang Liu and Jun Zhao .
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023): system demonstrations

Zero-Shot Cross-Lingual Event Argument Extraction with Language-Oriented Prefix-tuning
Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao.
The 2023 AAAI Conference on Artifcial Intelligence (AAAI 2023)

2022

A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

CogKTR: A Knowledge-Enhanced Text Representation Toolkit for Natural Language Understanding
Zhuoran Jin, Tianyi Men, Hongbang Yuan, Yuyang Zhou, Pengfei Cao, Yubo Chen, Zhipeng Xue, Kang Liu, Jun Zhao.
The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022): system demonstrations

2021

Uncertain Local-to-Global Networks for Document-Level Event Factuality Identification
Pengfei Cao, Yubo Chen, Yuqing Yang, Kang Liu, Jun Zhao.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Domain-Lifelong Learning for Dialogue State Tracking via Knowledge Preservation Networks
Qingbin Liu, Pengfei Cao, Cao Liu, Jiansong Chen, Xunliang Cai, Fan Yang, Shizhu He, Kang Liu, Jun Zhao.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Uncertainty-Aware Self-Training for Semi-Supervised Event Temporal Relation Extraction
Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Wei Bi.
The 30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

Knowledge-Enriched Event Causality Identification via Latent Structure Induction Networks
Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Yuguang Chen, Weihua Peng.
The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification
Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng, Yuguang Chen.
The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism
Tong Zhou, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Kun Niu, Weifeng Chong, Shengping Liu.
The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

Improving Event Causality Identification via Self-Supervised Representation Learning on External Causal Statement
Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng, Yuguang Chen.
The Findings of 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

2020 and Before

Incremental Event Detection via Knowledge Consolidation Networks
Pengfei Cao, Yubo Chen, Jun Zhao, Taifeng Wang.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)

HyperCore: Hyperbolic and Co-graph Representation for Automatic ICD Coding
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, Weifeng Chong.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020)

Clinical-Coder: Assigning Interpretable ICD-10 Codes to Chinese Clinical Notes
Pengfei Cao, Chenwei Yan, Xiangling Fu, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, Weifeng Chong.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020): system demonstrations

Chinese Named Entity Recognition via Adaptive Multi-pass Memory Network with Hierarchical Tagging Mechanism
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 19th China National Conference on Computational Linguistics (CCL 2020)

Adversarial Transfer Learning for Chinese Named Entity Recognition with Self-Attention Mechanism
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu.
The 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018)

Adversarial Training for Relation Classification with Attention based Gate Mechanism
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 3rd China Conference on Knowledge Graph and Semantic Computing (CCKS 2018)

Academic Services

Executive Committee Member, Chinese Information Processing Society of China, Student Committee of Youth Working Committee (2020-2022).
Co-chair of the Doctoral Forum of the First China Student Symposium on Natural Language Processing (CSSNLP 2020).
Committee Member of the Youth Working Committee of Chinese Information Processing Society of China.
Area Chair of Conferences: ACL, EMNLP, NAACL, EACL, etc.
PC Member of Conferences: ICLR, ICML, NeurIPS, AAAI, ACL, EMNLP, COLING, etc.
Journal Reviewer: IEEE TPAMI, TKDE, TNNLS, TALLIP, KBS, etc.

Selected Awards

2023. 中国中文信息学会“博士学位论文激励计划”.
2023. 中国科学院特别研究助理资助项目.
2023. President Excellence Award, Chinese Academy of Sciences.
2023. Outstanding doctor graduates of Beijing.
2023. Outstanding graduates of Chinese Academy of Sciences.
2021. National scholarship for doctoral students.
2020. CAS Institute of Automation “Pandeng” Scholarship.
2019 and 2020. University of CAS Merit Student.