Pengfei Cao / 曹鹏飞


Assistant Professor at Institute of Automation, Chinese Academy of Sciences


 Intelligence Building, Zhongguancun East Road 95#, Beijing, China.

 pengfei.cao[at]nlpr.ia.ac.cn

Google Scholar / Semantic Scholar


Biography

I am currently an Assistant Professor at Institute of Automation, Chinese Academy of Sciences. Before that, I received my Ph.D. degree from Institute of Automation, Chinese Academy of Sciences (CASIA) in 2023, under the supervision of Prof. Jun Zhao. My research interests broadly include Natural Language Processing, Large Language Models, and Information Extraction. I have published several papers in top-tier conferences such as AAAI, NeurIPS, ACL, EMNLP, NAACL, COLING, CIKM and so on.


News

  • 课题组招收实习生,欢迎有意向的同学与我联系(长期有效).
  • [2024/05] One paper was accepted to NeurIPS 2024.
  • [2024/05] Four papers were accepted to EMNLP 2024.
  • [2024/05] Four papers were accepted to ACL 2024.
  • [2023/12] Four papers were accepted to LREC-COLING 2024.
  • [2023/12] One paper was accepted to AAAI 2024.
  • [2023/11] Seven papers were accepted to EMNLP 2023.
  • [2023/07] Accepted to serve as ACL ARR Area Chair.
  • [2023/07] Accepted to serve as PC member for AAAI 2024.
  • [2022/11] One long paper was accepted to AAAI 2023.
  • [2022/10] One main conference paper and one demo paper were accepted to EMNLP 2022.
  • [2021/08] Two main conference papers were accepted to EMNLP 2021.
  • [2021/05] Three main conference papers and one findings paper were accepted to ACL 2021.

Publications

# denotes equal contribution

2024

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models
Zhuoran Jin, Pengfei Cao, Chenhao Wang, Zhitao He, Hongbang Yuan, Jiachun Li, Yubo Chen, Kang Liu, Jun Zhao.
The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS 2024)

Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models
Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao.
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models
Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao.
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning
Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Jun Zhao.
The Findings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation
Zhitao He, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Jiexin Xu, Huaijun Li, Kang Liu, Jun Zhao.
The Findings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning
Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao.
The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

MULFE: A Multi-Level Benchmark for Free Text Model Editing
Chenhao Wang, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao.
The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models
Zhuoran Jin, Pengfei Cao, Hongbang Yuan, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao.
The Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge Editing
Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

Oasis: Data Curation and Assessment System for Pretraining of Large Language Models
Tong Zhou, Yubo Chen, Pengfei Cao, Kang Liu, Jun Zhao, Shengping Liu.
The 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024): system demonstrations

ZhuJiu-Knowledge: A Fairer Platform for Evaluating Multiple Knowledge Types in Large Language Models
Pengfan Du, Sirui Liang, Baoli Zhang, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024): system demonstrations

Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Qiuxia Li, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering
Chenhao Wang, Pengfei Cao, Jiachun Li, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Qiuxia Li, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning
Zhitao He, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Continual Few-shot Event Detection via Hierarchical Augmentation Networks
Chenlong Zhang#, Pengfei Cao#, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Information Bottleneck based Knowledge Selection for Commonsense Reasoning
Zhao Yang, Yuanzhe Zhang, Pengfei Cao, Cao Liu, Jiansong Chen, Jun Zhao, Kang Liu.
Information Sciences, 2024

Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons
Yuheng Chen#, Pengfei Cao#, Yubo Chen, Kang Liu, Jun Zhao.
The 2024 AAAI Conference on Artifcial Intelligence (AAAI 2024)

2023

Event Ontology Completion with Hierarchical Structure Evolution Networks
Pengfei Cao, Yupu Hao, Yubo Chen, Kang Liu, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Jun Zhao.
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

Complex Event Schema Induction with Knowledge-Enriched Diffusion Model
Yupu Hao, Pengfei Cao, Yubo Chen, Kang Liu, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation
Zhitao He, Pengfei Cao, Yubo Chen, Kang Liu, Ruopeng Li, Mengshu Sun, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

InstructoR: Instructing Unsupervised Conversational Dense Retrieval with Large Language Models
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

Alignment Precedes Fusion: Open-Vocabulary Named Entity Recognition as Context-Type Semantic Matching
Zhuoran Jin, Pengfei Cao, Zhitao He, Yubo Chen, Kang Liu, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

DiffusionSL: Sequence Labeling via Tag Diffusion Process
Ziyang Huang, Pengfei Cao, Jun Zhao, Kang Liu.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

ZhuJiu: A Multi-dimensional, Multi-faceted Chinese Benchmark for Large Language Models
Baoli Zhang, Haining Xie, Pengfan Du, Junhao Chen, Pengfei Cao, Yubo Chen, Shengping Liu, Kang Liu and Jun Zhao .
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023): system demonstrations

Zero-Shot Cross-Lingual Event Argument Extraction with Language-Oriented Prefix-tuning
Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao.
The 2023 AAAI Conference on Artifcial Intelligence (AAAI 2023)

2022

A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

CogKTR: A Knowledge-Enhanced Text Representation Toolkit for Natural Language Understanding
Zhuoran Jin, Tianyi Men, Hongbang Yuan, Yuyang Zhou, Pengfei Cao, Yubo Chen, Zhipeng Xue, Kang Liu, Jun Zhao.
The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022): system demonstrations

2021

Uncertain Local-to-Global Networks for Document-Level Event Factuality Identification
Pengfei Cao, Yubo Chen, Yuqing Yang, Kang Liu, Jun Zhao.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Domain-Lifelong Learning for Dialogue State Tracking via Knowledge Preservation Networks
Qingbin Liu, Pengfei Cao, Cao Liu, Jiansong Chen, Xunliang Cai, Fan Yang, Shizhu He, Kang Liu, Jun Zhao.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Uncertainty-Aware Self-Training for Semi-Supervised Event Temporal Relation Extraction
Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Wei Bi.
The 30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

Knowledge-Enriched Event Causality Identification via Latent Structure Induction Networks
Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Yuguang Chen, Weihua Peng.
The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification
Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng, Yuguang Chen.
The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism
Tong Zhou, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Kun Niu, Weifeng Chong, Shengping Liu.
The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

Improving Event Causality Identification via Self-Supervised Representation Learning on External Causal Statement
Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng, Yuguang Chen.
The Findings of 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

2020 and Before

Incremental Event Detection via Knowledge Consolidation Networks
Pengfei Cao, Yubo Chen, Jun Zhao, Taifeng Wang.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)

HyperCore: Hyperbolic and Co-graph Representation for Automatic ICD Coding
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, Weifeng Chong.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020)

Clinical-Coder: Assigning Interpretable ICD-10 Codes to Chinese Clinical Notes
Pengfei Cao, Chenwei Yan, Xiangling Fu, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, Weifeng Chong.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020): system demonstrations

Chinese Named Entity Recognition via Adaptive Multi-pass Memory Network with Hierarchical Tagging Mechanism
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 19th China National Conference on Computational Linguistics (CCL 2020)

Adversarial Transfer Learning for Chinese Named Entity Recognition with Self-Attention Mechanism
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu.
The 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018)

Adversarial Training for Relation Classification with Attention based Gate Mechanism
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 3rd China Conference on Knowledge Graph and Semantic Computing (CCKS 2018)


Academic Services


Selected Awards