Pengfei Cao / 曹鹏飞


Assistant Professor at Institute of Automation, Chinese Academy of Sciences


 Intelligence Building, Zhongguancun East Road 95#, Beijing, China.

 pengfei.cao[at]nlpr.ia.ac.cn

Google Scholar / Semantic Scholar


Biography

I am currently an Assistant Professor at Institute of Automation, Chinese Academy of Sciences. Before that, I received my Ph.D. degree from Institute of Automation, Chinese Academy of Sciences (CASIA) in 2023, under the supervision of Prof. Jun Zhao. My research interests broadly include Natural Language Processing, Large Language Models, and Information Extraction. I have published several papers in top-tier conferences such as AAAI, ACL, EMNLP, NAACL, COLING, CIKM and so on.


News

  • 课题组招收实习生,欢迎有意向的同学与我联系(长期有效) .
  • [2023/12] Four papers were accepted to LREC-COLING 2024.
  • [2023/12] One paper was accepted to AAAI 2024.
  • [2023/11] Seven papers were accepted to EMNLP 2023.
  • [2023/07] Accepted to serve as ACL ARR Area Chair.
  • [2023/07] Accepted to serve as PC member for AAAI 2024.
  • [2022/11] One long paper was accepted to AAAI 2023.
  • [2022/10] One main conference paper and one demo paper were accepted to EMNLP 2022.
  • [2021/08] Two main conference papers were accepted to EMNLP 2021.
  • [2021/05] Three main conference papers and one findings paper were accepted to ACL 2021.

Publications

# denotes equal contribution

Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Qiuxia Li, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering
Chenhao Wang, Pengfei Cao, Jiachun Li, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Qiuxia Li, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning
Zhitao He, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Continual Few-shot Event Detection via Hierarchical Augmentation Networks
Chenlong Zhang#, Pengfei Cao#, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, Jun Zhao.
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Information Bottleneck based Knowledge Selection for Commonsense Reasoning
Zhao Yang, Yuanzhe Zhang, Pengfei Cao, Cao Liu, Jiansong Chen, Jun Zhao, Kang Liu.
Information Sciences, 2024

Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons
Yuheng Chen#, Pengfei Cao#, Yubo Chen, Kang Liu, Jun Zhao.
The 2024 AAAI Conference on Artifcial Intelligence (AAAI 2024)

Event Ontology Completion with Hierarchical Structure Evolution Networks
Pengfei Cao, Yupu Hao, Yubo Chen, Kang Liu, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Jun Zhao.
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

Complex Event Schema Induction with Knowledge-Enriched Diffusion Model
Yupu Hao, Pengfei Cao, Yubo Chen, Kang Liu, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation
Zhitao He, Pengfei Cao, Yubo Chen, Kang Liu, Ruopeng Li, Mengshu Sun, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

InstructoR: Instructing Unsupervised Conversational Dense Retrieval with Large Language Models
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

Alignment Precedes Fusion: Open-Vocabulary Named Entity Recognition as Context-Type Semantic Matching
Zhuoran Jin, Pengfei Cao, Zhitao He, Yubo Chen, Kang Liu, Jun Zhao.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

DiffusionSL: Sequence Labeling via Tag Diffusion Process
Ziyang Huang, Pengfei Cao, Jun Zhao, Kang Liu.
The Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

ZhuJiu: A Multi-dimensional, Multi-faceted Chinese Benchmark for Large Language Models
Baoli Zhang, Haining Xie, Pengfan Du, Junhao Chen, Pengfei Cao, Yubo Chen, Shengping Liu, Kang Liu and Jun Zhao .
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023): system demonstrations

Zero-Shot Cross-Lingual Event Argument Extraction with Language-Oriented Prefix-tuning
Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao.
The 2023 AAAI Conference on Artifcial Intelligence (AAAI 2023)

A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

CogKTR: A Knowledge-Enhanced Text Representation Toolkit for Natural Language Understanding
Zhuoran Jin, Tianyi Men, Hongbang Yuan, Yuyang Zhou, Pengfei Cao, Yubo Chen, Zhipeng Xue, Kang Liu, Jun Zhao.
The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022): system demonstrations

Uncertain Local-to-Global Networks for Document-Level Event Factuality Identification
Pengfei Cao, Yubo Chen, Yuqing Yang, Kang Liu, Jun Zhao.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Domain-Lifelong Learning for Dialogue State Tracking via Knowledge Preservation Networks
Qingbin Liu, Pengfei Cao, Cao Liu, Jiansong Chen, Xunliang Cai, Fan Yang, Shizhu He, Kang Liu, Jun Zhao.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Uncertainty-Aware Self-Training for Semi-Supervised Event Temporal Relation Extraction
Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Wei Bi.
The 30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

Knowledge-Enriched Event Causality Identification via Latent Structure Induction Networks
Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Yuguang Chen, Weihua Peng.
The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification
Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng, Yuguang Chen.
The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism
Tong Zhou, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Kun Niu, Weifeng Chong, Shengping Liu.
The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

Improving Event Causality Identification via Self-Supervised Representation Learning on External Causal Statement
Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng, Yuguang Chen.
The Findings of 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)

Incremental Event Detection via Knowledge Consolidation Networks
Pengfei Cao, Yubo Chen, Jun Zhao, Taifeng Wang.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)

HyperCore: Hyperbolic and Co-graph Representation for Automatic ICD Coding
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, Weifeng Chong.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020)

Clinical-Coder: Assigning Interpretable ICD-10 Codes to Chinese Clinical Notes
Pengfei Cao, Chenwei Yan, Xiangling Fu, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, Weifeng Chong.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020): system demonstrations

Chinese Named Entity Recognition via Adaptive Multi-pass Memory Network with Hierarchical Tagging Mechanism
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 19th China National Conference on Computational Linguistics (CCL 2020)

Adversarial Transfer Learning for Chinese Named Entity Recognition with Self-Attention Mechanism
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu.
The 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018)

Adversarial Training for Relation Classification with Attention based Gate Mechanism
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao.
The 3rd China Conference on Knowledge Graph and Semantic Computing (CCKS 2018)


Academic Services


Selected Awards