Yinghao Zhu (朱英豪)

Yinghao Zhu Avatar

Email: yhzhu99 [at] gmail [dot] com

Phone: (+86) 15026559349

CV, GitHub, Google Scholar, ResearchGate, OpenReview, ORCID, Semantic Scholar, arXiv, WCA

[About Me] I am currently working at Peking University with Prof. Liantao Ma. My research primarily focuses on AI for Healthcare, with emphasis on the following areas: (1) Medical Large Language Models (LLMs): Developing LLM agents and multimodal LLMs for medical applications; (2) Healthcare Modeling: Clinical predictive modeling using electronic health record (EHR) data; and (3) Healthcare Benchmarks, Toolkits & Platforms: Creating benchmarks, toolkits, and platforms for the healthcare community.

Education

Professional Experience

Publications
(* indicates the equal contributions, indicates the corresponding author.)

  1. Learnable Prompt as Pseudo-Imputation: Rethinking the Necessity of Traditional EHR Data Imputation in Downstream Clinical Prediction Weibin Liao, Yinghao Zhu, Zhongji Zhang, Yuhang Wang, Zixiang Wang, Xu Chu, Yasha Wang, Liantao Ma ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025
  2. Revolutionizing Newcomers' Onboarding Process in OSS Communities: The Future AI Mentor Xin Tan, Xiao Long, Yinghao Zhu, Lin Shi, Xiaoli Lian, Li Zhang ACM International Conference on the Foundations of Software Engineering (FSE), 2025
  3. ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration Zixiang Wang*, Yinghao Zhu*, Huiya Zhao*, Xiaochen Zheng, Tianlong Wang, Wen Tang, Yasha Wang, Chengwei Pan, Ewen M. Harrison, Junyi Gao, Liantao Ma ACM International World Wide Web Conference (WWW), 2025
  4. Adaptive Activation Steering: A Tuning-Free LLM Truthfulness Improvement Method for Diverse Hallucinations Categories Tianlong Wang*, Xianfeng Jiao*, Yinghao Zhu, Zhongzhi Chen, Yifan He, Xu Chu, Junyi Gao, Yasha Wang, Liantao Ma ACM International World Wide Web Conference (WWW), 2025
  5. Medical MLLM is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models Xijie Huang*, Xinyuan Wang*, Hantao Zhang*, Yinghao Zhu*, Jiawen Xi, Jingkun An, Hao Wang, Hao Liang, Chengwei Pan Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025
  6. AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation Jingkun An*, Yinghao Zhu*, Zongjian Li*, Enshen Zhou, Haoran Feng, Xijie Huang, Bohua Chen, Yemin Shi, Chengwei Pan Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025
  7. Protocol for Processing Multivariate Time-series Electronic Health Records of COVID-19 Patients Zixiang Wang*, Yinghao Zhu*, Dehao Sui, Tianlong Wang, Yuntao Zhang, Yasha Wang, Chengwei Pan, Junyi Gao, Liantao Ma, Ling Wang, Xiaoyun Zhang STAR Protocols, 2025
  8. EMERGE: Enhancing Multimodal Electronic Health Records Predictive Modeling with Retrieval-Augmented Generation Yinghao Zhu*, Changyu Ren*, Zixiang Wang, Xiaochen Zheng, Shiyun Xie, Junlan Feng, Xi Zhu, Zhoujun Li, Liantao Ma, Chengwei Pan ACM International Conference on Information and Knowledge Management (CIKM), 2024
  9. PRISM: Mitigating EHR Data Sparsity via Learning from Missing Feature Calibrated Prototype Patient Representations Yinghao Zhu, Zixiang Wang, Long He, Shiyun Xie, Xiaochen Zheng, Liantao Ma, Chengwei Pan ACM International Conference on Information and Knowledge Management (CIKM), 2024
  10. A Comprehensive Benchmark For COVID-19 Predictive Modeling Using Electronic Health Records in Intensive Care Junyi Gao*, Yinghao Zhu*, Wenqing Wang*, Zixiang Wang, Guiying Dong, Wen Tang, Hao Wang, Yasha Wang, Ewen M. Harrison, Liantao Ma Cell Patterns, 2024
  11. Protocol to process follow-up electronic medical records of peritoneal dialysis patients to train AI models Tianlong Wang*, Yinghao Zhu*, Zixiang Wang, Wen Tang, Xinju Zhao, Tao Wang, Yasha Wang, Junyi Gao, Liantao Ma, Ling Wang STAR Protocols, 2024
  12. Prediction of feeding difficulties in neonates with hypoxic-ischemic encephalopathy using magnetic resonance imaging-derived radiomics features Yaqin Xia, Mingshu Yang, Tianyang Qian, Jiayu Zhou, Mei Bai, Siqi Luo, Chaogang Lu, Yinghao Zhu, Laishuan Wang, Zhongwei Qiao Pediatric Radiology, 2024
  13. EHRFlow: A Large Language Model-Driven Iterative Multi-Agent Electronic Health Record Data Analysis Workflow Hao Wu*, Yinghao Zhu*, Zixiang Wang, Xiaochen Zheng, Ling Wang, Wen Tang, Yasha Wang, Chengwei Pan, Ewen M. Harrison, Junyi Gao, Liantao Ma Artificial Intelligence and Data Science for Healthcare: Bridging Data-Centric AI and People-Centric Healthcare (KDD 2024 AIDSH Workshop), Oral, 2024
  14. RetCare: Towards Interpretable Clinical Decision Making through LLM-Driven Medical Knowledge Retrieval Zixiang Wang*, Yinghao Zhu*, Junyi Gao, Xiaochen Zheng, Yuhui Zeng, Yifan He, Bowen Jiang, Wen Tang, Ewen M. Harrison, Chengwei Pan, Liantao Ma, Ling Wang Artificial Intelligence and Data Science for Healthcare: Bridging Data-Centric AI and People-Centric Healthcare (KDD 2024 AIDSH Workshop), 2024
  15. DeepEST: A Python Library for Spatio-Temporal Epidemiology Prediction Yuhang Wang, Yinghao Zhu, Lifang Liang, Yasha Wang, Ewen M. Harrison, Liantao Ma, Junyi Gao Artificial Intelligence and Data Science for Healthcare: Bridging Data-Centric AI and People-Centric Healthcare (KDD 2024 AIDSH Workshop), 2024
  16. PIGWN: Physics-Informed Graph WaveNet for Airport Flight Traffic Flow Prediction Zhichao Yang*, Yinghao Zhu*, Ziyue Niu, Yanru Huang, Chengwei Pan, Xiwang Dong International Conference on Industrial Artificial Intelligence (IAI), 2024
  17. Is larger always better? Evaluating and prompting large language models for non-generative medical tasks. Yinghao Zhu*, Junyi Gao*, Zixiang Wang*, Weibin Liao*, Xiaochen Zheng, Lifang Liang, Yasha Wang, Chengwei Pan, Ewen M. Harrison, Liantao Ma Preprint, 2024
  18. SuperGS: Super-Resolution 3D Gaussian Splatting Enhanced by Variational Residual Features and Uncertainty-Augmented Learning Shiyun Xie, Zhiru Wang, Xu Wang, Yinghao Zhu, Chengwei Pan, Xiwang Dong Preprint, 2024
  19. Prompting Large Language Models for Zero-Shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data Yinghao Zhu*, Zixiang Wang*, Junyi Gao, Yuning Tong, Jingkun An, Weibin Liao, Ewen M. Harrison, Liantao Ma, Chengwei Pan Preprint, 2024
  20. LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation Weibin Liao, Yinghao Zhu, Xinyuan Wang, Chengwei Pan, Yasha Wang, Liantao Ma Preprint, 2024
  21. Exploring the Relationship Between Dietary Intake and Clinical Outcomes in Peritoneal Dialysis Patients Yueying Wu*, Junyi Gao*, Wen Tang, Chunyan Su, Yinghao Zhu, Tianlong Wang, Weibin Liao, Xu Chu, Ewen M. Harrison, Yasha Wang, Liantao Ma Preprint, 2024
  22. How far are AI-powered programming assistants from meeting developers' needs? Xin Tan, Xiao Long, Xianjun Ni, Yinghao Zhu, Jing Jiang, Li Zhang Preprint, 2024
  23. Mortality Prediction with Adaptive Feature Importance Recalibration for Peritoneal Dialysis Patients Liantao Ma*, Chaohe Zhang*, Junyi Gao*, Xianfeng Jiao, Zhihao Yu, Yinghao Zhu, Tianlong Wang, Xinyu Ma, Yasha Wang, Wen Tang, Xinju Zhao, Wenjie Ruan, Tao Wang Cell Patterns, Cover, 2023
  24. M3Fair: Mitigating Bias in Healthcare Data through Multi-Level and Multi-Sensitive-Attribute Reweighting Method Junyi Gao*, Yinghao Zhu*, Wenqing Wang*, Zixiang Wang, Guiying Dong, Wen Tang, Hao Wang, Yasha Wang, Ewen M. Harrison, Liantao Ma Beijing Health Data Science Summit 2023, Health Data Science (HDSS), Abstract, 2023
  25. A Comprehensive Benchmark for COVID-19 Predictive Modeling Using Electronic Health Records in Intensive Care: Choosing the Best Model for COVID-19 Prognosis Junyi Gao*, Yinghao Zhu*, Wenqing Wang*, Yasha Wang, Wen Tang, Liantao Ma American Medical Informatics Association (AMIA) Informatics Summit, Podium Abstract Track, Oral, 2023
  26. Exploration of the feasibility of using examination time order to split small sample size data for radiomics Mingshu Yang, Zhongwei Qiao, Yinghao Zhu, Chaogang Lu, Yaqin Xia The Asian and Oceanic Society for Paediatric Radiology (AOSPR), Oral, 2023
  27. Assessing the value of the radiomics model based on MRI of the wrist joint in predicting the use of biologics in JIA Mingshu Yang, Zhongwei Qiao, Yinghao Zhu, Chaogang Lu, Yaqin Xia The Asian and Oceanic Society for Paediatric Radiology (AOSPR), Oral, 2023
  28. Domain-invariant Clinical Representation Learning by Bridging Data Distribution Shift across EMR Datasets Zhongji Zhang, Yuhang Wang, Yinghao Zhu, Xinyu Ma, Tianlong Wang, Chaohe Zhang, Yasha Wang, and Liantao Ma Preprint, 2023
  29. M3Care: Learning with Missing Modalities in Multimodal Healthcare Data Chaohe Zhang*, Xu Chu*, Liantao Ma, Yinghao Zhu, Yasha Wang, Jiangtao Wang, Junfeng Zhao ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2022
  30. Python Data Analysis Yunxiang Lyu, Zhipeng Wang, Lihua Xu, Zhaoyi Wang, Yinghao Zhu, Kun Yan, Shanzhao Qiu, Jiawei Tang, Kaiwen Feng, Wei Chen, Tianyi Chen, Zhendong Hong, Yunfei Yang, Jinman Xie, Zeliang Yao, Yangang Han, Yihang Wu Tsinghua University Press, 2023
  31. Big Data Visualization Techniques Yunxiang Lyu, Zeliang Yao, Jili Xie, Yinghao Zhu, Shanzhao Qiu, Yangang Han, Zehuan Huang Tsinghua University Press, 2023
  32. Theory and Practice of Artificial Intelligence Yunxiang Lyu, Luting Huang, Zezhong Liang, Wenzhi Yin, Xueting Han, Yinghao Zhu, Miaoran Chen Tsinghua University Press, 2022

Projects

  1. PyEHR: A Predictive Modeling Toolkit for Electronic Health Records Yinghao Zhu, Wenqing Wang, Junyi Gao, Liantao Ma GitHub
  2. Envisioning the Future Through AI: Perspectives on Global Landscapes and Lifestyles Yinghao Zhu, Ziyi Wang, Caixin Kang, Hao Li, Jingkun An, Enshen Zhou, Haoran Feng, Bo Hou, Long He, Xinlei Bao, Zihao Li, Chuang Wang, Xinyuan Wang Computer Vision and Pattern Recognition (CVPR) Art Gallery, 2023

Awards

  1. Outstanding Graduate of Beijing Beijing Municipal Education Commission, 2025
  2. Detecting Active Tuberculosis Bacilli Yinghao Zhu, Junyi Gao, Liantao Ma Top 4 out of all teams, Nightingale Open Science and Wellgen Medical, 2024
  3. Bias Detection Tools for Clinical Decision Making Challenge Yinghao Zhu, Jingkun An, Enshen Zhou, Hao Li, Haoran Feng Third Place Prize, NIH/NCATS, 2023
  4. Alibaba Tianchi UNiLAB Algorithm Competition Yinghao Zhu, Zhihao Yu, Xianfeng Jiao Encouragement Award, Top 11 out of 230 teams at Track 1 and Top 10 out of 324 teams at Track 2, 2023
  5. "Challenge Cup" Competition of Science Achievement in China Special Prize (Top 1), China Association for Science and Technology, etc, 2023
  6. High Risk Breast Cancer Prediction Contest 1 Yinghao Zhu, Junyi Gao, Xinze Li, Yifan He, Wenqing Wang, Liantao Ma Top 3 out of all teams, Nightingale Open Science, Association for Health Learning & Inference (AHLI), and Providence St. Joseph Health, 2022
  7. Outstanding Graduate of Beihang University Beihang University, 2022
  8. "Feng Ru Cup" Competition of Academic and Technological Works First Prize (Top 1% of all candidates of all majors), Beihang University, 2021, 2022

Talks

  1. Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration Seminar on Advancing Trustworthy and Accessible Healthcare Informatics at Peking University & Cell Press, 2024.10.31
  2. What Makes a Next-Generation AI-Powered Healthcare System? University of Zurich & University Hospital Zurich, 2024.07.08
  3. Deep learning interpretable analysis of multivariate time-series electronic medical record data HIT Webinar, 2023.01.06
  4. Invited talk for the High Risk Breast Cancer Prediction Challenge Machine Learning for Health (ML4H), 2022.11.28

Services

Reviewer

Reviewer for NeurIPS, ICLR, ICML, KDD, WWW, etc.
  • ACM Transactions on Knowledge Discovery from Data (TKDD)
    2025
  • IJCNN 2025 Conference
    2025
  • ICML 2025 Conference
    2025
  • AMIA 2025 Clinical Informatics Conference
    2025
  • AISTATS 2025 Conference
    2025
  • TheWebConf 2025 Conference
    2025
  • ICASSP 2025 Conference
    2025
  • ICLR 2025 Conference
    2025
  • KDD 2025 Research Track
    2025
  • AMIA 2025 Informatics Summit
    2025
  • ML4H 2024 Conference
    2024
  • NeurIPS 2024 TSALM & OWA & FM4Science Workshop
    2024
  • NeurIPS 2024 Datasets and Benchmarks Track
    2024
  • NeurIPS 2024 Main Track
    2024
  • KDD 2024 Research Track
    2024
  • KDD 2024 AIDSH Workshop
    2024
  • AMIA 2024 Annual Symposium
    2024
  • AMIA 2024 Clinical Informatics Conference
    2024
  • NeurIPS 2023 Datasets and Benchmarks Track
    2023
  • AMIA 2023 Annual Symposium
    2023
  • Journal of Data-centric Machine Learning Research (DMLR)
    2023

Teaching Assistant

TA for Algorithms, Operating Systems, Discrete Mathematics, etc.
  • Fundamentals of Programming and Computer Science
    Spring 2023
  • Design and Analysis of Algorithms
    Spring 2022, Autumn 2022
  • Operating System
    Autumn 2021
  • Network Storage
    Autumn 2021
  • System Programming
    Spring 2021
  • Object-oriented Programming
    Spring 2021
  • Discrete Mathematics
    Spring 2020, Autumn 2020, Spring 2021