Short Academic CV

Research Profile

My research focuses on natural language processing (NLP), large language model (LLM) evaluation, trustworthy AI, and structured NLP under limited or evolving supervision. I develop data-centric methods for building reliable, robust, inspectable, and auditable AI systems, with applications in clinical decision support, financial services, and public-sector AI.

Education

PhD in Computer Science, Monash University, Australia, 2023 Thesis: Semantic Parsing in Limited Resource Conditions Supervisors: Prof. Gholamreza Haffari, Dr. Lizhen Qu, Prof. Philip R. Cohen
Master of Computing (Advanced) in Artificial Intelligence, Australian National University, Australia, 2015 Thesis: Representation Learning for Weakly Supervised Relation Extraction
Bachelor of Engineering in Electronic Information Engineering, Wuhan University of Science and Technology, China, 2013

Academic and Industry Appointments

Lecturer, School of Computing Technologies, RMIT University, Melbourne, 2024–present
Research Fellow, Faculty of Information Technology, Monash University, 2023–2024
Research Scientist, Openstream.ai, 2023–2024
Software Development Engineer, Microsoft Search Technology Center Asia, 2017–2018
Engineer, Hong Kong Applied Science and Technology Research Institute, 2016–2017
Visiting Student, National ICT Australia, 2015–2016

Selected Funding and Awards

Wellcome Trust AI4You(th) project, co-applicant, on safe clinical LLMs for youth mental health; approximately AUD 6M total project funding, in partnership with Google Health.
CSIRO Data61 Next Generation AI Graduate Program, co-principal investigator, on AI for next-generation food and waste systems.
EMNLP 2025 Outstanding Paper Award, senior author, for DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement; one of seven such awards among 3,200+ accepted papers; first confirmed Australia-affiliated EMNLP Outstanding Paper.
Monash Faculty of Information Technology Research Scholarship and International Postgraduate Research Scholarship.

Teaching

RMIT University
- Lecturer and course coordinator, ISYS1079 / ISYS3476: Managing Semi-structured and Unstructured Data
- Lecturer, COSC2610: Web Programming Studio
- Lecturer, COSC2502: C++ Programming Studio
Monash University
- Administrative tutor and guest lecturer, FIT5149: Applied Data Analysis
- Tutor, FIT5125: IT Research Methods

Supervision

My supervision is centred on Trustworthy LLMs, Domain-grounded AI, and Structured NLP. See the Supervision page for current supervision areas and students.

Academic Service

Senior Area Chair, EMNLP 2026
Area Chair, NeurIPS 2026, ACL Rolling Review, and NLPCC 2026
Program Chair, PersonaLLM Workshop at NeurIPS 2025; Publication Chair, ALTA Workshop 2026; Shared Task Organizer, ALTA 2024
Reviewer for ACL, EMNLP, NAACL, EACL, ICLR, NeurIPS, AAAI, IJCAI, and related NLP/AI venues

Software, Open Source, and Patents

FactualSceneGraph: toolkit for faithful and consistent textual scene-graph parsing, connecting FACTUAL (Findings of ACL 2023) with DiscoSG (EMNLP 2025 Outstanding Paper Award); .
StarCoder 2 / The Stack v2: contributor to the BigCode open-science code LLM initiative; The Stack v2 spans 619 programming languages and supports StarCoder2 models trained on 3.3-4.3T tokens. Paper
SCAR: ACL 2025 data selection method and toolkit for efficient instruction tuning; selecting as little as 0.7% of the full dataset can match or surpass full-data fine-tuning in reported benchmarks. Paper
US Patent 12,548,554: active learning based multilingual semantic parser.
US Patent Application 18/756,077: programmer-interpreter approach for LLM post-editing.

Languages

English
Mandarin Chinese

Zhuang Li