About Me
I am a Ph.D. candidate in the School of Humanities at Tsinghua University. My major is computational linguistics, combining artificial intelligence and natural languages. I am interested in computational lexical semantics (including polysemy, ambiguity and uncertainty) and AI interpretability for LLMs.
News
[January 2025] I just passed mid-term examination by defensing and submitting a report! Thank you for the comments from three committee members (Prof. Mingming Liu, Prof. Xiaoshi Hu, Prof. Ying Liu)!
[December 2024] Volunteer applications have been accepted by COLING 2025! See you in Abu Dhabi~
[December 2024] Two papers to CoMeDi Workshop have been accepted!
[December 2024] ACL SRW 2025 is recruiting mentors and reviewers. Refer to more info here.
[December 2024] I got the feedback from ARR meta (a little bit negative though) and committed it to NAACL 2025.
[December 2024] I submitted a thesis proposal on NAACL SRW 2025. Looking forward to the feedback!
Past news
[November 2024] I attended the Sixth Linguistics and Philology Forum and got the first awards!
[November 2024] I obtained the review from ARR Oct. Not bad:)
[November 2024] I sumbitted two papers to CoMeDi Workshop and reviewed another paper.
[October 2024] I sumbitted a paper to ARR Oct.
[August 2024] I attended the ACL 2024 in Bangkok, a really wonderful experience!
[July 2024] I attended the CCL 2024, the first in-person conference I ever attended.
[July 2024] I have finished my defense of Thesis Proposal Defense. Thank you for the insightful comments from three committee members (Dr. Sujian Li, Dr. Bing Qiu, and Dr. Ying Liu)!
[June 2024] I will review one paper from ACL workshop of Rep4NLP.`
[June 2024] I submitted a paper of thesis proposal to ACL SRW! Expected to helpful comments.
[May 2024] The paper has been accepted by Findings: ACL 2024
[May 2024] I lead a reading group on computational linguistics. Hope we can learn something together.
[May 2024] I have applied CSC-funded the joint Phd program, cosupervised by Professor Roberto Navigli at Sapienza University of Rome
[Febrary 2024] One paper is under review by ACL 2024
[December 2023] I have received Excellent Comprehensive Scholarship of Tsinghua University (First Prize).
[December 2023] I have finished two Linguistic lessions (auditing).
[December 2023] I attended the CDH 2023 and the paper was awarded Outstanding Creative Paper Award on Interactive and automatic mural captioning.
[November 2023] One paper titled "Evaluation of the semantics and agential degree of Chinese subject-object reversible sentences based on large language model" was presented and awarded the first prize.
[July 2023] Five papers in EMNLP 2023 have been reviewed.
[June 2023] A talk titled "To know, or not to know? Language, Uncertainty and Artificial Intelligence" at Shanxi University of Finance and Economics.
[May 2023] One paper has been accepted by ACL findings 2023.
Education
Tsinghua University (PhD Candidate)
Sept 2022 - May 2026 (Expected)
Major Computational Linguistics
courses: Linguistic typology (by Rui Guo, auditing), Morphology (by XiuFang Dong, auditing), Probability and Random Mathematics (by Jun Ye, auditing), Natural Language Processing (by Yue Zhang, online auditing), Topics in Chinese Morphology and Syntax (by Dun Deng, A), Research on Corpus Linguistics (by Ying Liu, A), Research of Seclected Topics on Chinese Semantics (by Bing Qiu, A, Thesis: An Empirical Study on Semantic Relations within Chinese Compound Words based on Word Embeddings)
Other interesting courses: Appreciation of Western Opera (by Yi Ding, auditing), History of Western Music (by Xiao Kang, auditing), Appreciation of Peking Opera (by Mengmei Zhou)
Road to Thesis:
A Survey of Word Sense Disambiguation (Thesis of Comprehensive Examination, A)
Aspect of uncertainty in WSD (Conference paper)
Research proposal for thesis (RP) [May, 2024]
Thesis proposal defense for Final Thesis (Chinese RP, PPT) [July, 2024]
Research proposal v2 for thesis (RP) [December, 2024], which was submitted to NAACL SRW 2025
Mid-term Examination Report for Final Thesis (Mid-term Report, PPT)
Southern University of Science and Technology (Master)
Sept 2019 - July 2022
Major: Computer Science
courses: Advanced Algorithm (A), Bayesian Data Analysis (A), Machine Learning (A)
Other interesting courses: Translation and Appreciation of Chinese Classical Poetry (by Mengwen Zhu, A), Classics and the Process of Canonization (by Mengwen Zhu, auditing)
Thesis: Towards Human-like Diverse Video Captioning Via a Latent Generative Model
Dalian University of Technology (Undergraduate)
Sept 2015 - July 2019
Major: Digital Media Techonology
Academic Experiences
Research Intern in THUNLP, cosupervised by Maosong Sun. Reasearch focuses on LLM interpretability and semantic representation.
October 2023 -
Reviewer of EMNLP 2023
Reviewer of Rep4NLP in ACL'24
Poster presenter in ACL'24
Attendee at CCL'24
Reviewer of CoMeDi, a workshop in Coling 2025
Presenter in Sixth Linguistics and Philology Forum, and got first award.
Volunteer in Coling 2025
Publications & Projects
Evaluating Distributed Representations for Multi-Level Lexical Semantics: A Research Proposal
Arxiv (under review)
[Paper]
JuniperLiu at CoMeDi Shared Task: Models as Annotators in Lexical Semantics Disagreements
CoMeDi workshop in Coling 2025
A Top-down Graph-based Tool for Modeling Classical Semantic Maps: A Crosslinguistic Case Study of Supplementary Adverbs
Arxiv (under review)
Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics
Findings of ACL 2024; CoMeDi workshop in Coling 2025 (non-achival)
Ambiguity Meets Uncertainty: Investigating Uncertainty Estimation for Word Sense Disambiguation
Findings of ACL 2023
Show, Tell and Rephrase: Diverse Video Captioning via Two-Stage Progressive Training
TMM 2022
Talks
To know, or not to know? Language, Uncertainty and Artificial Intelligence
At Shanxi University of Finance and Economnics, June 12th, [PPT]
In this talk, I reviewed my recent research from a philosophical perspective of epistemology: What does the model not know? (Known Knowns) What does the model unknow? (Known Unknowns).
Evaluation agentiveness of Subject-object reversible sentences in LLMs
At a linguistic seminar, November 26th, [PPT]
In this talk, I employ difference vetors in LLMs to represent agentiveness of different components (SVO) in a reversible but meaning-preserving sentence pair.
A Top-down Graph-based Tool for Modeling Classical Semantic Maps: A Crosslinguistic Case Study of Supplementary Adverbs
At a linguistic seminar, November 30th, 2025. [PPT in Chinese] [Paper] [CODE]
In this talk, I develop a graph algorithm to construct the semantic map automatically. The tool is available now, and feel free to use it.
Awards and Honors
First Prize for Outstanding Paper at the 6th Youth Academic Forum on Linguistics and Philology at Tsinghua University (2024)
First-Class University-Level Comprehensive Scholarship of Tsinghua University for the Year 2023
Outstanding Creative Paper Award at the 8th Shanghai Library Open Data Competition for the Year 2023
First Prize for Outstanding Paper at the 5th Youth Academic Forum on Linguistics and Philology at Tsinghua University (2023)
Third Prize for Outstanding Paper at the 4th Youth Academic Forum on Linguistics and Philology at Tsinghua University (2022)
Other Resources:
- A comprehensive repo about lexical ambiguity: Awesome Word Sense Disambiguation.