Papers
My research focuses on how computational methods can be applied to Indigenous Formosan languages in Taiwan. Selected works are grouped below for quick reference.
Journal Articles
- A Corpus-based Study of Causative Constructions in Paiwan. With Milingan Chia-hao Tai and Shu-Kai Hsieh. Foreign Language Studies, 41 (2025), 1–33. Demonstrates how a statistical method and balanced corpora explain the productivity of Paiwan causatives.
- Empowering Elementary Learning: Utilizing Large Language Models to Craft Tailored Textbooks with Expert Insight. With Da-Chen Lian, Po-Ya Angela Wang, Wei-Ling Chen, and Shu-Kai Hsieh. Journal of Library and Information Studies, 23(2), 145-183. Outlines an expert-in-the-loop pipeline for trustworthy textbook customization.
Conference Papers & Posters
- CorPilot: An Agentic Framework for Corpus Linguistics. Computing Conference 2026. Introduces a modular assistant that automates concordance building, annotation validation, and reproducible reporting.
- Automatic Speech Recognition for Endangered Languages: A Case Study on Amis in Taiwan. Workshop on Computational Typology and Speech (WCTS-5), 2025. Reports a pronunciation-variant-aware ASR recipe tailored to Amis documentation data.
- A Logistic Regression Analysis to Paiwan Causative Constructions. Academia Sinica Linguistics Forum-4, 2023. Quantifies how morphosyntactic cues condition speaker preferences across causative strategies.
- CxLM: A Construction and Context-aware Language Model. LREC 2022 (with Yu-Hsiang Tseng et al.). Details a pretrained model that encodes construction-level signals for downstream linguistic tasks.
- Keyword-centered Collocating Topic Analysis. ROCLING 2021. Combines topic modeling with constructional profiling to surface discourse patterns around pedagogical keywords.
- Lectal Variation of the Two Chinese Causative Auxiliaries. ROCLING 2020. Examines how regional dialects influence auxiliary selection through statistical methods.