arne rubehn
Hi! I am a computational linguist, currently pursuing my PhD at the Chair of Multilingual Computational Linguistics at the University of Passau. I focus on computer-assisted, data-driven methods for historical linguistics with the goal of advancing comparative historical linguistics by the means of intelligent algorithmic methods, alleviating researchers’ workload by processing large-scale data efficiently. I currently persue methods for modelling phonetic and semantic properties of basic vocabulary in an embedded space.
I have studied Computational Linguistics, General Linguistics, and Latin at the University of Tübingen. Within my MA thesis project I have trained a neural network that estimates global probabilities for arbitrary sound changes. Additionally, I have years of working experience as a software developer for EtInEn (Etymological Inference Engine), a software for historical linguists that is being developed at the Linguistic Department in Tübingen.
research interests
My research usually concerns the computational modelling of linguistic questions, especially within the domains of:
- historical linguistics
- phonetics and phonology
- lexical semantics and word formation
- typology
Instead of focusing on individual languages or families, I aim at developing “generalist” models and methods in the light of large-scale, cross-linguistic applications.
news
| Dec 02, 2025 | New preprint: Integration of Linguistic Legacy Data Collections through Digital Scholarly Editions: A Case Study on Vanuatu Languages (with Tihomir Rangelov, Luca Ciucci, John Burgess, Riccardo Rost & Johann-Mattis List). Available on Humanities Commons. |
|---|---|
| Jun 07, 2025 | Three accepted papers: Very happy to announce that our paper “Partial Colexifications Improve Concept Embeddings” (with Johann-Mattis List) has been accepted for the main conference of the Association for Computational Linguistics. Two more papers that I have co-authored were accepted to SIGTYP. |
| Jun 06, 2025 | Talk held: The Potential of Partial Colexifications for Comparative Linguistics (with Johann-Mattis List). C-LESTE Workshop, University of Frankfurt, Germany. |