Publications

Papers

2025

  1. Annotating and Inferring Compositional Structures Across Languages
    Arne Rubehn, Christoph Rzymski, Luca Ciucci, Kellen Parker van Dam, Alžběta Kučerová, Katja Bocklage, David Snee, Abishek Stephen, and Johann-Mattis List
    arXiv preprint [not peer-reviewed]
  2. Unstable Grounds for Beautiful Trees? Testing the Robustness of Concept Translations in the Compilation of Multilingual Wordlists
    David Snee, Luca Ciucci, Arne Rubehn, Kellen Parker van Dam, and Johann-Mattis List
    arXiv preprint [not peer-reviewed]
  3. Partial Colexifications Improve Concept Embeddings
    Arne Rubehn and Johann-Mattis List
    arXiv preprint [not peer-reviewed]

2024

  1. Generating Feature Vectors from Phonetic Transcriptions in Cross-Linguistic Data Formats
    Arne Rubehn, Jessica Nieder, Robert Forkel, and Johann-Mattis List
    In Proceedings of the Society for Computation in Linguistics (SCiL) 2024, Irvine, CA
  2. Extracting Tuscan phonetic correspondences from dialect pronunciations automatically
    Arne Rubehn, Simonetta Montemagni, and John Nerbonne
    Language Dynamics and Change

Theses

2022

  1. A feature-based neural model of sound change informed by global lexicostatistical data
    Arne Rubehn
    University of Tübingen
    MA thesis, Computational Linguistics, supervised by Dr. Johannes Dellert and Prof. Dr. Gerhard Jäger.

2019

  1. Exploring the viability of polysemy networks for automated cognate detection under semantic shift
    Arne Rubehn
    University of Tübingen
    BA thesis, General Linguistics, supervised by Dr. Johannes Dellert.

Talks

2025

  1. Concept Embeddings: Applying graph embedding techniques to colexification networks
    Arne Rubehn
    Bridging the Gap: Unifying approaches to linguistic evolution, University of Zurich, Switzerland. 2025/03/14.

2024

  1. Word Embeddings
    Arne Rubehn
    Methoden To Go (Ringvorlesung), University of Passau, Germany. 2024/11/19.
  2. Automatically Segmenting Words into Morphemes: A Detailed Comparison of Unsupervised Approaches Applied to Monolingual Wordlists from Different Languages
    Arne Rubehn
    21st International Congress of Linguistics, Poznań, Poland. 2024/09/11.

Miscellaneous

2022

  1. Erweiterung der durch den Sprachassistenten generierten Antworten durch Bezugnahme auf relevante Objekte im Auto
    [Enhancing answers generated by the dialogue system by referencing relevant objects in the car]
    Christian Drescher, Teresa Botschen, and Arne Rubehn
    Patent registered at the German, European, and American Patent Offices.