Publications
Chosen filters:
Chosen filters:
Word Sense Disambiguation for Ancient Greek; Sourcing a training corpus through translation alignment KU Leuven
This paper seeks to leverage translations of Ancient Greek texts to enhance the performance of automatic word sense disambiguation (WSD). Satisfactory WSD in Ancient Greek is achievable, provided that the system can rely on annotated data. This study, acknowledging the challenges of manually assigning meanings to every Greek lemma, explores strategies to derive WSD data from parallel texts using sentence and word alignment. Our results suggest ...
In Search of the Flocks: How to Perform Onomasiological Queries in an Ancient Greek Corpus? KU Leuven
Seeing the Light through the Trees: How Treebanks Can Advance the Education of Classical Languages KU Leuven
A Corpus-Based Approach to Conceptual History of Ancient Greek. The Case of βάρβαρος KU Leuven
While detecting semasiological or onomasiological change in the past involved annotating hundreds of corpus examples manually, the rise of “distributional” approaches to semantics has enabled researchers to detect semantic change in automated ways. These developments in linguistics are of particular interest for conceptual history, a branch in history that focuses on the evolution of concepts over time. The aim of this paper is to bring ...
A Computational Approach to the Greek Papyri: Developing a Corpus to Study Variation and Change in the Post-Classical Greek Complementation System KU Leuven
The aim of this PhD project is to advance the corpus-linguistic study of the Greek papyri, a large diachronic corpus (3rd century BC - 8th century AD) of non-literary Greek. It consists of two central parts. The first part is focused on corpus design: starting from the transcribed (XML) version of these texts, it describes a pipeline model to supply the papyri step for step with linguistic information, using natural language processing (NLP) ...
Automatic Semantic Role Labeling in Ancient Greek Using Distributional Semantic Modeling KU Leuven
This paper describes a first attempt to automatic semantic role labeling in Ancient Greek, using a supervised machine learning approach. A Random Forest classifier is trained on a small semantically annotated corpus of Ancient Greek, annotated with a large amount of linguistic features, including form of the construction, morphology, part-of-speech, lemmas, animacy, syntax and distributional vectors of Greek words. These vectors turned out to be ...
Creating a richly annotated corpus of papyrological Greek: the possibilities of Natural Language Processing approaches to a highly inflected historical language KU Leuven
This article describes a first attempt to annotate the full Greek papyrus corpus automatically for linguistic information. It gives an overview of existing work on Ancient Greek and analyzes the typical problems one encounters when using natural language processing techniques on (1) a historical corpus of (2) a highly inflectional language (as opposed to the more analytic present-day English) and offers solutions to them, testing several ...