Search | Research Portal

Chosen filters:

Alek Keersmaekers

Chosen filters:

Alek Keersmaekers

1 - 10 of 13 results

Sorteren op

Applying Distributional Semantic Models to a Historical Corpus of a Highly Inflected Language: the Case of Ancient Greek KU Leuven

Alek Keersmaekers, Dirk Speelman

Word Sense Disambiguation for Ancient Greek; Sourcing a training corpus through translation alignment KU Leuven

Alek Keersmaekers, Wouter Mercelis, Toon Van Hal

This paper seeks to leverage translations of Ancient Greek texts to enhance the performance of automatic word sense disambiguation (WSD). Satisfactory WSD in Ancient Greek is achievable, provided that the system can rely on annotated data. This study, acknowledging the challenges of manually assigning meanings to every Greek lemma, explores strategies to derive WSD data from parallel texts using sentence and word alignment. Our results suggest ...

An ELECTRA Model for Latin Token Tagging Tasks KU Leuven

Wouter Mercelis, Alek Keersmaekers

In Search of the Flocks: How to Perform Onomasiological Queries in an Ancient Greek Corpus? KU Leuven

Alek Keersmaekers, Toon Van Hal

Seeing the Light through the Trees: How Treebanks Can Advance the Education of Classical Languages KU Leuven

Toon Van Hal, Alek Keersmaekers

The GLAUx corpus: methodological issues in designing a long-term, diverse, multilayered corpus of Ancient Greek KU Leuven

Alek Keersmaekers

A Corpus-Based Approach to Conceptual History of Ancient Greek. The Case of βάρβαρος KU Leuven

Alek Keersmaekers, Toon Van Hal

While detecting semasiological or onomasiological change in the past involved annotating hundreds of corpus examples manually, the rise of “distributional” approaches to semantics has enabled researchers to detect semantic change in automated ways. These developments in linguistics are of particular interest for conceptual history, a branch in history that focuses on the evolution of concepts over time. The aim of this paper is to bring ...

A Computational Approach to the Greek Papyri: Developing a Corpus to Study Variation and Change in the Post-Classical Greek Complementation System KU Leuven

Alek Keersmaekers

The aim of this PhD project is to advance the corpus-linguistic study of the Greek papyri, a large diachronic corpus (3rd century BC - 8th century AD) of non-literary Greek. It consists of two central parts. The first part is focused on corpus design: starting from the transcribed (XML) version of these texts, it describes a pipeline model to supply the papyri step for step with linguistic information, using natural language processing (NLP) ...

Automatic Semantic Role Labeling in Ancient Greek Using Distributional Semantic Modeling KU Leuven

Alek Keersmaekers

This paper describes a first attempt to automatic semantic role labeling in Ancient Greek, using a supervised machine learning approach. A Random Forest classifier is trained on a small semantically annotated corpus of Ancient Greek, annotated with a large amount of linguistic features, including form of the construction, morphology, part-of-speech, lemmas, animacy, syntax and distributional vectors of Greek words. These vectors turned out to be ...

Creating a richly annotated corpus of papyrological Greek: the possibilities of Natural Language Processing approaches to a highly inflected historical language KU Leuven

Alek Keersmaekers

This article describes a first attempt to annotate the full Greek papyrus corpus automatically for linguistic information. It gives an overview of existing work on Ancient Greek and analyzes the typical problems one encounters when using natural language processing techniques on (1) a historical corpus of (2) a highly inflectional language (as opposed to the more analytic present-day English) and offers solutions to them, testing several ...

Publications

Applying Distributional Semantic Models to a Historical Corpus of a Highly Inflected Language: the Case of Ancient Greek KU Leuven

Word Sense Disambiguation for Ancient Greek; Sourcing a training corpus through translation alignment KU Leuven

An ELECTRA Model for Latin Token Tagging Tasks KU Leuven

In Search of the Flocks: How to Perform Onomasiological Queries in an Ancient Greek Corpus? KU Leuven

Seeing the Light through the Trees: How Treebanks Can Advance the Education of Classical Languages KU Leuven

The GLAUx corpus: methodological issues in designing a long-term, diverse, multilayered corpus of Ancient Greek KU Leuven

A Corpus-Based Approach to Conceptual History of Ancient Greek. The Case of βάρβαρος KU Leuven

A Computational Approach to the Greek Papyri: Developing a Corpus to Study Variation and Change in the Post-Classical Greek Complementation System KU Leuven

Automatic Semantic Role Labeling in Ancient Greek Using Distributional Semantic Modeling KU Leuven

Creating a richly annotated corpus of papyrological Greek: the possibilities of Natural Language Processing approaches to a highly inflected historical language KU Leuven