A Surface-Syntactic UD Treebank for Naija
| dc.creator | Caron, Bernard | |
| dc.date.accessioned | 2025-08-28T16:01:10Z | |
| dc.date.issued | 2019 | |
| dc.description.abstract | This paper presents a syntactic treebank for spoken Naija, an English pidgincreole that is rapidly spreading across Nigeria. The syntactic annotation is developed in the Surface-Syntactic Universal Dependency annotation scheme (SUD) (Gerdes et al., 2018) and automatically converted into UD. We present the workflow of the treebank development for this under-resourced language. A crucial step in the syntactic analysis of a spoken language consists of manually adding markup to the transcription that indicates the segmentation into major syntactic units and their internal structure. We show that this so-called "macrosyntactic" markup improves parsing results. We also study some iconic syntactic phenomena that clearly distinguish Naija from English. | |
| dc.identifier.other | halshs-03983518 | |
| dc.identifier.uri | https://hal.science/halshs-03983518 | |
| dc.identifier.uri | https://africarxiv.ubuntunet.net/handle/1/7457 | |
| dc.language.iso | en | |
| dc.subject | African Research | |
| dc.title | A Surface-Syntactic UD Treebank for Naija | |
| dc.type | Academic Publication | |