Abstract
Danish is a North Germanic (Scandinavian) language spoken primarily in Denmark, a country with a tradition of technological and scientific innovation. From a technological perspective, however, the Danish language has received relatively little attention. As a result, Danish language technology is hard to develop, in part because large or broad-coverage Danish corpora are lacking. This paper describes the Danish Gigaword project, which aims to construct a freely available one-billion-word corpus of Danish text that represents the breadth of the written language.
Original language | English |
---|---|
Publisher | ArXiv |
Number of pages | 6 |
Publication status | Published - May 2020 |
Fingerprint
Dive into the research topics of 'The Danish Gigaword Project'. Together they form a unique fingerprint.
Projects
The Puzzle of Danish
Christiansen, M. H. (Project coordinator), Tylén, K. (Participant), Fusaroli, R. (Participant), Bleses, D. (Participant), Højen, A. (Participant), Trecca, F. (Participant), Dideriksen, C. (Participant) & Ishkhanyan, B. (Participant)
Independent Research Fund Denmark
01/09/2017 → 31/08/2020
Project: Research