DaCy: A Unified Framework for Danish Natural Language Processing

Project: Research

Project Details

Description

DaCy is a Danish text processing pipeline built on spaCy. At the time of writing, it has achieved state-of-the-art performance on part-of-speech (POS) tagging, named-entity recognition (NER) and dependency parsing for Danish. Furthermore, it integrates state-of-the-art Danish resources into one unified and extensible framework, giving researchers and industry easy access to these resources.

Layman's description

DaCy is a tool for processing free-form text such as news, online written text, historical Danish and similar material. This processing includes, among other things, automatic extraction of the names of individuals, organisations and locations; detection of tone, subjectivity and emotional content in text; and syntactic analysis of the text.
Short title: DaCy
Acronym: DaCy
Status: Active
Effective start/end date: 01/02/2021 → …

Keywords

  • Natural Language Processing
  • Danish Language Processing
