Aarhus University Seal / Aarhus Universitets segl

Multilingual Sentiment Normalization for Scandinavian Languages

Publikation: Bidrag til tidsskrift/Konferencebidrag i tidsskrift /Bidrag til avisTidsskriftartikelForskningpeer review

In this paper, we address the challenge of multilingual sentiment analysis using a traditional lexicon and rule-based sentiment instrument that is tailored to capture sentiment patterns in a particular language. Focusing on a case study of three closely related Scandinavian languages (Danish, Norwegian, and Swedish) and using three tailored versions of VADER, we measure the relative degree of variation in valence using the OPUS corpus. We found that scores for Swedish are systematically skewed lower than Danish for translational pairs, and that scores for Norwegian are skewed higher for both other languages. We use a neural network to optimize the fit between Norwegian and Swedish respectively and Danish as the reference (target) language.
OriginalsprogEngelsk
TidsskriftScandinavian Studies in Language
Vol/bind12
Nummer1
Sider (fra-til)50-64
Antal sider15
ISSN1904-7843
DOI
StatusUdgivet - 31 dec. 2021

Se relationer på Aarhus Universitet Citationsformater

ID: 229829041