Institut for Statskundskab

Collaborating with the Machines: A hybrid method for classifying policy documents

Publikation: Bidrag til tidsskrift/Konferencebidrag i tidsskrift /Bidrag til avisTidsskriftartikelForskningpeer review



Burgeoning government activity has enabled an important and growing research program into the content and determinants of policy agendas around the world. However, tightening research budgets and the vast scale of available information force political science to aspire to do more with less. Meeting this challenge requires innovation in managing and preparing data. This paper makes two contributions to the practice of data coding to measure the content of political agendas. First, we propose a method of combining human content coding of political agendas and automated computer classification to classify large data sets. Second, we present software and supporting tools to apply a well-known algorithm for automated text classification, the Naı̈ve Bayes classifier. We demonstrate its usefulness for coding large sets of highly unbalanced multiclass data of the sort used to study the political agenda and demonstrate how our hybrid approach can maximize the returns on research budgets.
TidsskriftPolicy Studies Journal
Sider (fra-til)184-206
Antal sider23
StatusUdgivet - 2020

Se relationer på Aarhus Universitet Citationsformater


Ingen data tilgængelig

ID: 117659395