Collaborating with the Machines: A hybrid method for classifying policy documents

Matthew Loftis, Peter Bjerre Mortensen

Publikation: Bidrag til tidsskrift/Konferencebidrag i tidsskrift /Bidrag til avisTidsskriftartikelForskningpeer review

416 Downloads (Pure)


Burgeoning government activity has enabled an important and growing research program into the content and determinants of policy agendas around the world. However, tightening research budgets and the vast scale of available information force political science to aspire to do more with less. Meeting this challenge requires innovation in managing and preparing data. This paper makes two contributions to the practice of data coding to measure the content of political agendas. First, we propose a method of combining human content coding of political agendas and automated computer classification to classify large data sets. Second, we present software and supporting tools to apply a well-known algorithm for automated text classification, the Naı̈ve Bayes classifier. We demonstrate its usefulness for coding large sets of highly unbalanced multiclass data of the sort used to study the political agenda and demonstrate how our hybrid approach can maximize the returns on research budgets.
TidsskriftPolicy Studies Journal
Sider (fra-til)184-206
Antal sider23
StatusUdgivet - 2020


Dyk ned i forskningsemnerne om 'Collaborating with the Machines: A hybrid method for classifying policy documents'. Sammen danner de et unikt fingeraftryk.