Department of Economics and Business Economics

ViPAR: A software platform for the Virtual Pooling and Analysis of Research Data

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaper; Journal article; Research; peer-review

  • Kim W Carter, University of Western Australia, Perth, WA, Australia
  • Richard W Francis, University of Western Australia, Perth, WA, Australia
  • M Bresnahan, Columbia University, New York, United States
  • M Gissler, The National Institute for Health and Welfare, Finland
  • T K Grønborg
  • R Gross, Sheba Medical Center, Tel Aviv, Israel
  • N Gunnes, Norwegian Institute of Public Health, Oslo, Norway
  • G Hammond, University of Western Australia, Perth, WA, Australia
  • M Hornig, Columbia University, New York, United States
  • C M Hultman, Karolinska Institute, Stockholm, Sweden
  • J Huttunen, Turku University, Finland
  • A Langridge, University of Western Australia, Perth, WA, Australia
  • H Leonard, University of Western Australia, Perth, WA, Australia
  • S Newman, Dept. of Physics, King's College, United Kingdom
  • E T Parner
  • G Petersson, Karolinska Institutet, Stockholm, Sweden
  • A Reichenberg, Dept. of Physics, King's College, United Kingdom
  • S Sandin, Karolinska Institutet, Stockholm, Sweden
  • Diana Schendel
  • L Schalkwyk, Dept. of Physics, King's College, United Kingdom
  • A Sourander, Turku University and Turku University Hospital, Department of Child Psychiatry, Finland
  • C Steadman, University of Western Australia, Perth, WA, Australia
  • C Stoltenberg, Norwegian Institute of Public Health, Oslo, Norway
  • A Suominen, Turku University and Turku University Hospital, Department of Child Psychiatry, Finland
  • P Surén, Norwegian Institute of Public Health, Norway
  • E Susser, Columbia University, United States
  • A Sylvester Vethanayagam, Denmark
  • Z Yusof, Karolinska Institute, Stockholm, Sweden
  • International Collaboration for Autism Registry Epidemiology

BACKGROUND: Research studies exploring the determinants of disease require sufficient statistical power to detect meaningful effects. Sample size is often increased through centralized pooling of disparately located datasets, though ethical, privacy and data ownership issues can often hamper this process. Methods that facilitate the sharing of research data that are sympathetic with these issues and which allow flexible and detailed statistical analyses are therefore in critical need. We have created a software platform for the Virtual Pooling and Analysis of Research data (ViPAR), which employs free and open source methods to provide researchers with a web-based platform to analyse datasets housed in disparate locations.

METHODS: Database federation permits controlled access to remotely located datasets from a central location. The Secure Shell protocol allows data to be securely exchanged between devices over an insecure network. ViPAR combines these free technologies into a solution that facilitates 'virtual pooling' where data can be temporarily pooled into computer memory and made available for analysis without the need for permanent central storage.

RESULTS: Within the ViPAR infrastructure, remote sites manage their own harmonized research dataset in a database hosted at their site, while a central server hosts the data federation component and a secure analysis portal. When an analysis is initiated, requested data are retrieved from each remote site and virtually pooled at the central site. The data are then analysed by statistical software and, on completion, results of the analysis are returned to the user and the virtually pooled data are removed from memory.
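The retrieve, pool, analyse and discard cycle described above can be illustrated with a minimal Python sketch. This is not ViPAR's implementation: in-memory SQLite databases stand in for the remote site databases, no Secure Shell transport or database federation layer is involved, and the table name `cohort`, the column names, and the functions `make_site`, `fetch` and `virtual_pool_analysis` are all hypothetical names chosen for illustration.

```python
import sqlite3
import statistics
from contextlib import closing

# Hypothetical harmonized schema shared by every site (an assumption).
ALLOWED_COLUMNS = {"birth_year", "birth_weight_g"}


def make_site(rows):
    """Create an in-memory database standing in for one remote site."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE cohort (birth_year INTEGER, birth_weight_g REAL)")
    conn.executemany("INSERT INTO cohort VALUES (?, ?)", rows)
    conn.commit()
    return conn


def fetch(conn, columns):
    """Retrieve only the requested, whitelisted columns from one site."""
    unknown = set(columns) - ALLOWED_COLUMNS
    if unknown:
        raise ValueError(f"columns not in harmonized schema: {sorted(unknown)}")
    query = "SELECT {} FROM cohort".format(", ".join(columns))
    with closing(conn.cursor()) as cur:
        cur.execute(query)
        return cur.fetchall()


def virtual_pool_analysis(sites, columns, analyse):
    """Pool rows from all sites in memory, analyse them, then discard them."""
    pooled = []
    try:
        for conn in sites.values():
            pooled.extend(fetch(conn, columns))
        # Only the analysis result leaves this function, not the pooled rows.
        return analyse(pooled)
    finally:
        # Remove the pooled data from memory once the analysis has finished.
        pooled.clear()


def mean_birth_weight(rows):
    """Example analysis: sample size and mean of the first requested column."""
    weights = [row[0] for row in rows]
    return {"n": len(weights), "mean_birth_weight_g": statistics.mean(weights)}
```

In this sketch the pooled rows exist only for the duration of one call to `virtual_pool_analysis`, and the caller receives just the summary returned by the analysis function, which mirrors the idea of pooling without permanent central storage.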

CONCLUSIONS: ViPAR is a secure, flexible and powerful analysis platform built on open source technology that is currently in use by large international consortia, and is made publicly available at [http://bioinformatics.childhealthresearch.org.au/software/vipar/].

Original language: English
Journal: International Journal of Epidemiology
Volume: 45
Issue: 2
Pages (from-to): 408-416
Number of pages: 9
ISSN: 0300-5771
DOIs
Publication status: Published - 2016

    Research areas

  • ViPAR, Data sharing, Data federation, Data pooling


ID: 93798118