
Towards a machine-readable literature: finding relevant papers based on an uploaded powder diffraction pattern

Publication: Contribution to journal/Conference contribution in journal/Contribution to newspaper › Journal article › Research › Peer-reviewed


  • Berrak Ozer, Columbia University
  • Martin A. Karlsen, Syddansk Universitet
  • Zachary Thatcher, Columbia University
  • Ling Lan, Columbia University
  • Brian McMahon, International Union of Crystallography
  • Peter R. Strickland, International Union of Crystallography
  • Simon P. Westrip, International Union of Crystallography
  • Koh S. Sang, International Union of Crystallography
  • David G. Billing, University of the Witwatersrand
  • Dorthe B. Ravnsbaek
  • Simon J. L. Billinge, Columbia University, Brookhaven National Laboratory

A prototype application for machine-readable literature is investigated. The program, called pyDataRecognition, serves as an example of a data-driven literature search in which the search query is an experimental data set provided by the user. The user uploads a powder diffraction pattern together with the radiation wavelength. The program compares the user data to a database of existing powder patterns associated with published papers and ranks the database entries by their similarity score. It returns the digital object identifier and full reference of the top-ranked papers, together with a stack plot of the user data alongside the top five database entries. The paper describes the approach and explores successes and challenges.
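The ranking step described in the abstract can be illustrated with a minimal sketch. This is not the pyDataRecognition code: the abstract does not specify the similarity measure, so the Pearson correlation used here is an assumption, as are the function names, the database layout (a list of dictionaries with `doi`, `q`, and `intensity` keys), and the placeholder identifiers in the usage note. The only fixed fact is the standard conversion from scattering angle to momentum transfer, Q = 4π sin(θ)/λ, which is why the wavelength is needed alongside the pattern.

```python
import numpy as np


def twotheta_to_q(twotheta_deg, wavelength):
    """Convert scattering angle 2-theta (degrees) to Q = 4*pi*sin(theta)/lambda."""
    theta = np.radians(np.asarray(twotheta_deg, dtype=float)) / 2.0
    return 4.0 * np.pi * np.sin(theta) / wavelength


def similarity(q_user, i_user, q_ref, i_ref, n_points=500):
    """Pearson correlation of two patterns on their overlapping Q range.

    Assumption: the real program may use a different score.
    """
    q_min = max(np.min(q_user), np.min(q_ref))
    q_max = min(np.max(q_user), np.max(q_ref))
    if q_min >= q_max:
        return 0.0  # no overlap: treat the patterns as dissimilar
    # Interpolate both patterns onto a shared, evenly spaced Q grid.
    grid = np.linspace(q_min, q_max, n_points)
    a = np.interp(grid, q_user, i_user)
    b = np.interp(grid, q_ref, i_ref)
    return float(np.corrcoef(a, b)[0, 1])


def rank_entries(q_user, i_user, database, top_n=5):
    """Return the top_n (score, doi) pairs, highest score first."""
    scored = [
        (similarity(q_user, i_user, entry["q"], entry["intensity"]), entry["doi"])
        for entry in database
    ]
    return sorted(scored, reverse=True)[:top_n]
```

A call such as `rank_entries(twotheta_to_q(tt, 1.5406), intensity, database)` would return up to five `(score, doi)` pairs for a pattern measured on the angle grid `tt`. Working in Q rather than 2θ is what allows patterns collected at different wavelengths to be compared on a common axis.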

Journal: Acta Crystallographica Section A: Foundations and Advances
Issue: Part 5
Pages (from-to): 386-394
Number of pages: 9
Status: Published - Sep. 2022

Bibliographical note

Publisher Copyright:
© 2022 International Union of Crystallography. All rights reserved.


ID: 282613282