Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning

Martin N. Johansen*, Erik T. Parner, Mikkel F. Kragh, Keiichi Kato, Satoshi Ueno, Stefan Palm, Manuel Kernbach, Başak Balaban, İpek Keleş, Anette V. Gabrielsen, Lea H. Iversen, Jørgen Berntsen

*Corresponding author for this work

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

6 Citations (Scopus)
8 Downloads (Pure)

Abstract

Purpose: This article aims to assess how differences in maternal age distributions between IVF clinics affect the performance of an artificial intelligence model for embryo viability prediction and proposes a method to account for such differences. Methods: Using retrospectively collected data from 4805 fresh and frozen single blastocyst transfers of embryos incubated for 5 to 6 days, the discriminative performance was assessed based on fetal heartbeat outcomes. The data was collected from 4 clinics, and the discrimination was measured in terms of the area under ROC curves (AUC) for each clinic. To account for the different age distributions between clinics, a method for age-standardizing the AUCs was developed in which the clinic-specific AUCs were standardized using weights for each embryo according to the relative frequency of the maternal age in the relevant clinic compared to the age distribution in a common reference population. Results: There was substantial variation in the clinic-specific AUCs with estimates ranging from 0.58 to 0.69 before standardization. The age-standardization of the AUCs reduced the between-clinic variance by 16%. Most notably, three of the clinics had quite similar AUCs after standardization, while the last clinic had a markedly lower AUC both with and without standardization. Conclusion: The method of using age-standardization of the AUCs that is proposed in this article mitigates some of the variability between clinics. This enables a comparison of clinic-specific AUCs where the difference in age distributions is accounted for.

Original languageEnglish
JournalJournal of Assisted Reproduction and Genetics
Volume40
Issue9
Pages (from-to)2129-2137
Number of pages9
ISSN1058-0468
DOIs
Publication statusPublished - Sept 2023

Keywords

  • Artificial intelligence
  • Embryo selection
  • Model performance
  • Time-lapse
  • Fertilization in Vitro
  • Humans
  • Artificial Intelligence
  • Blastocyst
  • Retrospective Studies
  • Time-Lapse Imaging
  • Machine Learning

Fingerprint

Dive into the research topics of 'Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning'. Together they form a unique fingerprint.

Cite this