A statistical framework to model the meeting-in-the-middle principle using metabolomic data: application to hepatocellular carcinoma in the EPIC study

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

  • Nada Assi, France
  • Anne Fages, France
  • Paolo Vineis, United Kingdom
  • Marc Chadeau-Hyam, United Kingdom
  • Magdalena Stepien, France
  • Talita Duarte-Salles, France
  • Graham Byrnes, France
  • Houda Boumaza, France
  • Sven Knüppel, Germany
  • Tilman Kühn, Germany
  • Domenico Palli, Italy
  • Christina Bamia, Greece
  • Hendriek Boshuizen, Netherlands
  • Catalina Bonet, Spain
  • Kim Overvad
  • Mattias Johansson, France
  • Ruth Travis, United Kingdom
  • Marc J Gunter, United Kingdom
  • Eiliv Lund, Norway
  • Laure Dossus, France
  • Bénédicte Elena-Herrmann, France
  • Elio Riboli, United Kingdom
  • Mazda Jenab, France
  • Vivian Viallon, France
  • Pietro Ferrari, France

Metabolomics is a potentially powerful tool for identification of biomarkers associated with lifestyle exposures and risk of various diseases. This is the rationale of the 'meeting-in-the-middle' concept, for which an analytical framework was developed in this study. In a nested case-control study on hepatocellular carcinoma (HCC) within the European Prospective Investigation into Cancer and nutrition (EPIC), serum (1)H nuclear magnetic resonance (NMR) spectra (800 MHz) were acquired for 114 cases and 222 matched controls. Through partial least square (PLS) analysis, 21 lifestyle variables (the 'predictors', including information on diet, anthropometry and clinical characteristics) were linked to a set of 285 metabolic variables (the 'responses'). The three resulting scores were related to HCC risk by means of conditional logistic regressions. The first PLS factor was not associated with HCC risk. The second PLS metabolomic factor was positively associated with tyrosine and glucose, and was related to a significantly increased HCC risk with OR = 1.11 (95% CI: 1.02, 1.22, P = 0.02) for a 1SD change in the responses score, and a similar association was found for the corresponding lifestyle component of the factor. The third PLS lifestyle factor was associated with lifetime alcohol consumption, hepatitis and smoking, and had negative loadings on vegetables intake. Its metabolomic counterpart displayed positive loadings on ethanol, glutamate and phenylalanine. These factors were positively and statistically significantly associated with HCC risk, with 1.37 (1.05, 1.79, P = 0.02) and 1.22 (1.04, 1.44, P = 0.01), respectively. Evidence of mediation was found in both the second and third PLS factors, where the metabolomic signals mediated the relation between the lifestyle component and HCC outcome. This study devised a way to bridge lifestyle variables to HCC risk through NMR metabolomics data. This implementation of the 'meeting-in-the-middle' approach finds natural applications in settings characterised by high-dimensional data, increasingly frequent in the omics generation.

Original languageEnglish
Pages (from-to)743-53
Number of pages11
Publication statusPublished - Nov 2015

See relations at Aarhus University Citationformats

ID: 90340117