Taxon selection using statistical learning techniques to improve transfer function prediction

Steve Juggins*, Gavin L. Simpson, Richard J. Telford

*Corresponding author af dette arbejde

Publikation: Bidrag til tidsskrift/Konferencebidrag i tidsskrift /Bidrag til avisTidsskriftartikelForskningpeer review

22 Citationer (Scopus)

Abstract

Transfer functions are widely used in palaeoecology to provide quantitative environmental reconstructions using biological proxies. Most models use all but the rarest taxa present in the training set, even though many may be unrelated to the environmental variable of interest. We hypothesise that retaining such non-informative taxa will reduce model robustness and present a method for variable selection motivated by the statistical learning algorithm in random forests. We apply our species-pruning algorithm into weighted averaging (WA) and maximum likelihood calibration of response curves (MLRCs), and compare results of boosted regression trees (BRTs) using artificial and real datasets. Results from the artificial data show that WA is particularly sensitive to the influence of both non-informative taxa and secondary environmental variables in the training set or fossil assemblage, and that BRTs are relatively immune to these effects. Furthermore, species-pruned WA and MLRCs offer substantial improvements over all-species models when the training set includes non-informative taxa but does not guard against confounding effects when species have bi- or multivariate responses to the primary and one or more secondary variables. Tests with a limited set of examples of real data indicate that BRTs, MLRCs or species-pruned models have no apparent advantage over WA. We discuss possible reasons for this contradiction and suggest that more tests are needed to properly evaluate BRTs and species-pruned models.

OriginalsprogEngelsk
TidsskriftThe Holocene
Vol/bind25
Nummer1
Sider (fra-til)130-136
Antal sider7
ISSN0959-6836
DOI
StatusUdgivet - 16 jan. 2015
Udgivet eksterntJa

Fingeraftryk

Dyk ned i forskningsemnerne om 'Taxon selection using statistical learning techniques to improve transfer function prediction'. Sammen danner de et unikt fingeraftryk.

Citationsformater