Skip to main content
padlock icon - secure page this page is secure

Assessing among‐lineage variability in phylogenetic imputation of functional trait datasets

Buy Article:

$52.00 + tax (Refund Policy)

Phylogenetic imputation has recently emerged as a potentially powerful tool for predicting missing data in functional traits datasets. As such, understanding the limitations of phylogenetic modelling in predicting trait values is critical if we are to use them in subsequent analyses. Previous studies have focused on the relationship between phylogenetic signal and clade‐level prediction accuracy, yet variability in prediction accuracy among individual tips of phylogenies remains largely unexplored. Here, we used simulations of trait evolution along the branches of phylogenetic trees to show how the accuracy of phylogenetic imputations is influenced by the combined effects of 1) the amount of phylogenetic signal in the traits and 2) the branch length of the tips to be imputed. Specifically, we conducted cross‐validation trials to estimate the variability in prediction accuracy among individual tips on the phylogenies (hereafter ‘tip‐level accuracy’). We found that under a Brownian motion model of evolution (BM, Pagel't λ = 1), tip‐level accuracy rapidly decreased with increasing tip branch‐lengths, and only tips of approximately 10% or less of the total height of the trees showed consistently accurate predictions (i.e. cross‐validation R‐squared >0.75). When phylogenetic signal was weak, the effect of tip branch‐length was reduced, becoming negligible for traits simulated with λ < 0.7, where accuracy was in any case low. Our study shows that variability in prediction accuracy among individual tips of the phylogeny should be considered when evaluating the reliability of phylogenetically imputed trait values. To address this challenge, we describe a Monte Carlo‐based method that allows one to estimate the expected tip‐level accuracy of phylogenetic predictions for continuous traits. Our approach identifies gaps in functional trait datasets for which phylogenetic imputation performs poorly, and will help ecologists to design more efficient trait collection campaigns by focusing resources on lineages whose trait values are more uncertain.
No References
No Citations
No Supplementary Data
No Article Media
No Metrics

Keywords: branch-lengths; missing data; phylogenetic signal

Document Type: Research Article

Publication date: October 1, 2018

  • Access Key
  • Free content
  • Partial Free content
  • New content
  • Open access content
  • Partial Open access content
  • Subscribed content
  • Partial Subscribed content
  • Free trial content
Cookie Policy
X
Cookie Policy
Ingenta Connect website makes use of cookies so as to keep track of data that you have filled in. I am Happy with this Find out more