Data diving with cross-validation: an investigation of broad-scale gradients in Swedish weed communities

Authors: Hallgren, Erik (1); Palmer, Michael W. (2); Milberg, Per (2)

Source: Journal of Ecology, Volume 87, Number 6, December 1999, pp. 1037-1051(15)

Publisher: Wiley-Blackwell


Abstract:

Summary

1 Multivariate analysis of complex data sets is plagued by problems of subjectivity and of finding statistically valid ways to test a large number of plausible hypotheses. We show how patterns in the data can be identified (data diving) as well as rigorously tested statistically by subdividing the data set.

2 We analysed data on weed biomass and environmental variables from more than 2000 plots in cereal and oil-seed crops in Sweden during 1970-94. Half the data set was used in an exploratory phase while the other half was used in a subsequent confirmatory phase.

3 The exploratory analyses included multivariate statistics [detrended correspondence analysis (DCA) and canonical correspondence analysis (CCA)] with various options and combinations of variables, and led to the formation of hypotheses that were then tested.

4 We tested the hypotheses in a sequential analysis with CCA and Monte Carlo permutation tests: after establishing the influence of one set of environmental variables, this set was covaried out in subsequent analyses. In this way the influence of (i) season of sowing of the crop; (ii) geographical region; (iii) soil type; (iv) crop species; and (v) temporal trends was tested. The latter four were tested separately for spring- and autumn-sown crops.

5 The sowing season of the crop had an overwhelming influence on the weed flora, and many weed species, both annual and perennial, showed strong associations with either autumn or spring. There were significant differences in weed flora composition between the geographical regions and soil types as well as between crop species. There were significant temporal trends only in the weed flora of autumn-sown crops.

6 This study provides a protocol that combines exploratory 'data diving' with strict hypothesis testing using direct gradient analysis methods such as CCA. Such two-phase analysis should improve the way complex data are analysed and patterns are interpreted.
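
The two-phase idea in the summary above (explore one half of the data, then test on the other half with a Monte Carlo permutation test) can be illustrated with a minimal sketch. This is not the authors' analysis: the study used CCA on multivariate weed data, whereas the sketch below substitutes a simple univariate statistic (the absolute difference in mean biomass between two sowing seasons). The plot records, the function names `split_halves` and `permutation_test`, and all numeric values are hypothetical and synthetic, chosen only for illustration.

```python
import random
from statistics import mean


def split_halves(plots, seed=0):
    """Randomly split plots into an exploratory and a confirmatory half."""
    rng = random.Random(seed)
    shuffled = list(plots)
    rng.shuffle(shuffled)
    mid = len(shuffled) // 2
    return shuffled[:mid], shuffled[mid:]


def permutation_test(values, labels, n_perm=999, seed=0):
    """Monte Carlo permutation test for a difference between two group means.

    Univariate stand-in for the CCA-based tests named in the abstract.
    The observed arrangement is counted as one of the permutations,
    so the smallest attainable p-value is 1 / (n_perm + 1).
    """
    rng = random.Random(seed)
    groups = sorted(set(labels))
    if len(groups) != 2:
        raise ValueError("this sketch handles exactly two groups")

    def statistic(lbls):
        a = [v for v, g in zip(values, lbls) if g == groups[0]]
        b = [v for v, g in zip(values, lbls) if g == groups[1]]
        return abs(mean(a) - mean(b))

    observed = statistic(labels)
    permuted = list(labels)
    hits = 1  # the observed arrangement itself
    for _ in range(n_perm):
        rng.shuffle(permuted)  # break any link between label and value
        if statistic(permuted) >= observed:
            hits += 1
    return hits / (n_perm + 1)


# Synthetic data (assumption): 200 plots, autumn-sown plots given higher biomass.
data_rng = random.Random(42)
plots = [
    {"season": season,
     "biomass": data_rng.gauss(10.0 if season == "spring" else 14.0, 2.0)}
    for season in ["spring", "autumn"] * 100
]

# Phase 1: hypotheses would be formed on `explore` only.
# Phase 2: the chosen hypothesis is tested once on the untouched `confirm` half.
explore, confirm = split_halves(plots, seed=1)
p_value = permutation_test(
    [p["biomass"] for p in confirm],
    [p["season"] for p in confirm],
)
print(f"confirmatory p-value: {p_value:.3f}")
```

Because the confirmatory half plays no part in choosing the hypothesis, its p-value is not inflated by the many comparisons tried during exploration, which is the point of the split.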

Keywords: cross-validation; hypothesis testing; multivariate analysis; Sweden; variation partitioning

Document Type: Research article

DOI: http://dx.doi.org/10.1046/j.1365-2745.1999.00413.x

Affiliations: 1: Department of Ecology and Crop Production Science, Swedish University of Agricultural Sciences, Box 7043, S-750 07 Uppsala, Sweden; †Department of Botany, Oklahoma State University, Stillwater, OK 74078, USA; and 2: Department of Biology-IFM, Linköping University, S-581 83 Linköping, Sweden

Publication date: 1999-12-01
