A comparison of machine learning approaches for identifying high-poverty counties: robust features of DMSP/OLS night-time light imagery
The goal of the present study is to demonstrate that high-poverty counties and robust classification features can be identified by machine learning approaches using only DMSP/OLS night-time light imagery. To accomplish this goal, a total of 96 high-poverty and 96 non-poverty counties were classified using 15 statistical and spatial features extracted from night-time light imagery in China in 2010 formed a training set for identifying high-poverty counties. Seven machine learning approaches were adopted to classify high-poverty counties, and five feature importance measures were used to select robust features. The resulting metrics, including the user’s (>63%), producer’s (>66%) and overall (>82%) accuracies of the poor county identification (probability of poverty greater than 0.6), show that the seven machine learning approaches used in this paper exhibit good performance, although some differences exist among the approaches. The order of feature importance reveals that the relative importance of each feature differs among the models; however, the important features remain consistent. The nine most important features ranked in each approach are relatively robust for poverty identification at the county level. Both spatial feature and statistical features calculated in part from the central tendency, degree of dispersion, and the distribution of the night-time light data were identified as indispensable robust features in all the approaches, indicating that the complex social phenomenon of poverty requires analysis from different aspects. Previous studies that utilized primarily night-time light imagery applied single features related to the central tendency or the distribution features of the imagery; this study provides a new method and can act as a reference for feature selection and identification of high-poverty counties using night-time light imagery and has potential applications across several scientific domains.
No Reference information available - sign in for access.
No Citation information available - sign in for access.
No Supplementary Data.
No Article Media
Document Type: Research Article
Affiliations: 1: School of Resource and Environmental Science, Wuhan University, Wuhan, China 2: The Second Survey and Mapping Institute of Guizhou Province, Guiyang, China
Publication date: August 3, 2019