To map human population distribution, Stevens uses a technique called ‘random forest’. This is a machine learning algorithm that simulates the distribution of a single dependent variable using multiple independent variables. This is done by taking the mapped gridcells of every covariate to create a predicted distribution of the dependent variable, which in Steven’s study is population density.
A machine learning algorithm is a method in which a researcher uses the computing power of a machine to identify trends within a very large set of data. Using these trends, the researcher can do a wide variety of actions from identifying causes and effects to creating entire models using data obtained from regression. In this particular study, Stevens distinguishes his work from classical statistical approaches by creating semi-randomized models of populations that are unique, rather than producing a model that would be the same as long as the same input data was given.
In the context of the modern world, the global community has the resources necessary to aid populations in developing countries but lacks the information to target those in need efficiently. Therefore, information has become an extremely valuable resource. With an accurate description of population distributions, aid can become both transparent and efficient.
In terms of my own study of urbanization within rural China, information such as migration and location of households is incredibly important since both are indicators of both wealth, public infrastructure, and technology. Generally, China is more developed than Nigeria, so studies typically use more targeted methods to identify populations such as cell phone data.