香港大学Guy J. Abel课题组提出了深度学习四十年的人类迁徙。相关论文于2026年6月10日发表于国际顶尖学术期刊《自然》杂志上。
在这里,研究团队提出了一个新的数据集,涵盖了从1990年到现在230个国家和地区的年度始发目的地移民,将不同的数据特征整合到一个统一的建模框架中。通过结合官方统计、基于人口普查的种群、净移徙估计数和过去的流量重建,他们的方法产生了时间上详细和空间上全面的估计数,大大扩展了现有资源。课题组研究人员使用由地理、经济、文化和政治协变量组成的深度循环神经网络集合,捕捉持续趋势和对变化条件的短期反应——所有这些都是在传播不确定性以产生信心界限的同时进行的。
他们的结果优于现有的基于滞留数据的五年流量估计,并提供了更精细的时间分辨率,揭示了以前模糊的全球迁移模式动态。该框架突出了不确定性仍然很高、最迫切需要收集数据的区域。通过发布所有数据、代码和经过训练的模型,研究小组为未来的工作提供了透明和可复制的基础。这些进展使人们能够更及时、更详细地了解人类的流动性,这对日益动态的全球系统中的研究和政策具有重要意义。
据介绍,人口迁移是全球人口变化的根本驱动力,影响着各国的人口结构、劳动力市场和社会政策。虽然长期的移徙模式往往与经济发展有关,但它们也可能因冲突、环境危机和政治变化等冲击而迅速改变。尽管移徙很重要,但仍然难以一致地衡量:现有数据稀疏,集中在高收入环境中,并且在不兼容的定义、时间分辨率和数据类型中分散。过去的工作依赖于部分数据集,包括流量记录、存量估计和基于模型的重建,覆盖范围有限。因此,一个核心挑战是建立一个全球一致的、高分辨率的长期移民流动记录。
附:英文原文
Title: Deep learning four decades of human migration
Author: Gaskin, Thomas, Abel, Guy J.
Issue&Volume: 2026-06-10
Abstract: Human migration is a fundamental driver of global demographic change, shaping population structure, labour markets and social policy across countries1,2,3. Although long-term migration patterns are often linked to economic development4, they can shift rapidly in response to shocks such as conflict, environmental crises and political change5. Despite its importance, migration remains difficult to measure consistently: existing data are sparse, concentrated in high-income settings and are fragmented across incompatible definitions, temporal resolutions and data types6,7,8. Past efforts have relied on partial datasets, including flow records, stock estimates and model-based reconstructions with limited coverage9,10,11,12,13,14. A central challenge is therefore to construct a globally consistent, high-resolution account of migration flows over time. Here we present a new dataset of annual origin-destination migration across 230 countries and regions from 1990 to the present, integrating diverse data sources into a unified modelling framework. By combining official statistics, census-based stocks, net migration estimates and past flow reconstructions, our approach produces temporally detailed and spatially comprehensive estimates that substantially extend existing resources. Using an ensemble of deep recurrent neural networks informed by geographic, economic, cultural and political covariates, we capture both persistent trends and short-term responses to changing conditions—all while propagating uncertainty to generate confidence bounds. Our results outperform existing five-year flow estimates on held-out data and provide finer temporal resolution, revealing previously obscured dynamics in global migration patterns. This framework highlights regions in which uncertainty remains high and data collection is most urgently needed. By releasing all data, code and trained models, we provide a transparent and reproducible foundation for future work. These advances enable a more timely and detailed understanding of human mobility, with implications for research and policy in an increasingly dynamic global system.
DOI: 10.1038/s41586-026-10611-7
Source: https://www.nature.com/articles/s41586-026-10611-7
Nature:《自然》,创刊于1869年。隶属于施普林格·自然出版集团,最新IF:69.504
官方网址:http://www.nature.com/
投稿链接:http://www.nature.com/authors/submit_manuscript.html
