Overview

This study uses human mobility networks and dengue case data (2003-2024) from Thailand to test whether mobility-based dengue importation could help understand dengue transmission. The study covered multiple spatial scales:

  • Province level: 77 provinces, 20,328 observations (monthly, 2003-2024)
  • Bangkok District level: 50 districts, 8,600 observations (monthly, 2008-2022)
  • Bangkok Subdistrict level: 161 subdistricts, 27,692 observations (monthly); 47,700 observations (weekly, 2020-2024)

We compared several mobility data sources: mobile phone call detail records (CDR), classical gravity models, Deep Gravity neural networks and gravity flow models (constructed from the transportation network), and the raw transport network itself.

Importation Risk Index

The importation risk index estimates the likelihood that infected travellers will arrive at a target location from other connected locations, based on two factors: the number of cases in connected locations, and the volume of travel from those locations. Following standard metapopulation models, we calculate the importation risk index at an area i in time t as:

Ii,t=jiWjix Cj,tI_{i,t}=\sum_{j\neq i}W_{ji} x\ C_{j,t}

where Ii,t represents the importation risk index at location i at time t, Wji is the probability of travel from location j to location i (derived from the mobility matrix), and Cj,tC_{j,t} is the number of dengue cases in origin location j at time t.

A location receives high importation risk when it has strong travel connections to places currently experiencing high dengue burden.

Example importation risk calculation for Saraburi province, July 2023. (a) Conceptual diagram showing importation risk accumulation. (b) Geographic sources of importation risk.

Each source contributes its case burden weighted by the share of travel it sends to Saraburi. Chiang Rai (2,550 cases, 0.04% travel share) contributes only 1.0, while Tak (551 cases, 0.59% share) contributes 3.3, illustrating that network connectivity matters as much as outbreak intensity. Provinces are coloured by July 2023 case burden; arrow colour and thickness indicate the magnitude of contribution. In the following sections, we examine whether this importation risk measure predicts dengue incidence in subsequent months at the receiving location.

Mobility Data Sources

We tested several approaches to construct the mobility weight matrix. The weight matrix W is row-stochastic:

Wij=TijkTikW_{ij} = \frac{T_{ij}}{\sum_{k} T_{ik}}

This ensures that jWij=1\sum_{j} W_{ij} = 1 for all areas.

The following transportation networks (T in the formula) were used to construct different mobility weighted matrices: 

Raw Transport

We directly aggregated observed transport flows from bus (8,036 routes), ferry (177 routes), train (386 routes), and flight (434 routes) data. Traffic volumes between provinces were computed by summing all district-to-district flows.

Call Detail Records (CDR)

Mobile phone call detail records from Kiang et al. (2021) capture actual human movement patterns from roughly 11 million subscribers across Thailand (August-October 2017). The CDR-derived mobility matrix provides a 76 by 76 province-level row-stochastic matrix with strong home bias (diagonal mean of 0.866).

Classical Gravity Model

This model predicts flows based solely on population and distance:

Tijgravity=κNiαNjβdijγT_{ij}^{gravity} = \kappa \frac{N_{i}^{\alpha} N_{j}^{\beta}}{d_{ij}^{\gamma}}

Here, NiN_i andNjN_j are the origin and destination populations, dijd_{ij} is the geographic distance, and we use standard parameters (α\alpha=β\beta=1,γ\gamma =2 for the inverse square law). This provides a baseline that requires no transport data.

Gravity Flow Model (Mobility-Weighted)

This approach incorporates observed transport connectivity as weights in the gravity formulation:

Tijgravity=κNiαNjβdijγT_{ij}^{gravity} = \kappa \frac{N_{i}^{\alpha} N_{j}^{\beta}}{d_{ij}^{\gamma}}

whereωij\omega_{ij} is the transport weight derived from observed flows. Destination choice probabilities are row-normalised, then home bias is incorporated:

Pij=(1ϕi)δij+ϕiT^ijϕi[0,1]P_{ij} = (1 – \phi_{i})\delta_{ij} + \phi_{i} \hat{T}_{ij} \phi_{i} \in [0, 1]

where ϕi[0,1]\phi_{i} \in [0,1] is the mean fraction of time residents spend away from home, and ij is the Kronecker delta.

Deep Gravity Model

This neural network-based approach follows Simini et al. (2021). The architecture comprises 15 hidden layers (6 layers of 256 units plus 9 layers of 128 units) with LeakyReLU activation and per-origin softmax for probability output. Input features (39 per origin-destination pair) include population totals from the Department of Provincial Administration (DOPA), socioeconomic indicators such as Gross Provincial Product (GPP) per capita, poverty rate, and Gini coefficient, network centrality measures (PageRank and Eigenvector centrality), and geographic information including coordinates, regional classification, and inter-province distances.

Model Specification

We fitted negative binomial generalised linear models with fixed effects.

Province Model (Lag-1 only)

log(E[Ci,t])=β0+β1log(Ci,t1+1)+β2log(Ii,t1+1)+FElog(E[C_{i,t}]) = \beta_{0} + \beta_{1}log(C_{i,t-1} + 1) + \beta_{2}log(I_{i,t-1} + 1) + FE

Here, Ci,t1C_{i,t-1}captures temporal autocorrelation, Ii,t1I_{i,t-1} is the importation risk at lag-1, and FE includes unit and time fixed effects.

Bangkok Monthly Model (Lags 1, 2, 3)

For the Bangkok district and subdistrict analyses:

log(E[Ci,t])=β0+β1log(Ci,t1+1)+k=13βk+1log(Ii,tk+1)+FElog(E[C_{i,t}]) = \beta_{0} + \beta_{1}log(C_{i,t-1} + 1) + \sum_{k=1}^{3} \beta_{k+1}log(I_{i,t-k} + 1) + FE

Bangkok Weekly Model (Lags 2, 3, 4 weeks)

Weekly resolution uses biologically appropriate lags that match dengue generation time (2-4 weeks):

log(E[Ci,t])=β0+β1log(Ci,t1+1)+k=24βklog(Ii,tk+1)+FElog(E[C_{i,t}]) = \beta_{0} + \beta_{1}log(C_{i,t-1} + 1) + \sum_{k=2}^{4} \beta_{k}log(I_{i,t-k} + 1) + FE

These coefficients are elasticities. A coefficient of 0.45 means that a 10% increase in importation risk is associated with a 4.5% increase in expected dengue cases.

Interpreting Autocorrelation Reduction

High dengue cases this month, following high cases last month, could reflect either local persistence (ongoing transmission from resident mosquitoes and infected people) or continuous importation (infected travellers arriving from connected outbreak areas). A baseline model without importation risk conflates both effects in the autocorrelation coefficient.

When adding importation risk reduces autocorrelation, it suggests that some apparent “persistence” may have been network-driven importation. Three findings support this interpretation: (1) COVID-19 travel restrictions were associated with a 12% reduction in inter-province importation risk effects, consistent with mobility influencing spread; (2) continued intra-city travel during COVID was associated with 23-71% higher district and subdistrict effects, tracking actual mobility changes; (3) CDR outperforms gravity models at province level while classical gravity works better at subdistrict level.

A 49% autocorrelation reduction means that nearly half of the apparent “local persistence” may have been continuous re-seeding from connected outbreak areas.

Results

Thailand Province Level

Data: 77 provinces, January 2003 to December 2024, 20,328 observations

Province-Level Panel Regression: Mobility Source Comparison

Mobility SourceAICβ\beta(IR Lag-1)AutocorrChange
Baseline (no IR)183,7560.799
CDR (Mobile Phone)181,5120.452***0.410-49%
Classical Gravity183,3920.226***0.662-17%
Gravity Flow (Combined)183,5380.201***0.670-16%
Deep Gravity183,5150.186***0.729-9%
Raw Transport183,6440.125***0.765-4%
**p<0.001

CDR mobility provides the strongest epidemiological signal, with a 2,244-point AIC improvement over baseline. Raw transport network is the strongest at the district level, and the classical gravity model serves as the best proxy when CDR data are unavailable, marginally outperforming both transport-weighted gravity flow (AIC improvement of 218) and neural network-based deep gravity (AIC improvement of 241). When CDR-based importation risk is added, the autocorrelation coefficient drops from 0.799 to 0.410 (a 49% reduction), indicating that CDR captures variance previously absorbed by temporal persistence.

Thailand-Province level regression coefficients comparing five mobility data sources. CDR provides the strongest signal (0.452), followed by classical gravity (0.226), gravity flow (0.201), deep gravity (0.186), and raw transport (0.125). Error bars show 95% confidence intervals. All coefficients are significant at p<0.001.

Bangkok District Level (Monthly)

Data: 50 districts, 2008-2022, 8,600 observations

Bangkok-District: Mobility Source Comparison (3-Lag Model)

Mobility SourceAICβ\beta(Lag-1)β\beta(Lag-2)β\beta(Lag-3)Autocorr
Raw Transit59,8130.611***0.118**-0.0380.260
Gravity Flow59,8230.470***0.060-0.0340.449
Classical Gravity59,8290.483***0.045-0.0250.456
**p<0.001, **p<0.01; N = 8,450

Raw transit schedules capture commuter movements well at the district level. The primary effect occurs at lag-1 (coefficient of 0.611), with a smaller secondary effect at lag-2 (0.118). The lag-3 coefficient is not statistically significant, indicating effect saturation in the monthly data.

Bangkok Subdistrict Level (Monthly)

Data: 161 subdistricts, 2008-2022, 27,692 observations

Bangkok-Subdistrict: Mobility Source Comparison (3-Lag Model)

Mobility SourceAICβ\beta(Lag-1)β\beta(Lag-2)β\beta(Lag-3)Autocorr
Classical Gravity116,8960.730***0.055***-0.069*0.399
Gravity Flow117,0450.640***0.063**-0.060*0.436
Raw Transport117,1940.642***0.125***-0.060*0.369
**p<0.001, **p<0.01, *p<0.05; N = 26,871

At the subdistrict level, simple population times distance predicts movement (classical gravity) better than transit schedules. The primary effect is at lag-1 (0.730 for classical gravity), with a secondary effect at lag-2 (0.055). The negligible negative lag-3 coefficient (-0.069) reflects effect saturation in the monthly data.

COVID-19 as a Natural Experiment

The COVID-19 pandemic provided a natural experiment to test the importance of the mobility networks in explaining transmission. During this period, mobility restrictions reduced travel, allowing us to test whether network effects changed accordingly.

COVID Period Comparison: Thailand-Province vs Bangkok-District/Subdistrict (Single-Lag Models)

ScalePeriodAutocorrIR Lag-1Change
Thailand-ProvincePre-COVID0.384+0.452
COVID0.554+0.398-12%
Post-COVID0.494+0.413-9%
Bangkok-DistrictPre-COVID0.236+0.568
COVID0.144+0.970+71%
Post-COVID0.200+0.700+23%
Bangkok-SubdistrictPre-COVID0.401+0.723
COVID0.298+0.891+23%
Post-COVID0.352+0.780+8%

Temporal period analysis showing COVID-19 effects on regression coefficients. The importation risk effect decreased during COVID, while autocorrelation increased, consistent with reduced inter-province travel.

COVID travel restrictions were associated with a 12% reduction in inter-province network effects and a 23-71% increase in intra-city effects. This pattern is consistent with mobility-associated transmission operating through distinct mechanisms at different spatial scales.

Weekly Validation (Bangkok, 2020-2024)

Although higher-resolution weekly dengue data were only available for a shorter period (2020-2024), we used this data to validate the importation risk findings at lags matching dengue transmission patterns (2-4 weeks from importation).

Weekly Combined Multi-Lag Model (Bangkok District, 2020-2024)

VariableCoefficientSEp-valueMeaning
Autocorrelation0.335***0.012<0.001Local persistence
Importation Risk (2-week)0.553***0.031<0.001Primary transmission
Importation Risk (3-week)0.297***0.029<0.001Secondary cases
Importation Risk (4-week)0.242***0.028<0.001Tertiary cases
N = 13,050; AIC = 46,747; Autocorrelation reduction = -62%

Weekly Combined Multi-Lag Model (Bangkok Subdistrict, 2020-2024)

VariableCoefficientSEp-valueMeaning
Autocorrelation0.461***0.008<0.001Local persistence
Importation Risk (2-week)0.731***0.029<0.001Primary transmission
Importation Risk (3-week)0.377***0.027<0.001Secondary cases
Importation Risk (4-week)0.242***0.025<0.001Tertiary cases
N = 41,022; AIC = 75,760; Autocorrelation reduction = -51%

Weekly resolution with 2-4 week lags shows all lags positive and significant. The negligible monthly lag-3 negation effect reflects the three-month span exceeding typical dengue generation time, whilst weekly lag-4 (one month) remains within biological relevance. The subdistrict model shows even stronger effects than the district level (0.731 versus 0.553 at the two-week lag).

Spike Propagation

We defined spikes as months where cases exceeded 20% above the seasonal baseline:

Spikei,t=1 if Ci,tCi,mCi,m+10.20, otherwise 0Spike_{i,t} = 1 \text{ if } \frac{C_{i,t} – C_{i,m}^{-}}{C_{i,m}^{-} + 1} \geq 0.20, \text{ otherwise } 0

Spike pressure measures network exposure to connected regions with dengue spikes:

Si,t=jiWji×Spikej,t1S_{i,t} = \sum_{j \neq i} W_{ji} \times Spike_{j,t-1}

Spike Probability by Network Exposure Across Scales

ScaleLow ExposureHigh ExposureRRp-value
Thailand-Province22.6%39.6%1.75<0.0001
Bangkok-District23.9%51.5%2.16<0.0001
Bangkok-Subdistrict27.6%46.9%1.70<0.0001

Areas with high network exposure are roughly twice as likely to experience outbreak spikes (relative risk of 1.70 to 2.16 across scales).

Ideal Data Source per Scale

Optimal Mobility Data Source by Spatial Scale

ScaleUnitsBest Mobility SourceReason
Thailand-Province77CDR (mobile phone)Captures actual long-distance travel patterns
Bangkok-District50Raw Transit schedulesCommuter routes (bus/metro) dominate
Bangkok-Subdistrict161Classical GravityLocal movement follows population and distance
  1. Summary

Summary of key findings across spatial scales: Thailand-Province (77 units), Bangkok-District (50 units), and Bangkok-Subdistrict (161 units). (A) Network effect strength increases at finer scales, with different optimal mobility sources. (B) All scales show substantial autocorrelation reduction when importation risk is included. (C) Spike probability comparison between low and high network exposure, with relative risk shown. (D) COVID-19 effects on importation risk coefficients varied by scale.

Summary Comparison Across All Spatial Scales

MetricThailand-ProvinceBangkok-DistrictBangkok-Subdistrict
Units7750161
Best mobilityCDRRaw TransitGravity
Coefficient (Lag-1)0.4520.6840.723
Autocorr change-49%-67%-46%
COVID effect-12%+71%+23%
Spike RR1.752.161.70

Importance of Mobility Networks

These findings translate into practical tools for dengue surveillance and response.

Provinces with high exposure to upstream outbreaks are 75% more likely to experience their own spike within a month. The 2-4 week lag structure provides an early warning window: when one province spikes, health officials can alert connected neighbours before local cases appear.

Transport connectivity matters more than geographic distance. A province with heavy bus traffic to an outbreak area faces an elevated risk even if geographically distant. Weighting cases by mobility connections captures this better than distance-based models.

COVID provided a natural experiment: inter-province travel restrictions reduced network transmission effects by 12%, while continued intra-city travel increased local spread. This suggests inter-city travel advisories may be more effective than within-city measures for controlling provincial outbreaks.

Methods Note

All analyses used negative binomial panel regression with location and time fixed effects. Coefficients represent elasticities (log-log specification). Statistical significance levels: p<0.001, p<0.01, p<0.05.

Abbreviations: AIC = Akaike Information Criterion (lower is better); SE = Standard Error; RR = Relative Risk; IR = Importation Risk.

CDR Data Source: Kiang et al. (2021), roughly 11 million mobile phone subscribers, August-October 2017.

Analysis period: 2003-2024.

Scroll to Top