Overview
DESERTAS is an open-source, multi-parameter framework for the systematic quantification of geogenic gas emissions from rock fissures in hyperarid environments. Its output is the Desert Rock-Gas Intelligence Score (DRGIS). The system integrates eight orthogonal geophysical indicators validated across 2,491 Desert Rock-Gas Units (DRGUs) from 36 monitoring stations spanning 7 craton systems over a 22-year observational period (2004–2026).
DESERTAS achieves 90.6% DRGIS classification accuracy with a 93.1% pre-seismic detection rate. The framework provides a 58-day mean lead time for M ≥ 4.0 events and quantifies mantle connectivity via He_ratio with ±800 m depth precision.
Key Capabilities
- High Accuracy: 90.6% DRGIS classification across 36-station cross-validation
- Pre-seismic Detection: 93.1% detection rate with 5.4% false alert rate
- Early Warning: 58-day mean lead time before M ≥ 4.0 events (max 134 days)
- Thermal Amplification: ΔΦ_th quantifies 3–8× signal amplification in desert environments
- Source Depth: He_ratio discriminates mantle vs. crustal sources with ±800 m precision
- Dust Transport: β_dust detects geogenic signatures up to 340 km downwind
- Energy Dissipation: S_yield shows 0.4–0.7 magnitude suppression at high-venting sites
System Statistics
- DRGIS Accuracy: 90.6% (36-station cross-validation)
- Dataset: 2,491 Desert Rock-Gas Units
- Stations/Cratons: 36 stations across 7 global craton systems
- Lead Time: 58-day mean before M ≥ 4.0 events
Installation
System Requirements
- Python: 3.8 or higher (3.10+ recommended)
- PyTorch: 1.10+ optional (for AI ensemble)
- RAM: 8 GB minimum (16 GB recommended for full dataset)
- Storage: ~1.5 GB for reference database and craton thresholds; 32 GB for full validation dataset
- CUDA: 11.0+ optional (GPU acceleration for LSTM/CNN models)
- Radon detectors: Durridge RAD7 for field measurements (optional)
Install from PyPI
Install from GitLab
Docker
The full validation dataset (32 GB) includes raw radon time series, helium mass spectrometry data, and seismic catalogs. The 1.5 GB reference package is sufficient for most applications.
Quick Start
Compute a Single Parameter
Assemble the DRGIS Index
Classify and Generate Alert
1 · Compute Rn_pulse from Radon Data
2 · Compute the DRGIS Composite Index
3 · Seismic Precursor Detection
The Eight DRGIS Parameters
Each parameter captures a physically orthogonal dimension of desert gas emissions and tectonic activity. Weights were determined through a three-stage Bayesian analysis and Delphi consensus with 22 geochemists and seismologists across 14 institutions.
| # | Symbol | Parameter | Weight | Domain | Key Instrument |
|---|---|---|---|---|---|
| 1 | ΔΦ_th | Diurnal Thermal Flux | 18% | Thermodynamics | MODIS + Thermocouples |
| 2 | Ψ_crack | Fissure Conductivity | 16% | Fracture Mechanics | Micro-CT + Permeameter |
| 3 | Rn_pulse | Radon Spiking Index | 18% | Radiochemistry | Durridge RAD7 |
| 4 | Ω_arid | Desiccation Index | 12% | Atmospheric Physics | FDR Sensors + MODIS AOD |
| 5 | Γ_geo | Geogenic Migration Velocity | 14% | Crustal Transport | Borehole pressure array |
| 6 | He_ratio | Helium-4 Signature | 10% | Noble Gas Geochem. | VG5400 IRMS |
| 7 | β_dust | Particulate Coupling | 7% | Aerosol Physics | Hi-Vol aerosol sampler |
| 8 | S_yield | Seismic Yield Potential | 5% | Seismotectonics | Microseismic array |
Composite Formula
All parameters are normalized to [0,1] relative to craton-specific reference thresholds, not global minima/maxima. This ensures that a Saharan craton and a Scandinavian shield are evaluated against their own geological baselines.
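The composite formula itself is not reproduced here. As a rough illustration only, the sketch below assumes the composite is a weighted linear sum of the eight normalized parameters, using the weights from the parameter table. The function names, dictionary keys, and the linear min-max scaling against a `(low, high)` threshold pair are all hypothetical and are not taken from the DESERTAS API.

```python
from typing import Dict, Tuple

# Weights copied from the parameter table; they sum to 1.0.
# The key names are illustrative, not DESERTAS identifiers.
WEIGHTS: Dict[str, float] = {
    "thermal_flux": 0.18,
    "fissure_conductivity": 0.16,
    "radon_spiking": 0.18,
    "desiccation": 0.12,
    "migration_velocity": 0.14,
    "helium_signature": 0.10,
    "particulate_coupling": 0.07,
    "seismic_yield": 0.05,
}


def normalize(value: float, bounds: Tuple[float, float]) -> float:
    """Scale a raw value to [0, 1] against craton-specific (low, high) thresholds.

    Assumption: simple linear scaling with clipping; the real normalization
    may differ.
    """
    low, high = bounds
    if high <= low:
        raise ValueError("upper threshold must exceed lower threshold")
    return min(1.0, max(0.0, (value - low) / (high - low)))


def drgis(raw: Dict[str, float],
          thresholds: Dict[str, Tuple[float, float]]) -> float:
    """Weighted sum of normalized parameters (assumed linear form)."""
    return sum(
        weight * normalize(raw[name], thresholds[name])
        for name, weight in WEIGHTS.items()
    )
```

Passing a different `thresholds` dictionary per craton is what would make the same raw reading score differently on, say, a Saharan craton and a Scandinavian shield.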
DRGIS Alert Levels
The DRGIS score is mapped to five operational alert levels that guide civil protection response, seismic hazard assessment, and monitoring priority.
| Level | DRGIS Range | Seismic State | Recommended Action | Lead Time |
|---|---|---|---|---|
| BACKGROUND | < 0.30 | Normal geochemical activity | Routine monitoring | – |
| WATCH | 0.30–0.48 | Elevated gas emissions | Enhanced monitoring frequency | 45–58 days |
| ALERT | 0.48–0.65 | Tectonic precursor signature | Civil protection notification | 31–44 days |
| EMERGENCY | 0.65–0.80 | Strong precursor confirmed | Emergency plan activation | 14–30 days |
| CRITICAL | > 0.80 | Imminent seismic risk | Evacuation of high-risk structures | < 14 days |
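The mapping from score to alert level can be sketched as a threshold lookup. This is a minimal illustration, not the `desertas.alerts` implementation. The table does not say which level a score exactly on a boundary belongs to, so the sketch assumes boundary values go to the higher level, and `alert_level` is a hypothetical name.

```python
from bisect import bisect_right

# Level names and boundaries taken from the alert table.
LEVELS = ["BACKGROUND", "WATCH", "ALERT", "EMERGENCY", "CRITICAL"]
BOUNDS = [0.30, 0.48, 0.65, 0.80]


def alert_level(score: float) -> str:
    """Return the alert level for a DRGIS score in [0, 1].

    Assumption: a score exactly on a boundary is assigned to the higher level.
    """
    if not 0.0 <= score <= 1.0:
        raise ValueError("DRGIS score must lie in [0, 1]")
    # bisect_right counts how many boundaries are <= score,
    # which is the index of the matching level.
    return LEVELS[bisect_right(BOUNDS, score)]
```

For example, `alert_level(0.52)` falls in the 0.48–0.65 band and returns `"ALERT"`.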
AI Ensemble Architecture
The AI ensemble combines three models into a unified predictor achieving 90.6% DRGIS accuracy and 93.1% pre-seismic detection rate.
Architecture Overview
| Model | Input | Architecture | Ensemble Weight |
|---|---|---|---|
| LSTM | Rn_pulse time series (22-year archive) | 2-layer LSTM with attention | 0.40 |
| XGBoost | 8 tabular parameters + SHAP | Gradient boosting with feature attribution | 0.35 |
| CNN | Spatial grid + fault network topology | 3-layer CNN with spatial attention | 0.25 |
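One plausible reading of the ensemble weights is soft voting: each model emits a precursor probability and the ensemble output is their weighted average. The source does not state how the three outputs are combined, so the sketch below is an assumption, and `ensemble_probability` is a hypothetical helper rather than part of `desertas.models`.

```python
from typing import Dict

# Ensemble weights from the architecture table; they sum to 1.0.
ENSEMBLE_WEIGHTS: Dict[str, float] = {"lstm": 0.40, "xgboost": 0.35, "cnn": 0.25}


def ensemble_probability(model_probs: Dict[str, float]) -> float:
    """Weighted average of per-model probabilities (assumed soft voting)."""
    missing = set(ENSEMBLE_WEIGHTS) - set(model_probs)
    if missing:
        raise KeyError(f"missing model outputs: {sorted(missing)}")
    return sum(
        weight * model_probs[name] for name, weight in ENSEMBLE_WEIGHTS.items()
    )
```

With this scheme the LSTM's radon time-series output carries the most influence, but no single model can push the combined probability past its own weight without agreement from the others.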
Performance
Training Time
4× NVIDIA T4
Inference
Per station
Accuracy Gain
vs. single-parameter Rn_pulse
The ensemble achieves 90.6% accuracy on held-out test data, an improvement of 18.2 percentage points over single-parameter Rn_pulse prediction (72.4%). SHAP analysis confirms Rn_pulse, He_ratio, and Γ_geo as the most influential parameters.
API Reference
desertas.parameters: Individual Parameter Modules
All eight parameter classes share a common interface:
desertas.core: DRGIS Composite Engine
desertas.models: AI Ensemble
desertas.alerts: Tectonic Alert System
desertas.cli: Command Line Interface
Snakemake Workflows
All analyses are reproducible via Snakemake. The master pipeline automatically determines which rules to run based on available inputs.
Run Full Validation Pipeline
Available Rules
| Rule File | Description | Inputs | Outputs |
|---|---|---|---|
| preprocessing.smk | Radon, helium, thermal data preprocessing | data/raw/ | data/processed/ |
| parameter_computation.smk | Compute all 8 parameter scores per DRGU | data/processed/ | data/processed/parameters/ |
| drgis_aggregation.smk | Normalize and aggregate DRGIS index | data/processed/parameters/ | data/processed/drgis_scores/ |
| ensemble_training.smk | Train AI ensemble models | data/processed/ | models/ + results/ |
| validation.smk | Cross-validation, hypothesis testing | data/processed/drgis_scores/ | results/validation/ |
Data & Formats
Supported Input Formats
| Parameter | Format | Source | Typical Size |
|---|---|---|---|
| ΔΦ_th | CSV / NetCDF | MODIS + thermocouples | 5–50 MB per station/year |
| Ψ_crack | CSV / JSON | Micro-CT + permeameter | <1 MB per sample |
| Rn_pulse | CSV / NPY | Durridge RAD7 | 10–100 MB per station/year |
| Ω_arid | CSV / NetCDF | FDR + MODIS AOD | 1–10 MB per station/year |
| Γ_geo | CSV / JSON | Borehole pressure array | <1 MB per station/year |
| He_ratio | CSV / JSON | IRMS | <0.1 MB per sample |
| β_dust | CSV / NetCDF | Aerosol sampler | 1–5 MB per station/year |
| S_yield | CSV / MiniSEED | Microseismic array | 10–500 MB per station/year |
Output Formats
Applications
Seismic Hazard Assessment
DRGIS provides a 58-day mean lead time before M ≥ 4.0 events, enabling phased civil protection response. The five-level alert system (BACKGROUND to CRITICAL) guides decision-making from routine monitoring through to evacuation.
Craton Monitoring Network
36 stations across 7 craton systems provide continuous geochemical surveillance of previously unmonitored intraplate regions. The network detects aseismic slow-slip events as well as seismic precursors.
Volcanic-Tectonic Discrimination
The He_ratio gradient distinguishes subduction-tectonic from volcanic-magmatic gas sources with 93.4% accuracy in complex settings such as the Atacama Desert.
Validation & Reproducibility
Cross-Validation Protocol
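DESERTAS uses leave-one-station-out cross-validation across all 36 stations: each fold trains on 35 stations and tests on the one held out. This prevents within-station spatial autocorrelation from leaking into the test set and tests generalization to entirely unseen locations.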
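The splitting scheme can be sketched in a few lines of plain Python. This is a generic illustration of leave-one-group-out splitting with stations as groups, not the code in `validation.smk`; `leave_one_station_out` and the station labels are hypothetical.

```python
from typing import Iterator, List, Sequence, Tuple


def leave_one_station_out(
    station_ids: Sequence[str],
) -> Iterator[Tuple[str, List[int], List[int]]]:
    """Yield (held_out_station, train_indices, test_indices) per station.

    station_ids[i] is the station that sample i was recorded at.
    """
    for held_out in sorted(set(station_ids)):
        test_idx = [i for i, s in enumerate(station_ids) if s == held_out]
        train_idx = [i for i, s in enumerate(station_ids) if s != held_out]
        yield held_out, train_idx, test_idx


# Usage with made-up station labels: every sample from the held-out
# station lands in the test fold and none of them in the training fold.
samples = ["ST01", "ST01", "ST02", "ST03", "ST02"]
folds = list(leave_one_station_out(samples))
```

With 36 stations this produces 36 folds, and the reported accuracy would be an aggregate over them.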
- DRGIS Accuracy: 90.6% (36-station cross-validation)
- Detection Rate: 93.1% (true positive rate)
- False Alert Rate: 5.4% (false positive rate)
- Mean Lead Time: 58 days (before M ≥ 4.0)
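For reference, the detection rate and false alert rate are labeled as the true positive rate and false positive rate, which follow from confusion-matrix counts as sketched below. The function names are hypothetical and the counts in the usage lines are invented purely to show the arithmetic; they are not DESERTAS results.

```python
def detection_rate(true_pos: int, false_neg: int) -> float:
    """True positive rate: detected events / all events that occurred."""
    total = true_pos + false_neg
    if total == 0:
        raise ValueError("no positive cases to evaluate")
    return true_pos / total


def false_alert_rate(false_pos: int, true_neg: int) -> float:
    """False positive rate: false alerts / all quiet periods."""
    total = false_pos + true_neg
    if total == 0:
        raise ValueError("no negative cases to evaluate")
    return false_pos / total


# Illustrative counts only (not taken from the validation dataset).
tpr = detection_rate(true_pos=9, false_neg=1)
fpr = false_alert_rate(false_pos=1, true_neg=19)
```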
Reproducing All Results
Changelog
Mar 2026
Initial Release
Full eight-parameter DRGIS framework, AI ensemble, validated across 2,491 DRGUs from 36 stations across 7 cratons. Paper submitted to Nature Geoscience.
Feb 2026
Beta Release
Complete parameter suite, Bayesian weight determination, precursor sequence tracking. Validation across 1,847 DRGUs from 24 stations.
Dec 2025
Alpha: Core Framework
Four-parameter prototype (ΔΦ_th, Ψ_crack, Rn_pulse, He_ratio) functional on 847 DRGUs from 12 stations.
Publications
If you use DESERTAS in your research, please cite the primary paper using the BibTeX entry below.
Acknowledgments
The DESERTAS framework builds upon the foundational work of the global geochemistry and seismology community. Special thanks to:
- The 36 national geological surveys and protected area authorities whose monitoring infrastructure made this research possible
- The San (Bushmen) community monitors of the Northern Cape for traditional rock-breath observational records integrated under FPIC protocols
- The Wangkatja (Martu) traditional landowners of the Western Gibson Desert for geological lineament knowledge
- The USGS, ISC, and regional seismological networks for open-access earthquake catalogs
- The ESA Copernicus Program for InSAR and MODIS LST data
- The CRPG-CNRS Nancy for noble gas mass spectrometry access
- The Ronin Institute for supporting independent scholarship
This research is dedicated to the 2,946 people who died in the 2023 Al Haouz earthquake, and to the argument that the instruments that could have warned them existed and needed only to be deployed.