Predictions vs Observations

IONIS V22-gamma model predictions against 6,964 observed signatures from 3Y0K Bouvet Island

Total Predictions

6,964

RBN Paths

548

PSKR Paths

6,416

Physics Overrides

66

Night/high-band closure

Predicted vs Observed (6,964 points)

||

Dashed line = perfect prediction (1:1). Points above = model overpredicts. Points below = model underpredicts.

Storm Impact — 3-Way Comparison (RBN, dB)

ConditionPathsObservedIONISVOACAPIONIS BiasVOACAP Bias
Quiet (Kp < 2)26519.3 dB19.4 dB26.0 dB+0.1+6.6
Storm (Kp ≥ 4)1512.2 dB18.2 dB-246.2 dB+6.0-258.4

During quiet conditions (Kp < 2), IONIS tracks observed SNR within 0.2 dB. During the Kp 6.0 storm, observed SNR dropped from 19.3 to 12.4 dB. IONIS correctly predicts degradation but underestimates the magnitude (+5.8 dB bias). VOACAP uses monthly-median coefficients and cannot respond to real-time Kp — this is a design constraint, not a flaw — and predicts severe band closure (−245.9 dB) for paths that were actually open at reduced signal levels.

Note:Storm sample is anecdotal (n=14), not statistically significant. Bootstrap 95% CI for overall bias: IONIS [−2.68, −1.08] dB, VOACAP [−19.75, −3.82] dB.

On RMSE:RMSE omitted from comparisons. VOACAP's 811 circuit failure flags (< −100 dB) inflate RMSE to 98.3 dB, making it a meaningless comparison metric. Bias and MAE are fairer measures.

IONIS V22-gamma vs VOACAP — Head-to-Head (RBN)

BandPathsObservedIONISVOACAPIONIS BiasVOACAP Bias
160m424.8 dB11.7 dB-155.3 dB-13.0-180.0
80m4924.7 dB17.0 dB-100.0 dB-7.6-124.7
40m5818.5 dB22.9 dB-57.6 dB+4.4-76.0
30m5718.0 dB20.0 dB-11.1 dB+2.0-29.1
20m5623.6 dB18.5 dB9.8 dB-5.0-13.8
17m6415.3 dB17.2 dB44.4 dB+1.9+29.1
15m15622.1 dB17.8 dB42.2 dB-4.3+20.1
12m2816.6 dB16.9 dB32.1 dB+0.4+15.5
10m7617.5 dB15.5 dB44.3 dB-2.0+26.8

IONIS V22-gamma (207K params, trained on 175M signatures) vs VOACAP (NTIA/ITS deterministic ray-tracing engine, CCIR semi-empirical coefficients). Neither system was tuned for Bouvet Island paths. Bias color: green (<5 dB), amber (5-10 dB), red (>10 dB).

Per-Band Prediction Summary (RBN)

BandPathsObs MeanPred MeanBias
160m42.07σ-0.02σ-2.09σ
80m491.15σ0.01σ-1.14σ
40m580.02σ0.69σ+0.67σ
30m570.16σ0.47σ+0.31σ
20m560.76σ0.01σ-0.75σ
17m64-0.19σ0.07σ+0.27σ
15m1560.87σ0.22σ-0.66σ
12m280.23σ0.29σ+0.06σ
10m760.26σ-0.04σ-0.30σ

Bias color: green (<0.2σ), amber (0.2-0.5σ), red (>0.5σ). JD04 has near-zero coverage in training data — this is the model's hardest test.

Analysis Summary

This comparison tests two independent HF propagation prediction systems against real observations from Bouvet Island — a location that neither system was specifically designed or tuned for.

VOACAP is the industry-standard HF prediction engine, developed by NTIA/ITS in the 1980s and trusted by broadcasters, military planners, and radio amateurs for four decades. It is a deterministic ray-tracing framework using semi-empirical ionospheric coefficients (CCIR/URSI) derived from decades of ionosonde measurements to predict monthly-median circuit performance.

IONIS V22-gamma is a neural network (207K parameters) trained on 175 million propagation signatures from WSPR beacons, RBN skimmers, and contest QSOs. It learns propagation patterns directly from observed data rather than from ionospheric models.

Bouvet Island at 54°S presents a genuine blind test for both systems. VOACAP's ionospheric coefficients have sparse coverage at extreme southern latitudes, and IONIS had virtually zero training data from grid JD04.

The results show that IONIS tracks observed SNR within a few dB across most bands (overall bias: −1.9 dB), while VOACAP's predictions diverge significantly at this latitude (overall bias: −11.3 dB), particularly on the low bands where it underestimates propagation and on the high bands where it overestimates.

This does not diminish VOACAP's value — it remains an essential tool and the foundation that the entire HF prediction field is built on. What it suggests is that data-driven approaches may extend prediction accuracy into regions where traditional semi-empirical models have less observational support, such as extreme polar paths. Both approaches are complementary: VOACAP's deterministic ray-tracing framework provides interpretability and reliability across well-characterized paths, while ML models can generalize from large observational datasets to capture highly dynamic, short-term propagation features.

Survivorship bias: These comparisons evaluate prediction accuracy given that a path was observed. Paths that did not propagate are not captured by RBN or PSKR, creating inherent survivorship bias. VOACAP's circuit failure predictions (< −100 dB) may be correct for unobserved paths. Training overlap: 1,426 WSPR spots from JD04 of 10.8 billion (0.00001%) — a near-blind test for IONIS.

About These Predictions

IONIS V22-gamma predictions generated from the DXpedition signature dataset using the production checkpoint. VOACAP predictions generated by voacapl (v16.1207W) with CCIR coefficients, Method 30 (complete system performance), and const17 antenna model. Both systems received identical inputs: path geometry, frequency, and smoothed sunspot number.

All data and code used in this comparison are publicly available. The 3Y0K dataset is on SourceForge.