Predictions vs Observations

IONIS V22-gamma model predictions against 6,964 observed signatures from 3Y0K Bouvet Island

Total Predictions

6,964

RBN Paths

548

PSKR Paths

6,416

Physics Overrides

Night/high-band closure

Predicted vs Observed (6,964 points)

Dashed line = perfect prediction (1:1). Points above = model overpredicts. Points below = model underpredicts.

Storm Impact — 3-Way Comparison (RBN, dB)

Condition	Paths	Observed	IONIS	VOACAP	IONIS Bias	VOACAP Bias
Quiet (Kp < 2)	265	19.3 dB	19.4 dB	26.0 dB	+0.1	+6.6
Storm (Kp ≥ 4)	15	12.2 dB	18.2 dB	-246.2 dB	+6.0	-258.4

During quiet conditions (Kp < 2), IONIS tracks observed SNR within 0.2 dB. During the Kp 6.0 storm, observed SNR dropped from 19.3 to 12.4 dB. IONIS correctly predicts degradation but underestimates the magnitude (+5.8 dB bias). VOACAP uses monthly-median coefficients and cannot respond to real-time Kp — this is a design constraint, not a flaw — and predicts severe band closure (−245.9 dB) for paths that were actually open at reduced signal levels.

Note:Storm sample is anecdotal (n=14), not statistically significant. Bootstrap 95% CI for overall bias: IONIS [−2.68, −1.08] dB, VOACAP [−19.75, −3.82] dB.

On RMSE:RMSE omitted from comparisons. VOACAP's 811 circuit failure flags (< −100 dB) inflate RMSE to 98.3 dB, making it a meaningless comparison metric. Bias and MAE are fairer measures.

IONIS V22-gamma vs VOACAP — Head-to-Head (RBN)

Band	Paths	Observed	IONIS	VOACAP	IONIS Bias	VOACAP Bias
160m	4	24.8 dB	11.7 dB	-155.3 dB	-13.0	-180.0
80m	49	24.7 dB	17.0 dB	-100.0 dB	-7.6	-124.7
40m	58	18.5 dB	22.9 dB	-57.6 dB	+4.4	-76.0
30m	57	18.0 dB	20.0 dB	-11.1 dB	+2.0	-29.1
20m	56	23.6 dB	18.5 dB	9.8 dB	-5.0	-13.8
17m	64	15.3 dB	17.2 dB	44.4 dB	+1.9	+29.1
15m	156	22.1 dB	17.8 dB	42.2 dB	-4.3	+20.1
12m	28	16.6 dB	16.9 dB	32.1 dB	+0.4	+15.5
10m	76	17.5 dB	15.5 dB	44.3 dB	-2.0	+26.8

IONIS V22-gamma (207K params, trained on 175M signatures) vs VOACAP (NTIA/ITS deterministic ray-tracing engine, CCIR semi-empirical coefficients). Neither system was tuned for Bouvet Island paths. Bias color: green (<5 dB), amber (5-10 dB), red (>10 dB).

Per-Band Prediction Summary (RBN)

Band	Paths	Obs Mean	Pred Mean	Bias
160m	4	2.07σ	-0.02σ	-2.09σ
80m	49	1.15σ	0.01σ	-1.14σ
40m	58	0.02σ	0.69σ	+0.67σ
30m	57	0.16σ	0.47σ	+0.31σ
20m	56	0.76σ	0.01σ	-0.75σ
17m	64	-0.19σ	0.07σ	+0.27σ
15m	156	0.87σ	0.22σ	-0.66σ
12m	28	0.23σ	0.29σ	+0.06σ
10m	76	0.26σ	-0.04σ	-0.30σ

Bias color: green (<0.2σ), amber (0.2-0.5σ), red (>0.5σ). JD04 has near-zero coverage in training data — this is the model's hardest test.

Analysis Summary

This comparison tests two independent HF propagation prediction systems against real observations from Bouvet Island — a location that neither system was specifically designed or tuned for.

VOACAP is the industry-standard HF prediction engine, developed by NTIA/ITS in the 1980s and trusted by broadcasters, military planners, and radio amateurs for four decades. It is a deterministic ray-tracing framework using semi-empirical ionospheric coefficients (CCIR/URSI) derived from decades of ionosonde measurements to predict monthly-median circuit performance.

IONIS V22-gamma is a neural network (207K parameters) trained on 175 million propagation signatures from WSPR beacons, RBN skimmers, and contest QSOs. It learns propagation patterns directly from observed data rather than from ionospheric models.

Bouvet Island at 54°S presents a genuine blind test for both systems. VOACAP's ionospheric coefficients have sparse coverage at extreme southern latitudes, and IONIS had virtually zero training data from grid JD04.

The results show that IONIS tracks observed SNR within a few dB across most bands (overall bias: −1.9 dB), while VOACAP's predictions diverge significantly at this latitude (overall bias: −11.3 dB), particularly on the low bands where it underestimates propagation and on the high bands where it overestimates.

This does not diminish VOACAP's value — it remains an essential tool and the foundation that the entire HF prediction field is built on. What it suggests is that data-driven approaches may extend prediction accuracy into regions where traditional semi-empirical models have less observational support, such as extreme polar paths. Both approaches are complementary: VOACAP's deterministic ray-tracing framework provides interpretability and reliability across well-characterized paths, while ML models can generalize from large observational datasets to capture highly dynamic, short-term propagation features.

Survivorship bias: These comparisons evaluate prediction accuracy given that a path was observed. Paths that did not propagate are not captured by RBN or PSKR, creating inherent survivorship bias. VOACAP's circuit failure predictions (< −100 dB) may be correct for unobserved paths. Training overlap: 1,426 WSPR spots from JD04 of 10.8 billion (0.00001%) — a near-blind test for IONIS.

About These Predictions

IONIS V22-gamma predictions generated from the DXpedition signature dataset using the production checkpoint. VOACAP predictions generated by voacapl (v16.1207W) with CCIR coefficients, Method 30 (complete system performance), and const17 antenna model. Both systems received identical inputs: path geometry, frequency, and smoothed sunspot number.

All data and code used in this comparison are publicly available. The 3Y0K dataset is on SourceForge.