This section audits symbolic/analytic mathematical consistency (algebra, derivations, dimensional/unit checks, definition consistency).
Maths relevance: substantial
The paper contains a moderate set of central mathematical constructions: (i) definitions of scattering/ScatCov coefficients and a weighted $L_2$ synthesis loss; (ii) a per-$\ell$-bin Fourier-domain rescaling (‘soft $C_\ell$ match’); (iii) a multi-channel per-$\ell$-bin covariance whitening/recolouring transform (Eq. 12) claimed to enforce all auto- and cross-spectra; and (iv) several analytic interpretations (e.g., $-1/\sqrt{2}$ residual-correlation floor). The core linear-algebra of Eq. (12) is consistent, but several ‘by construction’ and ‘phase-preserving’ claims are overstated or depend on missing convergence/ordering details for the alternating histogram/spectrum projections.
✔ Scattering coefficient definitions (Eq. (1), Sec. 3.1, p.3)
⚠ Weighted ScatCov loss well-posedness (Eq. (2), Sec. 3.2, p.3)
⚠ Spectral-matched initialisation (Eq. (3), Sec. 3.2, p.3)
✔ Joint loss decomposition (Eq. (4), Sec. 3.3, p.3)
✖ Correlation coefficient definition vs 'demeaned Pearson' (Eq. (5) and text, Sec. 3.3, p.3)
✔ Sign penalty definition (Eq. (6), Sec. 3.3, p.3)
✔ Multi-channel operator mapping (Eq. (7), Sec. 3.4, p.3)
✔ Soft $C_\ell$ rescale amplitude factor (Eq. (8), Sec. 3.5, p.4)
✖ Soft $C_\ell$ rescale 'spatial properties preserved' claim (Text after Eq. (8), Sec. 3.5, p.4)
✔ Cosine schedule definition (Eq. (9), Sec. 3.6, p.4)
✔ DDPM reverse sampling step and coefficient discussion (Eq. (10) and §3.6.1, p.4)
✔ DDPM training objective (Eq. (11), Sec. 3.6.2, p.4)
✔ Cross-spectral covariance definition (§4.1.1, p.5)
✔ Joint Cholesky $C_\ell$-match enforces target covariance (Eq. (12), Sec. 4.1.1, p.5)
✖ Eq. (12) described as 'phase-preserving' (Abstract; §4.1.1, p.5; §5.2, p.11)
⚠ Pixel cross-correlation follows from matching spectra (§4.1.1 and §4.2, pp.5–6)
✔ Histogram match implies matching all 1-point statistics (§4.1.1, p.5)
⚠ Alternating histogram match and Cholesky gives simultaneous exact 1- and 2-point matching (§4.1.1, p.5; ‘by construction’ claims in Abstract and §5.6)
✔ Residual-correlation algebraic floor (§4.2.1, p.6)
✔ Minkowski functional definitions and $M_0$ matching argument (§4.4, p.9)
✔ Triple loss function with sign constraint (Eq. (13), Sec. 5.3, p.12)
✔ $N\times N$ generalisation of Eq. (12) (§5.3 and §5.4, pp.12–13)
This section audits numerical/empirical consistency: reported metrics, experimental design, baseline comparisons, statistical evidence, leakage risks, and reproducibility.
Out of 23 candidate numeric checks, 15 passed, 2 failed, and 6 were uncertain (mostly due to missing tabulated inputs for the intended recomputations). The two failures concern (i) a unit/pixel-scale consistency check for patch size vs pixel resolution, and (ii) a rounding/precision claim about matching to four decimal places for a median ratio example. Other recomputed percentages/ratios (held-out recovery percentages, skew/kurtosis ratios, algebraic constant $-1/\sqrt{2}$, and various percent recoveries) were numerically consistent within stated tolerances.
✔ C01_train_aug_count (p.3, end of §2 DATA)
✖ C02_patch_pixels_area_consistency (p.2, §2 DATA)
✔ C03_patch8_percentile_from_1523 (p.3, §2 DATA (Patch 8 description))
✔ C04_bandavg_from_bins_ST_tSZ (p.6, Table 2)
⚠ C05_bandavg_from_bins_ST_CIB (p.6, Table 2)
⚠ C06_bandavg_from_bins_DDPM_tSZ (p.6, Table 2)
⚠ C07_bandavg_from_bins_DDPM_CIB (p.6, Table 2)
✔ C08_skew_kurt_ratio_claims (p.8, §4.3 PDF Statistics)
✔ C09_min_core_recovery_percent (p.8, §4.3 PDF Statistics)
✔ C10_crossr_recovery_heldout_ST (p.7, held-out evaluation paragraph + p.11/p.12 held-out bullets + Table 7)
✔ C11_crossr_recovery_heldout_DDPM (p.12, DDPM held-out protocol bullets + Table 7)
✔ C12_core_depth_recovery_ST (p.11, held-out bullets + Table 7)
✔ C13_core_depth_recovery_DDPM (p.12, DDPM held-out bullets + Table 7)
✔ C14_algebraic_floor_minus_1_over_sqrt2 (p.6-7, §4.2.1 (residual diagnostic) + Fig. 6 text)
✔ C15_sampling_coeff_error_factor (p.4, §3.6.1 Critical sampling coefficient)
✔ C16_corrunet_param_count_check (p.4, §3.6.2 Architecture)
✖ C17_multi_freq_median_ratio_delta (p.13, §5.4 (text))
✔ C18_fig12_residual_dp_claim (p.13-15, §5.4 text + Fig. 12(b) caption)
✔ C19_crossr_ladder_percentages (p.16-18, §6 + Fig. 15)
✔ C20_iteration_sweep_percentages (p.16-17, §6.1 + Fig. 14 / Fig. 15 text)
✔ C21_break_even_samples_calc (p.19, §6.1.0.6)
⚠ C22_bp_error_reduction_factors (p.22, §7.2 (quantitative improvements))
⚠ C23_peak_count_deficits_percent (p.25, Table 17)
| Dimension | Score |
|---|---|
| Overall | 6/10 ██████░░░░ |
| Soundness | 6/10 ██████░░░░ |
| Novelty | 6/10 ██████░░░░ |
| Significance | 6/10 ██████░░░░ |
| Clarity | 5/10 █████░░░░░ |
| Evidence Quality | 6/10 ██████░░░░ |
Justification: The paper’s core linear-algebraic calibration (Eq. 12) is sound and well-supported by extensive experiments, and the generator-agnostic falsification plus the non-by-construction ladder provide useful, original insights. However, the audits flag important presentation and methodological caveats: overstatements about “phase-preserving” and simultaneous 1-pt/2-pt matching, an inconsistent Pearson definition, and missing implementation specifics for the spectral whitening/recolouring that limit reproducibility and rigor. Empirically, results are strong across a broad diagnostic suite, but several claims rely on small-N studies, paired-CDF conditioning risks, and uneven uncertainty reporting. Overall this is a solid, above-average contribution that would benefit from tightening the math claims, clarifying what is enforced vs learned, and standardizing uncertainty/implementation details.