This section audits symbolic/analytic mathematical consistency (algebra, derivations, dimensional/unit checks, definition consistency).
Maths relevance: substantial
The paper’s analytic content centers on (i) defining finite-difference approximations for spatial/temporal derivatives on a periodic 3D grid, (ii) defining vector-calculus operators and nonlinear convective terms used to build a candidate feature library, and (iii) relating sparse-regression-selected terms to the continuity equation and a Navier–Stokes-type momentum equation. Most equations are standard identities/definitions and are internally consistent; the main internal tension is the simultaneous use of “incompressible Navier–Stokes” language with a density-gradient proxy for pressure, which requires extra assumptions not fully formalized.
✔ Grid spacing from box length (Sec. 2.1, p. 3)
✔ First spatial derivative central difference (Sec. 2.2, p. 3)
✔ Second spatial derivative central difference (Sec. 2.2, p. 4)
✔ Laplacian operator assembly (Sec. 2.2, p. 4)
✔ Velocity divergence definition (Sec. 2.2, p. 4)
✔ Curl component definitions (Sec. 2.2, p. 4)
✔ Temporal derivative central difference (Sec. 2.2, p. 4)
✔ Valid temporal indices for central differencing (Sec. 2.1 and 2.2, pp. 3–4)
✔ Convective term definitions (Sec. 2.3, p. 5)
✔ Candidate feature library size (43 terms) (Sec. 2.3, pp. 4–5)
✔ Continuity equation product-rule expansion (Sec. 3.3.1, p. 10)
✔ Near-uniform density approximation in continuity term (Sec. 3.3.1, p. 10)
✔ Navier–Stokes momentum equation form (Sec. 3.3.2, p. 11)
⚠ Pressure-gradient surrogate via density gradient (Sec. 3.3.2, p. 11)
✔ Sign of viscous diffusion term in momentum (Sec. 3.3.2, pp. 11–12)
✔ Effect of standardization on coefficient sign and interpretability (Sec. 2.1, p. 3 and Sec. 3.3, pp. 10–12)
This section audits numerical/empirical consistency: reported metrics, experimental design, baseline comparisons, statistical evidence, leakage risks, and reproducibility.
All executed internal-consistency recomputations passed: dataset shape and grid spacing are arithmetically consistent; central-difference time indexing yields the stated $8$ valid slices; flattened sample counts comfortably exceed the $200,!000$ subsample; the $80/20$ split yields exact integer counts; enumerated feature-library term counts sum to $43$; $5$-fold CV on the training set is numerically feasible; abstract $R^2$ range is consistent with component $R^2$s under a $0.005$ bound-rounding tolerance; and reported train/test metrics examples show small gaps.
✔ C1_dataset_shape_consistency (Methods §2.1 (page 3))
✔ C2_grid_spacing_from_L_over_N (Methods §2.1 (page 3))
✔ C3_valid_time_slices_count_for_central_difference (Methods §2.1 and §2.2 (page 3-4))
✔ C4_flattened_sample_count_before_subsample (Methods §2.1 (page 3))
✔ C5_train_test_split_counts_from_200k (Methods §2.1 (page 3))
✔ C6_feature_library_count_from_enumerated_terms (Methods §2.3 (pages 4-5))
✔ C7_cross_validation_folds_count (Methods §2.4 (page 5))
✔ C8_r2_range_vs_component_values (Abstract (page 1) vs Results §3.3.2 (page 11))
✔ C9_density_r2_train_test_close (Results §3.3.1 (page 11))
✔ C10_vy_mse_train_test_close (Results §3.4 (page 12))
✔ C11_density_std_relative_to_mean (Results §3.1 (pages 6-7))
✔ C12_velocity_std_range_width (Results §3.1 (page 7))
✔ C13_continuity_equation_expansion_signs (Results §3.3.1 (page 10))
| Dimension | Score |
|---|---|
| Overall | 5/10 █████░░░░░ |
| Soundness | 5/10 █████░░░░░ |
| Novelty | 3/10 ███░░░░░░░ |
| Significance | 4/10 ████░░░░░░ |
| Clarity | 6/10 ██████░░░░ |
| Evidence Quality | 4/10 ████░░░░░░ |
Justification: The pipeline and mathematical formulations are largely correct and coherent, and the numerical results show moderate test R-squared for velocity components; the maths audit is mostly PASS with only one UNCERTAIN point regarding the pressure–density surrogate. However, key methodological gaps remain: the data-generating PDEs/solver are unspecified, coefficients are only reported in standardized units, validation is limited to pointwise derivative prediction without forward rollouts, and potential train/test leakage is present. Together with weak density fits, many retained terms (limited sparsity), and no robustness analyses to derivative estimation or library design, the evidence supports only a tentative, qualitative recovery of Navier–Stokes-like structure.