-
Dataset provenance and ground truth are not sufficiently specified. The dataset is described mainly as 10 time slices of $\rho$ and $\mathbf{v}$ on a periodic $128^3$ grid (Sec. 2.1, Sec. 3.1), but it is unclear whether the data are simulated (and if so with what governing equations, solver, parameters, discretization, time step, and boundary conditions), experimental, or synthetic. Without this, it is difficult to interpret residual magnitudes, assess whether the inferred “Euler-like” model matches the true generator, and evaluate the significance/novelty of the identification claims (Sec. 3.5, Sec. 4).
Recommendation: Expand Sec. 2.1 / Sec. 3.1 to specify: (i) data origin (simulation/experiment/synthetic), underlying PDE(s) if known (Euler/Navier–Stokes/etc.), boundary conditions, numerical method, resolution, and actual $\Delta t$; (ii) nondimensionalization/units and characteristic scales (e.g., reference $\rho$, velocity, length; Reynolds/Mach numbers if applicable); (iii) any preprocessing (filtering, normalization). If ground truth is known, state it explicitly and add a targeted comparison in Sec. 3.5 (what matches, what does not). If details are proprietary, provide as much generic information as possible and consider releasing a subset or a synthetic surrogate dataset for reproducibility.
-
Positioning and novelty are currently overstated relative to existing PDE discovery/validation approaches. Although the title/Abstract emphasize “equation discovery,” the presented workflow is primarily residual-based validation of a small set of hand-chosen candidate forms plus a correlation-based missing-term heuristic (Sec. 2.3–2.4, Sec. 3.2–3.4). This is conceptually close to established residual checking, PDE-FIND/SINDy-style libraries with derivative estimation, and weak-form/integral approaches designed for noisy derivatives, but the paper does not clearly articulate what is new (Introduction, Sec. 1).
Recommendation: In the Introduction (Sec. 1) add a focused related-work and positioning subsection that distinguishes your contribution from (i) PDE-FIND/SINDy (including weak-form variants), (ii) integral/constraint-based formulations, and (iii) PINN/adjoint identification. Explicitly state whether the novelty is (a) the sparse-time/high-space regime emphasis, (b) a practical spectral-derivative residual pipeline for 3D fields, (c) a diagnostic workflow for hypothesis testing rather than symbolic search, or (d) something else. If feasible, add a compact baseline comparison (Appendix or Sec. 3): run at least one established method under the same “10-slice” constraint and compare residuals/identified terms. If not feasible, narrow claims to “validation/refinement” and outline how the approach could plug into a broader discovery pipeline (Sec. 4).
-
Temporal derivative estimation and time-step scaling are insufficiently analyzed. The method relies on first-order forward differences over only nine intervals (Sec. 2.2), and the unstructured report notes an ambiguity around taking $\Delta t = 1$ “for relative comparison.” Temporal truncation error and unknown/implicit scaling can significantly affect both residual magnitudes (MAE/RMSE) and the apparent need for missing terms (Sec. 3.2–3.4).
Recommendation: Clarify in Sec. 2.2 whether $\Delta t$ is physical/simulation time or an index step; if known, report it and propagate units consistently. Add a sensitivity analysis: (i) compare forward vs central differences for interior slices (and possibly a 2nd-order scheme), (ii) if higher-frequency data exist, subsample at multiple $\Delta t$ to quantify how MAE/RMSE and correlations change with temporal sparsity, and/or (iii) include a controlled synthetic test (Appendix) where ground-truth $\partial_t$ is known. Use this to bound how much of the momentum residual could plausibly arise from temporal differencing error versus missing physics (discuss in Sec. 3.5 and Sec. 4).
-
Momentum-equation form being tested may be mismatched to compressible dynamics, weakening interpretation of residuals and “missing term” attribution. The paper tests a nonconservative, pressureless velocity form $\partial_t\mathbf{v} \approx -(\mathbf{v}\cdot\nabla)\mathbf{v}$ (Sec. 3.3), while compressible momentum balance is naturally expressed in conservative form $\partial_t(\rho\mathbf{v}) + \nabla\cdot(\rho\mathbf{v}\otimes\mathbf{v}) + \nabla P = \cdots$. Testing $\partial_t\mathbf{v}$ rather than $\partial_t(\rho\mathbf{v})$ can entangle density-variation effects and may alter residual structure and the inferred “pressure term.”
Recommendation: Justify in Sec. 2.3 / Sec. 3.3 why the chosen nonconservative velocity form is the right diagnostic for this dataset (e.g., quantify ‘weak compressibility’ beyond density range, or show density fluctuations are negligible in the momentum budget). Strongly consider adding a parallel validation in Sec. 3.3 of the conservative momentum equation using available $\rho$ and $\mathbf{v}$: compute $\Delta(\rho\mathbf{v})/\Delta t$ and $-\nabla\cdot(\rho\mathbf{v}\otimes\mathbf{v})$, then analyze residuals and missing-term correlations/regressions for $-\nabla P$ (and viscous stress divergence if applicable). This would substantially strengthen the physical interpretability of Sec. 3.4–3.5.
-
Pressure/closure identification is not yet supported at the level claimed. The conclusion that the system is “accurately governed by compressible Euler” with a pressure gradient inferred from an isothermal proxy rests mainly on moderate Pearson correlations ($r\approx 0.60$–0.64) with $F_P \propto -(1/\rho)\nabla\rho$ (Sec. 2.4, Sec. 3.4.1–3.4.2), without estimating coefficients, testing competing EOS forms, addressing collinearity among candidate terms, or demonstrating out-of-sample predictive skill (Sec. 3.5, Sec. 4). Correlation is suggestive but not identification.
Recommendation: Tone down definitive language in the Abstract/Sec. 3.5/Sec. 4 to “consistent with” unless stronger evidence is added. Strengthen Sec. 3.4 by moving from correlation-only to coefficient estimation: (i) fit $\mathbf{R}_V \approx c\,F_P$ (or the conservative-form analogue) and report residual reduction / explained variance; (ii) test EOS variants $P \propto \rho^\gamma$ (e.g., $\gamma\in\{1,1.4\}$) and/or fit $\gamma$ over a small grid; (iii) report statistics across all nine time intervals (mean $\pm$ std) rather than a single representative interval (Table 2 / Sec. 3.4). If feasible, add a short forward-prediction or held-out-interval check to demonstrate that the inferred term(s) improve predictive agreement beyond the analyzed interval.
-
Viscosity and dissipative effects are not tested with a physically appropriate compressible operator, so concluding “viscous effects are minor” is premature. The manuscript correlates residuals with $\nabla^2\mathbf{v}$ (Sec. 2.4, Sec. 3.4.2), but the compressible Navier–Stokes viscous term is the divergence of a stress tensor and typically includes both $\nabla^2\mathbf{v}$ and $\nabla(\nabla\cdot\mathbf{v})$ contributions (and possibly density/temperature-dependent viscosity). Weak correlation with $\nabla^2\mathbf{v}$ alone does not rule out viscous/bulk-viscous effects.
Recommendation: In Sec. 2.4 / Sec. 3.4.2, expand the dissipative candidate library to include at least $\nabla(\nabla\cdot\mathbf{v})$ and (if feasible) a combined operator mimicking constant-viscosity compressible stress divergence (up to unknown coefficients). Then perform regression (preferred) or correlation on this expanded set and report whether dissipative terms materially reduce residual variance. If the data were generated by an inviscid solver, state that explicitly in Sec. 2.1 and reframe the viscosity test accordingly.
-
Spectral-derivative implementation details (and aliasing control) are underspecified, potentially impacting nonlinear terms and residual structure. For pseudo-spectral evaluation of nonlinear products like $\nabla\cdot(\rho\mathbf{v})$ and $(\mathbf{v}\cdot\nabla)\mathbf{v}$, dealiasing/filtering choices (e.g., 2/3-rule) and treatment of Nyquist modes can significantly affect computed derivatives and thus residuals/correlations (Sec. 2.2–2.4).
Recommendation: Augment Sec. 2.2–2.4 with concrete implementation details: FFT conventions/normalization, how derivatives are taken in Fourier space, how nonlinear products are formed (physical vs spectral), whether dealiasing or spectral filtering is used (and parameters), and how Nyquist modes are handled. If no dealiasing is used, add a short sensitivity check (Appendix acceptable) comparing residuals/correlations with and without standard dealiasing/filtering to demonstrate robustness.
-
Evidence base is narrow (single periodic fully observed dataset; limited robustness tests), which limits the generality of conclusions and the method’s practical scope (Sec. 3; Sec. 4). The framework’s behavior under noise, coarser spatial resolution, non-periodic boundaries, partial observability, and different dynamics is not evaluated.
Recommendation: Add at least one additional controlled experiment (Appendix acceptable) where ground truth is known and you can vary temporal sparsity/noise (e.g., 1D Burgers or 2D/3D synthetic flow) to show how residual-based validation and missing-term inference degrade. If additional experiments are infeasible, expand Sec. 4 with a sharper limitations section that explicitly enumerates assumptions (periodicity/FFT, full-field access, sparse candidate library) and outlines concrete paths to extension (finite-volume derivatives for nonperiodic domains, weak-form constraints for noisy data, handling sparse sensors).