Essay Project · AI, Mathematics, and Empirical Science

Epistemological asymmetry between empirical science and mathematics

On the tension between graded empirical feedback, exact deductive validity, and the place of AI systems between them.

Mathematics and empirical science do not reward discovery in the same way. In empirical science, models can improve through dense graded feedback and approximate success. In mathematics, validity is sparse and exact: a proof is either correct or it is not. This asymmetry helps explain why AI systems often advance rapidly in scientific modeling while facing a qualitatively different barrier in formal mathematical reasoning.

Himanshu Singh

Core thesis

In science, discovery can precede proof. In mathematics, proof is discovery.

Empirical science admits partial success, approximation, and graded evaluation. Mathematics demands exact validity under formal deduction. AI systems therefore enter the two domains under different epistemic conditions: in one, useful progress can accumulate before complete justification; in the other, the bottleneck is the discovery of a valid proof itself. The resulting asymmetry is not just practical. It is structural.

AI systems are trained empirically, but evaluated mathematically.
Conceptual contrast

Empirical discovery offers dense feedback; formal proof search offers sparse exact validity.

The figure below contrasts two knowledge regimes. On the left, empirical tasks admit many locally useful regions and continuous performance gradients. On the right, valid proofs become combinatorially sparse as reasoning depth increases, even when verification itself remains efficient.

Conceptual contrast between empirical discovery and formal proof search

Conceptual contrast. Empirical science can reward approximate progress through graded evaluation, whereas formal mathematics requires exact logical validity. This difference changes the search landscape confronted by AI systems.

Verification–search asymmetry

Verification can be efficient even when search is combinatorial.

Verification is cheap

Once a candidate proof is proposed, checking correctness can be comparatively efficient. This makes formal systems attractive as verifiable endpoints, but it does not solve the harder problem: how to discover the proof in the first place.

Search remains the bottleneck

The space of candidate symbolic sequences grows rapidly, while the fraction that correspond to valid proofs collapses. AI systems can therefore face a search problem whose difficulty is qualitatively different from ordinary empirical optimization.

Verification-search asymmetry in formal proof systems

Verification–search asymmetry. The cost of checking a proof can remain small even as the discovery process becomes combinatorial. This explains why verifier-guided systems may help, but do not erase the underlying sparsity of exact validity.

Implications for AI

AI systems operate between optimization and proof.

Modern AI systems are trained through empirical optimization, yet are increasingly applied to domains that demand formal correctness. This creates a structural mismatch between how models learn and how validity is defined.

Optimization vs validity

Neural models improve through gradient-based learning and dense feedback signals. Formal mathematics, however, requires discrete logical correctness, not approximate improvement.

Verification is not discovery

The availability of efficient verifiers does not remove the combinatorial nature of proof discovery. AI systems may generate candidates rapidly, but valid proofs remain sparse.

Why science progresses faster

In empirical domains, approximate correctness is useful. Models can improve incrementally, making scientific discovery more accessible to data-driven systems.

The role of hybrid systems

The future likely lies in combining empirical generation with formal verification, bridging statistical learning and symbolic reasoning.

Outlook

This asymmetry is not a limitation. It is a design constraint.

The distinction between empirical and formal knowledge systems suggests that no single learning paradigm will dominate both domains. Instead, progress will come from systems that respect this asymmetry: combining statistical learning for exploration with symbolic methods for verification and structure. Understanding this boundary is not merely philosophical. It is central to the design of future AI systems.

Understanding this asymmetry is not philosophical—it is architectural.

Paper

Technical note and essay materials

This project page is a visual companion to the underlying note. The full PDF carries the broader argument, supporting discussion, and additional framing around AI, scientific discovery, and mathematical reasoning.