Mistral's Leanstral 1.5 proves math theorems and finds real bugs in open source code

Wortins’ read

Formal verification has stayed a niche discipline because writing proofs by hand is brutally slow, so a model that talks directly to the Lean compiler and iterates until a proof checks out could pull rigorous verification into mainstream software engineering. The fact that it found a real bug while just doing math is the detail worth sitting with, since it hints these systems could eventually audit critical code the way they now audit essays. Open weights also mean nobody has to trust the benchmark claims blindly, they can run it themselves.

Read the full story at GIGAZINE→

Source: GIGAZINE

Mistral's Leanstral 1.5 proves math theorems and finds real bugs in open source code

Related stories

AMA2 gives AI agents their own messenger instead of bolting them onto Slack

OpenKnowledge brings Claude Code and Codex straight into a local markdown wiki

AI Layoff Wave Reverses as Ford, IBM and Commonwealth Bank Rehire Staff

Teaching AI to run with the turbines

LinqAlpha raises 22 million dollars to build an AI intelligence layer for public markets

Generative AI and physics team up to design new antibiotics from scratch