Formal Guarantees for Frontier AI
Formal verification is often dismissed as too rigid, complex, or unscalable for frontier AI systems (LLMs, VLMs, agentic systems), pushing many researchers and developers toward less rigorous alternatives such as benchmarking, red teaming, and adversarial attacks. This talk presents a new class of efficient formal verification methods that certify safety properties, such as secure code generation and the absence of catastrophic conversational risks. These methods deliver stronger generalization guarantees than standard evaluation approaches and position formal verification as a necessary foundation for reliable AI systems at scale.