Talks

Automated Proof Generation for Rust Code Using LLM

Xuheng Li - Columbia University

IRB 5105 or https://umd.zoom.us/j/91477081269?pwd=Zia5u3gDaI87nbIEz0FstJqb4vUuP0.1

Wednesday, February 12, 2025, 2:00-3:00 pm

You are subscribed to this talk through .
You are watching this talk through .
You are subscribed to this talk. (unsubscribe, watch)
You are watching this talk. (unwatch, subscribe)
You are not subscribed to this talk. (watch, subscribe)

Abstract

Generative AI has shown its values for many software engineering tasks. Still in its infancy, large language model (LLM)-based proof generation lags behind LLM-based code generation. We build an automated verification framework, AutoVerus. AutoVerus uses LLM to automatically generate correctness proof for Rust code. AutoVerus is designed to match the unique features of Verus, a verification tool that can prove the correctness of Rust code using proofs and specifications also written in Rust. AutoVerus consists of a network of LLM agents that are crafted and orchestrated to mimic human experts’ three phases of proof construction: preliminary proof generation, proof refinement guided by generic tips, and proof debugging guided by verification errors. To thoroughly evaluate AutoVerus and help foster future research in this direction, we have built a benchmark suite of 150 non-trivial proof tasks, based on existing code-generation benchmarks and verification benchmarks. Our evaluation shows that AutoVerus can automatically generate correct proof for more than 90% of them, with more than half of them tackled in less than 30 seconds or 3 LLM calls.

Bio

Xuheng Li is a fifth-year Ph.D. student at the Software Systems Laboratory at Columbia University. His research interest spawns over operating systems and formal verification. He dedicated his Ph.D. research to building and verifying reliable systems for confidential computing, including the first verified Arm CCA firmware.

This talk is organized by Samuel Malede Zewdu