Research - Ulam | DeepAlgebra Project & AI Research

Our research focuses on machine reasoning and artificial general intelligence, with mathematics as our primary test case for abstract thinking capabilities.

AI for Mathematics

We treat mathematics as the next game at which machines will excel, and as a test case for machine reasoning capabilities.

Key Focus: Building an autonomous mathematician that works 24/7.

▶ DeepAlgebra - an outline of a program, 2016
▶ Mathematics in the Age of Large Language Models, 2025
▶ Open Mathematical Problems as an AI Reasoning Benchmark, 2026
▶ UlamAI Prover: An Open-Source Lean 4 Theorem Prover and Formalizer, 2026
▶ Lean for Science Formalization, 2026
▶ ErdosBench: A Research-Mathematics Benchmark, 2026, GitHub repo, Samples1, Samples2

AI-powered Mathematics

Research-level mathematics produced while testing the reasoning capabilities of various LLMs. We have claimed solutions to 16 open Erdos problems (some formally verified in Lean), along with many partial results.

Key Focus: Proving research-level mathematics with AI. Benchmarking frontier models.

▶ Erdos Problem 1148, full solution (GPT-5.4 Pro), March 2026, formalization in Lean with UlamAI Prover
▶ Erdos Problem 258, full solution (GPT-5.4 Pro), April 2026, formalization in Lean
▶ Erdos Problem 858, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 888, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 514, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 522, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 603, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 610, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 856, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 896, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 953, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 956, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 1092, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 1133, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 1151, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 1090, full solution (GPT-5.5 Pro), April 2026
▶ Erdos Problem 750, full solution (GPT-5.5 Pro), May 2026
▶ Benes Conjecture and Shuffle Exchange Conjecture, counterexamples (GPT-5.5 Pro), Lean formalization, July 2026

Machine Reasoning Theory

Research-level reasoning of large language models.

Key Focus: Mathematical frameworks for human-like reasoning and intelligence.

▶ Research-level Reasoning Trajectories for RLVR, 2026
▶ Olympiad-level Math Trajectories, 2026
▶ RAVE and QeRAVE: Router-Aware Virtual Experts for Efficient Mixture-of-Experts LLM Compression, 2026