Our Data

Our datasets are configurable across six independent dimensions. Items marked with * are in active development.

Problem Format

VQA: Verifiable question answer problems with a precise numeric or symbolic final answer.

Subproblems*: Problems structured into intermediate steps leading to a final answer.

Formal Proofs*: Problems formalized in Lean for machine checkable proofs.

Difficulty Calibration

Frontier Targeting: Problems calibrated to challenge the most capable models available today.

Configurable pass@N: Target solve rates at specified attempt counts.

Agentic*: Multi step problems designed for workflows that require tool use or multiple API calls.

Problem Style

Computational: Requires code execution or algorithmic search, often with nontrivial runtime.

Analytical: Proof oriented or insight driven problems that reward structural reasoning.

Academic Level

Undergraduate: University level problems and contest style problems.

Graduate: Qualifying exam level problems and advanced coursework level problems.

Post Graduate: Research level problems requiring deep domain expertise.

Training Stage

Mid Training: High quality mathematical content for continued pretraining and fine tuning.

RL Post Training: Reward verifiable problems designed for reinforcement learning pipelines.

Modality

Text Only: Pure symbolic and textual problem statements with numeric or symbolic answers.

Multimodal: Problems involving diagrams, figures, or visual reasoning.