Our Process

We systematically study how leading models behave at the jagged frontier of intelligence. What kinds of reasoning steps break them? Where does pattern-matching stop working? That frontier shifts as models improve, and we track it.

This research informs everything we build. Our problems are designed to require actual thought: the kind of work where getting the right answer is strong evidence that you understood the problem.