A new hierarchical reasoning model inspired by human brain structure
The model developed by researchers at AI company Sapient is called a hierarchical reasoning model or HRM, as reported by Live Science. According to the report, HRM is inspired by the layered approach of the human brain to process information, and breaks from the architecture used in popular leading language models (LLMs) such as ChatGpt, Claude and Deepseek.
Today's AI standards have only 27 million parameters, trained with just 1,000 examples, and despite the need to peer review, the report already makes a big impression.
According to a report by Live Science, traditional LLM relies on “thinking” (COT) (COT) reasoning.
However, according to the Sapient team, COT has restrictions, including “brittle task decomposition, extensive data requirements, and high latency,” as cited in the report.
Also Read: Peter Thiel says nvidia dominates the AI race, not Musk, Altman, or Zuckerberg
ISTOCKAI model inspired by a new brain 2025
How HRM mimics brain function with two special modules
HRM takes a different route. Its design includes two interconnected modules. A high-level module that handles slowly abstract plans, and a low-level module that manages rapid, detailed calculations, as reported by Live Science. This, according to the report, reflects the way humans process information in different regions.
Rather than solving problems in stages, HRM uses a computing method called iterative improvement. Here, according to Live Science, we gradually improve the accuracy of the solution by repeatedly improving the initial approximation with several short bursts of “thinking.” Each burst, according to the report, considers whether it should continue the thought process or submit as a “final” answer to the first prompt.
Also Read: Labor Day Weekend Trips: Top 5 Affordable Destinations as Flight Prices Slow to Five Years' First Descends
Up to creativeHRM surpasses its rivals in the ARC-AGI benchmark test
According to a report from Live Science, researchers used the ARC-AGI benchmark to test HRM. This used the ARC-AGI benchmark.
On the ARC-AGI-1 benchmark, HRM scored 40.3%. This is significantly higher than Openai's O3-Mini-High (34.5%), Anthropic's Claude 3.7 (21.2%), and Deepseek R1 (15.8%), with live science reported.
In even more stringent ARC-AGI-2 tests, HRM still led the pack at 5%, but according to the report it was 3% for OpenAI, 1.3% for Deepseek and 0.9% for Claude.
ReutersSuccess in complex Sudoku and maze tasks
HRM also managed to solve complex Sudoku puzzles. This is something that most large language models still struggle with, and were able to find the best path in the maze navigation task, reported Live Science.

Unexpected discoveries from independent verification
After Sapient shared its model and findings on the Preprint site Arxiv on June 26th and open-sourcing code on Github, the team behind ARC-AGI independently validated the results and found something unexpected.
Although the hierarchical structure of HRM was innovative, it was not the only factor behind its powerful performance. During training there was a sophisticated refinement process that promoted substantial performance improvements, Live Science reported.
FAQ
Why is HRM different from ChatGpt or Claude?
HRM uses a brain-inspired two-module system to repeatedly refine answers, unlike traditional models that use mindset inference to split problems.
What problems can HRM solve over other AI?
HRM is excellent for complex inference tasks such as Sudoku Puzzles and Mazes' optimal path investigation.
