Breaking New Ground: humanity.ai Sets New Benchmark Records with iCon Modular AI
humanity.ai's modular AI beats leading models on key benchmarks, delivering superior reasoning with 300B parameters, all running on MacBooks, not datacenters. Hardware-efficient, private, and fully scalable.
The journey toward Artificial General Intelligence (AGI) requires not just bigger models, but fundamentally better architectures. At humanity.ai, we're building precisely that with our iCon modular AI framework—a system purpose-built to overcome the structural weaknesses of monolithic LLMs.
Today, we're proud to share the latest benchmark results for our humanity.ai Junior model, powered by iCon’s modular hybrid architecture. Although it runs as a combined 300B-parameter system, humanity.ai Junior matched or outperformed much larger models while operating efficiently on commodity hardware.

Why this matters:
- Superior reasoning: On ARC-AGI-2, widely regarded as one of the most challenging AGI reasoning benchmarks, humanity.ai Junior achieved 28.7%, dramatically outperforming the sub-2% scores of frontier models.
- Hardware-efficient: All tests were run on MacBook Pros (M4 Max, 128GB RAM), with distributed processing on only two devices for heavier loads like GPQA-Diamond and MATH-500.
- Modular advantage: iCon’s architecture allows us to assemble small, specialized domain experts optimized for reasoning, verification, and task orchestration. This design eliminates much of the inefficiency, overgeneralization, and hallucination found in massive monolithic models.
- Rapid assembly: humanity.ai Junior was built and tuned for these evaluations in a matter of days, highlighting the agility, scalability, and maintainability that modular systems provide.
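To make the modular idea above concrete, here is a minimal Python sketch of an orchestrator that routes a task to a small domain expert and then passes the answer to a verifier. It is an illustration only, not iCon's actual implementation: the names `Expert`, `Orchestrator`, `math_expert`, `text_expert`, and `verify`, as well as the routing rule, are all hypothetical and chosen purely for this example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class Expert:
    """A small, specialized component (hypothetical interface)."""
    name: str
    solve: Callable[[str], str]


def is_integer(token: str) -> bool:
    """Return True if the token parses as a (possibly negative) integer."""
    try:
        int(token)
        return True
    except ValueError:
        return False


def math_expert(task: str) -> str:
    # Toy domain expert: sums whitespace-separated integers.
    return str(sum(int(tok) for tok in task.split()))


def text_expert(task: str) -> str:
    # Toy domain expert: collapses runs of whitespace.
    return " ".join(task.split())


def verify(answer: str) -> bool:
    # Toy verifier: accepts any non-empty answer.
    return bool(answer.strip())


class Orchestrator:
    """Routes each task to one expert, then runs the verifier on the result."""

    def __init__(self, experts: Dict[str, Expert], verifier: Callable[[str], bool]):
        self.experts = experts
        self.verifier = verifier

    def route(self, task: str) -> str:
        # Assumed routing rule: all-integer input goes to "math", the rest to "text".
        tokens = task.split()
        if tokens and all(is_integer(t) for t in tokens):
            return "math"
        return "text"

    def run(self, task: str) -> Tuple[str, str, bool]:
        expert = self.experts[self.route(task)]
        answer = expert.solve(task)
        return expert.name, answer, self.verifier(answer)


orchestrator = Orchestrator(
    experts={"math": Expert("math", math_expert), "text": Expert("text", text_expert)},
    verifier=verify,
)
print(orchestrator.run("2 3 5"))          # ('math', '10', True)
print(orchestrator.run("hello   world"))  # ('text', 'hello world', True)
```

In a design like this, adding a new capability means registering another expert and extending the routing rule, rather than retraining one large model.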
Where we’re headed next:
This is just the beginning. Stay tuned for:
- Demo videos showing humanity.ai in action
- Third-party benchmark verification
- Advances in humanity.ai's self-refining and self-evolving capabilities, which allow the system to autonomously build new domain experts when gaps are detected.
Our north star remains clear: to build a scalable, hardware-agnostic AGI platform that delivers enterprise-grade intelligence with full data privacy, enabling organizations and individuals alike to own and control their AI, rather than relying on centralized cloud vendors.
If you're a developer, researcher, or AI enthusiast interested in getting involved with our work, feel free to contact us: [email protected]
—The humanity.ai Team