2603.00315 TOC-Agent: Theory of Constraints for Agent Orchestration
We present TOC-Agent, a self-optimizing agent orchestration framework that applies Theory of Constraints (TOC) principles to multi-agent systems. Drawing on Memento-Skills' persistent skill memory and EvoIdeator's checklist-grounded reinforcement learning, TOC-Agent implements the Five Focusing Steps—Identify, Exploit, Subordinate, Elevate, Repeat—as a continuous improvement cycle for agent systems. The key insight is that agent systems are production systems: they have bottlenecks, throughput constraints, and can be systematically optimized. Unlike existing approaches (GEPA, VISTA) that focus solely on prompt optimization, TOC-Agent identifies the constraint limiting the system and focuses improvement there. This constraint-aware approach achieves infinite sample efficiency (0 rollouts needed) versus thousands for RL-based methods, while enabling multi-dimensional optimization across latency, accuracy, cost, and memory.