Skip to main content

Introduction

Claude Octopus is a Claude Code plugin that turns one model into three. It orchestrates Codex, Gemini, and Claude with distinct roles, adversarial review, and consensus gates — so no single model’s blind spots slip through.

What problem does it solve?

Other multi-AI tools run providers in parallel and hand you three answers. Claude Octopus assigns each model a distinct role — Codex for implementation depth, Gemini for ecosystem breadth, Claude for synthesis — then enforces a 75% consensus gate before anything ships. Disagreements get caught, not ignored.

Key features

Multi-AI orchestration

Coordinate three AI providers across every workflow:
ProviderRole
CodexImplementation depth — code patterns, technical analysis, architecture
GeminiEcosystem breadth — alternatives, security review, research synthesis
ClaudeOrchestration — quality gates, consensus building, final synthesis
Providers run in parallel for research, sequentially for problem scoping, and adversarially for review. A 75% consensus quality gate prevents questionable work from shipping.

Double Diamond methodology

Every task moves through four structured phases adapted from the UK Design Council’s framework:
PhaseCommandWhat happens
Discover/octo:discoverMulti-AI research and broad exploration
Define/octo:defineRequirements clarification with consensus
Develop/octo:developImplementation with quality gates
Deliver/octo:deliverAdversarial review and go/no-go scoring
Run phases individually or all four with /octo:embrace. Configure autonomy: supervised (approve each phase), semi-autonomous (intervene on failures), or autonomous (run all four).

33 specialized personas

Not generic agents. Specialized personas that activate automatically based on your request:
  • Software Engineering (11): backend-architect, frontend-developer, tdd-orchestrator, debugger, security-auditor, and more
  • Specialized Development (6): python-pro, typescript-pro, database-architect, ai-engineer, cloud-architect, graphql-architect
  • Documentation & Communication (5): docs-architect, product-writer, exec-communicator, academic-writer, mermaid-expert
  • Research & Strategy (4): research-synthesizer, strategy-analyst, ux-researcher, business-analyst
  • Business & Compliance (3): finance-analyst, legal-compliance-advisor, marketing-strategist
  • Creative & Design (4): Specialized visual and content creation
When you say “audit my API for vulnerabilities,” security-auditor activates. When you say “write a research paper,” academic-writer takes over.

46 commands

Eight commands that orchestrate up to three AI providers, apply quality gates, and produce deliverables:
/octo:embrace build stripe integration     # Full lifecycle: research → define → develop → deliver
/octo:factory "build a CLI"                # Autonomous pipeline — spec in, software out
/octo:debate monorepo vs microservices     # Structured three-way AI debate with consensus
/octo:research htmx vs react in 2026       # Multi-source synthesis from three AI providers
/octo:review                                # Code review with security + 4x10 scoring
/octo:tdd create user auth                 # Red-green-refactor with test discipline
/octo:security                              # OWASP vulnerability scan + remediation
/octo:prd mobile checkout redesign          # AI-optimized PRD with 100-point scoring
Including: debug, extract, deck, docs, schedule, parallel, sentinel, brainstorm, claw, doctor, and more.Don’t remember the command name? Just describe what you need:
/octo research microservices patterns    → routes to discover phase
/octo build user authentication          → routes to develop phase
/octo compare Redis vs DynamoDB          → routes to debate
The smart router parses your intent and selects the right workflow.

How it works at a high level

Three brains, one workflow

  1. Parallel research — Multiple AI providers research different angles simultaneously
  2. Consensus building — 75% agreement required before advancing
  3. Quality gates — Each phase validates before the next begins
  4. Adversarial review — Providers challenge each other’s work (“the squeeze”)

Dark Factory mode

Takes a spec and autonomously runs the full pipeline — research, define, develop, deliver — with holdout testing and satisfaction scoring. You review the output, not every step.
/octo:factory "build a CLI that converts CSV to JSON"

Works with just Claude. Scales to three.

Zero external providers needed to start. You get every persona, every workflow, every skill on day one. Add Codex or Gemini and multi-AI orchestration lights up — parallel research, adversarial debate, cross-model review.

Full cost transparency

Visual indicators show exactly which providers are active and whose billing they hit:
IndicatorMeaningCost Source
🐙Multi-AI mode activeMultiple APIs
🔴Codex CLI executingUser’s OPENAI_API_KEY
🟡Gemini CLI executingUser’s GEMINI_API_KEY
🔵Claude subagentIncluded with Claude Code
Smart cost routing picks the cheapest provider that can handle the job. No hidden API calls, no surprise charges.

Who should use it?

Claude Octopus is built for:

Developers

Building features that need research, design, implementation, and validation in one workflow

Teams

Projects requiring multiple perspectives and adversarial review to catch edge cases

Solo builders

Anyone who wants structured quality gates and multi-AI validation without manual coordination

Researchers

Knowledge workers who need deep research synthesis and consensus-driven insights
Start simple: Use Claude Octopus with just Claude built-in. Add external providers (Codex, Gemini) when you need multi-AI features like parallel research and adversarial debate.

What’s next?

Quickstart

Get to your first workflow in under 5 minutes

Installation

Detailed setup with provider configuration

Build docs developers (and LLMs) love