prompts-gpt orchestrate

Run multi-agent orchestration across providers with four modes: auto planning, parallel racing, pipeline chaining, and eval scoring.

Usage

prompts-gpt orchestrate --mode <auto|parallel|pipeline|eval> [-f <path>] [options]

Modes

Auto Mode

Describe a goal in plain English — the system plans the pipeline, generates prompts, executes end-to-end, and evaluates the result.

prompts-gpt orchestrate --mode auto --goal "Add user authentication with JWT, write tests, and update docs"

Use cases:

Quick end-to-end task execution from a single sentence
Prototyping workflows before building explicit pipelines
Hands-free execution for well-scoped goals

Key flags:

--goal <text> — Plain English description of what to accomplish
--max-steps <n> — Maximum pipeline steps to generate (default: 5, max: 10)
--quality-gate <n> — Minimum score (1-100) to consider the run successful (default: 70)

Parallel Mode

Race the same prompt across multiple providers simultaneously and compare results.

prompts-gpt orchestrate --mode parallel -f prompt.md --providers codex,claude,cursor

Use cases:

Compare how different providers solve the same problem
Pick the best result from multiple agents
Benchmark provider quality and speed

Key flags:

--providers <list> — Comma-separated providers to race
If omitted, auto-selects the top 2 available providers

Pipeline Mode

Chain multiple providers sequentially — output from step 1 feeds into step 2.

prompts-gpt orchestrate --mode pipeline -f prompt.md --steps pipeline.json

Use cases:

Research → Implement → Review workflows
Architecture → Code → Test pipelines
Multi-stage transformations

Steps file format (pipeline.json):

{
  "steps": [
    {
      "name": "research",
      "promptFile": ".prompts-gpt/orchestrations/pipeline-steps/research.md",
      "agent": "claude",
      "model": "claude-sonnet-4-20250514"
    },
    {
      "name": "implement",
      "promptFile": ".prompts-gpt/orchestrations/pipeline-steps/implement.md",
      "agent": "codex",
      "model": "gpt-5.5"
    },
    {
      "name": "review",
      "promptFile": ".prompts-gpt/orchestrations/pipeline-steps/review.md",
      "agent": "cursor"
    }
  ]
}

Eval Mode

Execute a prompt and automatically evaluate/score the output using configurable criteria.

prompts-gpt orchestrate --mode eval -f prompt.md --criteria correctness,completeness,quality

Use cases:

Automated quality assessment
Scoring agent outputs against rubrics
Benchmark and compare model performance

Key flags:

--criteria <list> — Comma-separated scoring dimensions
--evaluator <provider> — Provider for evaluation (can differ from executor)

Options

Common Options

Flag	Description
`--mode <mode>`	Required. `auto`, `parallel`, `pipeline`, or `eval`.
`-f, --prompt-file <path>`	Prompt file to orchestrate.
`--agent <name>`	Default provider profile.
`--model <name>`	Model override.
`--timeout <seconds>`	Execution timeout.
`--dry-run`	Preview without executing.
`--json`	Machine-readable output.
`--cwd <path>`	Project directory.

Auto-Specific

Flag	Description
`--goal <text>`	Plain English goal description.
`--max-steps <n>`	Maximum steps to generate (default: 5).
`--quality-gate <n>`	Minimum eval score (default: 70).
`--interactive`	Confirm generated plan before executing.

Parallel-Specific

Flag	Description
`--providers <list>`	Comma-separated providers to race.

Pipeline-Specific

Flag	Description
`--steps <path>`	JSON file defining pipeline steps.

Eval-Specific

Flag	Description
`--criteria <list>`	Scoring criteria (comma-separated).
`--eval-criteria <list>`	Alias for `--criteria`.
`--evaluator <provider>`	Provider for evaluation.

Examples

Auto: End-to-end from a goal

# Let the system plan and execute everything
prompts-gpt orchestrate --mode auto \
  --goal "Review the codebase for security issues, fix critical ones, and add tests"

# With quality gate and step limit
prompts-gpt orchestrate --mode auto \
  --goal "Refactor the auth module to use JWT" \
  --max-steps 4 \
  --quality-gate 80

# Interactive auto mode — confirm the plan before executing
prompts-gpt orchestrate --mode auto \
  --goal "Migrate from REST to GraphQL" \
  --interactive

# Dry run to preview the generated plan
prompts-gpt orchestrate --mode auto \
  --goal "Add dark mode support" \
  --dry-run

Parallel: Compare Codex vs Claude

prompts-gpt orchestrate --mode parallel \
  -f .prompts-gpt/refactor.md \
  --providers codex,claude

Pipeline: Research → Implement → Test

# First, scaffold the pipeline
prompts-gpt generate-orchestration --title "Feature Pipeline" --mode pipeline

# Then run it
prompts-gpt orchestrate --mode pipeline \
  --steps .prompts-gpt/orchestrations/feature-pipeline.json

Eval: Score quality

prompts-gpt orchestrate --mode eval \
  -f .prompts-gpt/code-review.md \
  --criteria correctness,completeness,security \
  --agent codex

Results

Auto Result

{
  mode: "auto";
  goal: string;
  generatedSteps: Array<{ name: string; prompt: string; provider: string; model: string }>;
  pipelineResult: PipelineResult | null;
  evaluation: EvalScore | null;
  totalDurationMs: number;
}

Parallel Result

{
  mode: "parallel";
  results: Record<string, RunPromptResult>;
  winner?: string;
  runnerUp?: string;
  totalDurationMs: number;
}

Pipeline Result

{
  mode: "pipeline";
  steps: Array<{
    name: string;
    result: RunPromptResult;
  }>;
  totalDurationMs: number;
}

Eval Result

{
  mode: "eval";
  runResult: RunPromptResult;
  scores: EvalScore[];
  overallScore: number;
  totalDurationMs: number;
}

Generate Orchestration Files

Use generate-orchestration to scaffold files:

# Interactive wizard
prompts-gpt generate-orchestration

# Non-interactive
prompts-gpt generate-orchestration \
  --title "Auth Feature" \
  --goal "Add user authentication" \
  --mode pipeline

prompts-gpt orchestrate

Usage

Modes

Auto Mode

Parallel Mode

Pipeline Mode

Eval Mode

Options

Common Options

Auto-Specific

Parallel-Specific

Pipeline-Specific

Eval-Specific

Examples

Auto: End-to-end from a goal

Parallel: Compare Codex vs Claude

Pipeline: Research → Implement → Test

Eval: Score quality

Results

Auto Result

Parallel Result

Pipeline Result

Eval Result

Generate Orchestration Files

See Also