Agents & Skills

276 built across 36 systems — 75 Claude Code agents, 163 skills, and 38 in-app agentic systems (LangGraph / LangChain / custom). 174 of them run on both Claude Code and Codex.

75agents 163skills 38in-app systems 20dual-host systems
build-loop 28 agents · 51 skills Claude CodeCodex
Agent
advisor

Frontier (Fable) role that authors and re-plans Phase 2 plan synthesis — decomposes work, builds dependency graph, MECE-partitions ownership, issues corrected instructions on planning misses.

Agent
alignment-checker

Sonnet advisory judge scoring autonomous-iterate-loop queue items (aligned/misaligned/uncertain) against intent.md/goal.md/constitution; never blocks, only routes.

Agent
api-assessor

Debugging domain assessor for API/REST/GraphQL errors, auth, rate limiting, and server-side route handlers.

Agent
build-loop:architecture-scout

Read-only architecture analyst dispatched by the orchestrator for baseline/chunk-impact/review-rules/iterate-subgraph/learn-sync tasks; decides native engine vs NavGator escalation.

Agent
assessment-orchestrator

Opus-tier coordinator that fans out parallel database/frontend/API/performance assessors for multi-domain debugging.

Agent
build-orchestrator

Top-level Opus orchestrator for the 5-phase build loop (Assess→Plan→Execute→Review→Iterate, optional Learn) with a 7-step Review sub-flow.

Agent
database-assessor

Debugging domain assessor for query/migration/schema/Prisma/Postgres/connection-pool/vector-index issues.

Agent
design-contract-specialist

Sole writer to .build-loop/app-contract/{ui.md,data.md,traceability.json}; chooses UI design direction and reconciles design/data deltas after implementation.

Agent
fact-checker

Fable-tier validator tracing all rendered data/claims/metrics to their sources before completion.

Agent
fix-critique

Fable-tier adversarial pressure-test of a proposed fix — root cause vs symptom, regression risk, evidence check.

Agent
frontend-assessor

Debugging domain assessor for React/hooks/rendering/hydration/client-state issues.

Agent
implementer

Applies a single ux-fix-plan.md or targeted fix plan from the Phase 5 work list; designed for parallel fan-out (≤4 in flight).

Agent
independent-auditor

Fable-tier LLM escalation for the deterministic commit-audit hook; renders a four-option verdict on large/boundary-crossing diffs.

Agent
mock-scanner

Haiku-tier fast scan for residual mock/placeholder/fake/private/secret data in production code paths.

Agent
optimize-runner

Executes the optimization loop — generates hypotheses, makes atomic changes, measures metrics, keeps or reverts, until convergence/budget exhaustion.

Agent
overfitting-reviewer

Fable-tier read-only adversarial review of optimization results for overfitting/Goodharting/test-gaming.

Agent
performance-assessor

Debugging domain assessor for slowness/latency/memory-leak/CPU/bottleneck symptoms.

Agent
plan-critic

Fable-tier adversarial critique of a Phase 2 plan for non-deterministic issues (alternatives, MECE quality, marker adequacy) after plan-verify runs.

Agent
promotion-reviewer

Fable-tier advisory judge for Phase 6 experimental-artifact promotion (approve/rethink/new_approach); never blocks, feeds user-confirmation gate.

Agent
recurring-pattern-detector

Haiku-tier pattern-matcher scanning state.json run logs for recurring failures/patterns across 3+ runs; no judgment, just detection.

Agent
retrospective-synthesizer

Post-push background synthesizer writing a 9-section retrospective + summary from the session transcript/state.json/intent/plan.

Agent
root-cause-investigator

Deep causal-tree investigator for debugging symptoms beyond surface diagnosis; flags when external/environmental research is needed.

Agent
scope-auditor

Fable-tier read-only Plan→Execute boundary check tracing callers of any changed public signature outside a commit's owned-files.

Agent
security-reviewer

Fable-tier adversarial security review of implementer output against OWASP LLM/Agentic/Web Top 10 and MITRE ATLAS, gated on riskSurfaceChange.

Agent
self-improvement-architect

Drafts a concrete experimental SKILL.md or agent definition from a recurring-pattern-detector proposal, written to .build-loop/skills/experimental/.

Agent
synthesis-critic

Sonnet-tier read-only critic for subjective synthesis dimensions (copy_tone, empty_state) on UI-touching commits; WARN-only, never blocks.

Agent
transcript-pattern-miner

Pure stdlib regex miner of local Claude Code transcripts for corrections/tool sequences/file patterns/command rituals/secret exposures; no LLM calls.

Agent
ui-validator

Deterministic UI scanner against the running dev server (layout collisions, touch targets, console errors, hydration, visual SSIM) with its own authed browser session.

Skill
agent-rally-point

Coordinate build-loop with peer coding agents via Rally Point (presence/inbox/handoffs).

Skill
agent-rally-watcher

Listen for and wire Rally Point change watchers.

Skill
api-registry-bridge

Consult the api-registry plugin when a new API dependency is detected; degrades gracefully if absent.

Skill
build-loop:architecture-dead

Scan for dead/orphaned code; native copy of NavGator's dead-code scan.

Skill
build-loop:architecture-impact

Blast-radius analysis tracing upstream/downstream dependencies before a refactor.

Skill
build-loop:architecture-review

Heavyweight architectural integrity review for builds crossing 2+ layers; native copy from NavGator.

Skill
build-loop:architecture-rules

Detects orphans, layer violations, circular deps, and hotspots; native rules engine from NavGator.

Skill
build-loop:architecture-scan

Refreshes native component/connection data in .build-loop/architecture/.

Skill
build-loop:architecture-trace

Follows a component's connections end-to-end; native trace copied from NavGator's CLI.

Skill
attribution-standard

Apply the four-layer Apache-2.0 attribution model (NOTICE, SPDX, REUSE.toml, canary markers) to a repo.

Skill
authentication

Wire/debug auth (Better Auth, Supabase Auth, Google OAuth, Resend) for a new or existing project.

Skill
auto-decision-capture

Signal taxonomy and extraction prompts for proactive in-session decision capture into build-loop-memory.

Skill
auto-finding-capture

Default-on auto-capture of severity-labeled findings/issues into the backlog regardless of source.

Skill
build-loop

Main orchestrated build-loop entrypoint for multi-step code work; verb/symptom trigger language.

Skill
building-with-deepagents

Build/refactor agents importing OSS deepagents (create_deep_agent) — SubAgent API, middleware, tool scoping.

Skill
build-loop:capabilities

Populates state.json.activeCapabilities with ≤8 relevant entries per phase via plugin-surface collapse.

Skill
debug-loop

Deep iterative debugging loop for when a fix didn't hold or root-cause analysis is needed.

Skill
debugging-memory

Memory-first debugging workflow; owns verdict-handling and delegates lookup to debugging-memory-search.

Skill
build-loop:debugging-assess

Parallel domain assessment fan-out (db/frontend/api/perf) for complex debugging symptoms.

Skill
build-loop:debugging-memory-search

Search primitive for local build-loop incidents plus optional standalone Coding Debugger memory.

Skill
build-loop:debugging-store

Stores a debugging incident record after a bug fix; native local storage with optional Coding Debugger mirror.

Skill
build-loop:defenseclaw-bridge

Bridges the orchestrator to the defenseclaw project's eval/test conventions when detected; otherwise no-op.

Skill
handoff

Compose a durable build-loop handoff document and optionally launch a fresh session with it injected.

Skill
build-loop:ibr-bridge

Routes UI visual verification to the IBR plugin when installed, else falls back to native-ax-driver/ui-validator.

Skill
knowledge-review

Read-only repo-local episodic memory review surface (decision rot, stale procedures, open conflicts).

Skill
knowledge

Canonical build-loop-memory framework for recording decisions/ADRs, regenerating indexes, and recall.

Skill
build-loop:logging-tracer-bridge

Optional escalation to standalone Coding Debugger for extended observability/tracer backends.

Skill
logging-tracer

Generates stack-appropriate logging/tracing with optional OpenTelemetry for silent failures.

Skill
loop-builder

Generates declarative focused-loop specs, presets, validators, and skill-chain plans.

Skill
mcp-builder

Build/package/debug an MCP server or add MCP tools to a plugin.

Skill
model-bakeoff

Runs a controlled multi-model bake-off across isolated git worktrees with deterministic re-scoring and best-of grafting.

Skill
model-tiering

Two-axis taxonomy (work-role segment × 7-rung capability ladder) for choosing a subagent's model tier.

Skill
build-loop:native-ax-driver

Drives running macOS apps via Accessibility API without touching the hardware cursor; self-contained Swift binary.

Skill
optimize

Design-of-Experiments test matrix (up to 6 variables) to find which variable moves a target metric.

Skill
plan-verify

Runs deterministic plan-verify rules on a Phase 2 plan and emits findings JSON.

Skill
plugin-builder

Create/scaffold/convert/migrate a Claude Code plugin — plugin.json, layout, hooks, MCP, agents, LSP.

Skill
plugin-tests

Static-analysis test harness for Claude Code plugins (namesake collisions, manifest drift, MCP misregistration).

Skill
build-loop:prd-bridge

Surfaces PRD principles and Navigation Map into Phase 1 Assess when a project PRD exists.

Skill
recursive-retrospective

Recursive-learning retrospective encoding build history into memory, agents, skills, evals, and gates.

Skill
research

Generates a repo-grounded pre-decision research packet with risks, best path, confidence, next action.

Skill
root-cause-analysis

Blameless RCA producing durable system levers, tiered L0/L1/L2 by impact; delegated-to by debug-loop.

Skill
runtime-parity-verification

Verifies a running app's core flow against the backing source-of-truth before claiming a fix works.

Skill
build-loop:security-methodology

Threat-modeling and scoped-permission methodology backing the security-reviewer agent.

Skill
build-loop:security-scan

Deterministic OWASP scanner over the repo (secrets, injection, rate limiting, headers) mapped to OWASP IDs.

Skill
build-loop:self-improve

Scans recent runs for recurring patterns and auto-drafts experimental skills/agents with A/B tracking.

Skill
spec-writing

Writes a build-loop-compatible plan/spec, walking a completeness checklist and running plan-critic.

Skill
build-loop:sync-skills

Read-only drift check recomputing source_hash for skills/architecture/ and skills/debugging/.

Skill
telemetry

OpenTelemetry-first guidance for instrumenting observability/tracing/metrics/logs in an app.

Skill
ui-design

Selects UI design direction for design-contract-specialist from tokens, UI Guidance, IBR artifacts, and research.

Skill
verify-dispatch

Checklist for independently verifying a dispatched agent's claimed commits/tests, since a solicited peer check is not independent.

interface-built-right 1 agents · 26 skills Claude CodeCodex
Agent
design-validator

Scans live UI page, compares against user intent, fixes mismatches, re-scans. Invoked for 'check my UI', 'verify the design', or after building components.

Skill
design-director

Primary design-agent planner — design intent, specialist passes, target roles, validation criteria for page, flow, app, dashboard work.

Skill
web-design-router

Web archetype classifier — routes to defaults for dashboards, research tools, workbenches, AI chat, checkout, content, admin screens.

Skill
data-visualization

Chart-worthiness gates, chart routing, hierarchy, attribution, accessibility, validation for charts, graphs, dashboards, KPIs, metrics.

Skill
design-guidance

Pre-build design direction, Calm Precision rules, component pattern selection, active design tokens, validation handoff.

Skill
component-patterns

Reusable component blueprints for cards, nav, forms, dashboards, modals, tables, lists with Calm Precision principles.

Skill
design-system

Design token extraction, validation, design system compliance, initialize tokens, view principles, add rules.

Skill
design-implementation

Building UI from user descriptions, implementing designs, matching mockups, aligning frontend edits with intent.

Skill
design-validation

Post-build verification pass — audit UI, validate build, check accessibility, find regressions, compare before/after.

Skill
design-reference

Capturing and comparing design references, screenshotting pages, saving design inspiration from URLs.

Skill
iterative-refinement

IBR refinement loop orchestration — scan → fix → re-scan iteration management until validation passes.

Skill
cli-reference

CLI command reference for inspecting, testing, validating UI via IBR for web and native platforms.

Skill
interactive-testing

Form testing, click-through flows, search/login interaction testing, browser interaction assertions.

Skill
native-testing

iOS, watchOS, macOS native app scanning — touch targets, a11y labels, watchOS constraints, Fix Guide.

Skill
auto-verify

Configuring/troubleshooting IBR automatic before/after scan workflow on UI file edits.

Skill
ui-brainstorm-preamble

Pre-build UI brainstorming — explore directions before implementing, capture platform/scope/archetype context.

Skill
ui-guidance-library

Reusable UI guidance patterns and decision aids, indexes central and project-local templates.

Skill
mockup-gallery-bridge

Bridge between mockup gallery reviews and IBR scan verification, enforces approved target roles.

Skill
mobile-web-ui

Mobile web UI patterns — responsive design, touch targets, viewport handling, Material 3, iOS Safari, WCAG 2.2.

Skill
ios-design

iOS HIG rules — SwiftUI conventions, safe areas, SF Symbols, Dynamic Type, haptics, Liquid Glass.

Skill
ios-design-router

iOS archetype classifier — routes to defaults for 6 iOS app archetypes (Utility, Content, Productivity, Consumer, Editorial, Tool/Pro).

Skill
apple-platform

How to build for iOS, watchOS, macOS — architecture patterns, SwiftData, Swift concurrency, CI/CD, TestFlight.

Skill
macos-ui

macOS UI patterns — AppKit/SwiftUI, menu bar, window chrome, materials, Liquid Glass, notarization.

Skill
design

Use when planning or implementing UI with IBR in Codex — compact routing skill.

Skill
native

Use when validating iOS, watchOS, macOS UI with IBR in Codex.

Skill
ui-ux-guidance

Compact IBR guidance for Codex — Calm Precision, web archetypes, interaction states, mobile, content states, data viz, Mockup Gallery.

Skill
validate

Use when scanning, auditing, comparing, or testing UI with IBR in Codex.

UI Guidance 22 skills Claude Code
Skill
audio

Design auditory UI channels — sounds, tones, notifications, sonic logos, voice confirmation, spatial audio.

Skill
audit-against-patterns

Audit URLs or images against tagged pattern library, preferences profile, and ui-guidance principles.

Skill
components

Design interactive components — buttons, icons, forms; cross-platform craft with state coverage.

Skill
data-viz

Design charts, graphs, tables, KPIs, sparklines; decision-first titling and ruthless decluttering.

Skill
designer

Build design systems by revealed preference — users pick live previews; emits platform-aware DESIGN.md (web/iOS/macOS).

Skill
evidence-policy

Route design evidence gathering between source-of-truth reads (tokens, palette) and live IBR capture.

Skill
feedback

Design action feedback — confirmations, button states, loading, success/error states, haptics, sounds.

Skill
hierarchy

Design visual hierarchy — title/description/metadata ladders, typography scale, emphasis through size-weight-contrast.

Skill
ingest-image

Extract design patterns from screenshots or mockups; maps observable structure to ingestion schema.

Skill
ingest-url

Extract design patterns from live URLs via IBR scan; maps DOM and computed styles to ingestion schema.

Skill
ios

Build or audit SwiftUI/UIKit iOS apps — HIG essentials, archetype classifier, SF Symbols, Dynamic Type, haptics.

Skill
macos

Build or audit native macOS apps — window chrome, toolbars, menu bar, NSVisualEffectView, keyboard-first interaction.

Skill
motion

Design animations and transitions — page transitions, hover, sequenced reveal, shimmer, micro-interactions.

Skill
navigation

Design navigation and wayfinding — tab bars, drawers, breadcrumbs, back stacks, sheets, modals.

Skill
preferences

Set defaults for token shape (typography, color, touch targets, motion, error UX) or audit against owned-app reference set.

Skill
principles

Cross-medium design doctrine applicable to UI, writing, images, docs, data — principle stable across primitives.

Skill
refine-preferences

Discover or narrow design preferences via mockup-gallery A/B rounds; aggregates picks into preferences profile.

Skill
responsive

Design for multiple form factors and screen sizes — breakpoints, container queries, density per surface.

Skill
states

Design element state lifecycles — rest, hover, focus, active, selected, disabled, loading; empty/error/success screens.

Skill
tag-pattern

Mark captures as liked or disliked patterns; cross-references ui-guidance principles followed or violated.

Skill
tokens

Design token systems shipping across form factors without forking — scale, ladder, condition, composite patterns.

Skill
web

Build or audit web apps — semantic HTML, ARIA, responsive design, forms, WCAG 2.1 AA, keyboard navigation.

market-research-platform 14 skills · 2 in-app Claude CodeCodexApp-embedded
Skill
pptx-builder

Build production PowerPoint decks with a python-pptx primary engine plus an opt-in pptxgenjs polish layer for the cover slide; runs a QA gauntlet (typography, WCAG contrast, drift check) before delivery.

Skill
diagram-mapper

Turn a codebase, workflow, process doc, or org/financial model into an audit-grade diagram (Mermaid, SVG, HTML, or PPTX).

Skill
pyramid-principle-core

Canonical rule library for Barbara Minto's Pyramid Principle (SCQA, MECE, governing thought) that sibling skills depend on.

Skill
pyramid-audit

Audit existing writing against Pyramid Principle / MECE / SCQA rules and produce ranked, cited findings.

Skill
pyramid-short-form

Draft answer-first short-form prose (email, memo, exec summary) under ~500 words using SCQA and a governing-thought key line.

Skill
pyramid-long-form

Produce a full nested key-line outline for long-form reports, briefs, and whitepapers before prose drafting.

Skill
pyramid-presentation

Produce a slide-by-slide ghost-deck storyline (assertion headlines and key lines) for presentations before slide design.

Skill
docx-builder

Build Word (.docx) files to a locked typography spec (Arial 12pt, heading hierarchy, TOC, callouts) via python-docx mechanics.

Skill
tyrone-conversational-voice

Draft or rewrite short-form conversational writing (Slack, casual email, chat) in the author's voice.

Skill
tyrone-writing-core

Canonical style library (punch-then-unpack, em-dash hygiene, register map) underlying the other voice skills.

Skill
deck-structure

Owns argument/structure logic for PowerPoint decks (Minto Pyramid, MECE, Duarte, action titles) prior to handoff to pptx-builder.

Skill
tyrone-writing-reviewer

Pre-ship logic/fact review that flags contradictions, unsupported claims, and overreach with severity-tagged findings.

Skill
linkedin-writing-style

Draft or rewrite LinkedIn posts/comments in the author's analytical, research-backed voice.

Skill
doc-structure

Section architecture and narrative logic for Word documents and long-form reports, prior to handoff to docx-builder.

Skill
tyrone-professional-writing

Draft or rewrite structured professional writing (cover letters, memos, executive emails) in the author's voice.

In-app
Research pipeline (research.py) · custom

3-node linear pipeline (retrieve_semantic -> retrieve_graph -> analyze) that does semantic + graph retrieval then a single LLM call to produce a cited analysis (general/SWOT/competitive/trend). Self-labeled 'Research agent' with an 'A-RAG adaptive' docstring, but the implementation is a fixed linear sequence, not dynamic tool-choice.

In-app
Reporting pipeline (reporting.py) · custom

3-node pipeline (gather_findings -> extract_brief -> generate_outputs) that runs an LLM map-reduce extraction into a StrategyBrief, then fans out to format generators (pptx/docx/email/notes/outline).

stratagem 13 in-app App-embedded
In-app
stratagem-control-agent · claude-agent-sdk

Top-level orchestrator (Opus): plans via research-planner, delegates to specialist subagents, tracks progress, enforces Pyramid Principle output and artifact verification

In-app
research-planner · claude-agent-sdk

Decomposes research questions into structured task plans before data gathering

In-app
data-extractor · claude-agent-sdk

Extracts and verifies data from PDFs, websites, spreadsheets, and presentations

In-app
financial-analyst · claude-agent-sdk

Analyzes SEC filings, earnings reports, and financial statements; runs Python calculations

In-app
research-synthesizer · claude-agent-sdk

Synthesizes multi-source extracted data into MECE/Pyramid-structured narratives with citations

In-app
executive-synthesizer · claude-agent-sdk

Produces executive-ready SCQA briefs from synthesis output

In-app
flowchart-architect · claude-agent-sdk

Designs process architectures and flowchart visualizations as PowerPoint decks

In-app
design-agent · claude-agent-sdk

Designs visual layout/structure for deliverables applying Calm Precision principles

In-app
prompt-optimizer · claude-agent-sdk

Refines weak/ambiguous research prompts for accuracy and specificity

In-app
plan-validator · claude-agent-sdk

Monitors execution for scope/quality/source/goal drift against the research plan

In-app
source-verifier · claude-agent-sdk

Verifies synthesized claims against cited sources; flags unsupported claims and stale data

In-app
report-critic · claude-agent-sdk

Scores report quality (completeness, accuracy, structure, depth, actionability) against a 4.0/5.0 threshold

In-app
after-action-analyst · claude-agent-sdk

Post-run debrief that diagnoses execution quality and records lessons/memory updates for future runs

claude-code-debugger 7 agents · 3 skills Claude CodeCodex
Agent
api-assessor

Domain assessor for API/REST/GraphQL, auth, and route-handler debugging symptoms.

Agent
assessment-orchestrator

Coordinates parallel domain-specific assessors (db/frontend/api/performance) when the debugging symptom spans multiple domains.

Agent
database-assessor

Domain assessor for database/query/migration/Prisma/PostgreSQL debugging symptoms.

Agent
fix-critique

Pressure-tests a proposed fix before it's declared resolved (root cause vs symptom, regressions, evidence).

Agent
frontend-assessor

Domain assessor for React/hooks/rendering/state/hydration debugging symptoms.

Agent
performance-assessor

Domain assessor for latency/memory/CPU/bottleneck debugging symptoms.

Agent
root-cause-investigator

Builds a causal tree (not a linear 5-whys chain) to find true root causes across multiple branches.

Skill
debug-loop

7-phase iterative root-cause debugging loop (investigate, hypothesize, fix, verify, score, critique, report), up to 5 iterations.

Skill
debugging-memory

Memory-first debugging workflow that checks past incidents via the debugger MCP search tool before investigating a bug.

Skill
logging-tracer

Generates stack-appropriate structured logging/tracing code (optionally OpenTelemetry) with a zero-dependency-first tiered approach.

nimbalyst 1 agents · 7 skills · 2 in-app Claude CodeApp-embedded
Agent
e2e-runner

Runs Playwright/E2E tests inside a Docker devcontainer for isolated, reproducible execution; manages the full container lifecycle (build, run, cleanup). Uses haiku model.

Skill
extension-development

Build, install, and hot-reload Nimbalyst extensions using in-app MCP tools.

Skill
feedback-intake

Entry point for Nimbalyst's in-app bug-report/feature-request flow; gathers evidence, anonymizes, and posts to GitHub Issues.

Skill
git-commit

Drives Nimbalyst's interactive 'Commit with AI' widget; explicitly NOT for generic commit requests.

Skill
datamodellm

Create visual Prisma-format database schema / ER diagrams using Nimbalyst's DataModelLM editor.

Skill
mockuplm

Create .mockup.html UX mockups/wireframes for visual planning inside Nimbalyst.

Skill
planning

Create/track structured plan documents and work items (bugs/tasks/ideas) via YAML frontmatter in nimbalyst-local/plans/.

Skill
excalidraw

Create flowcharts/architecture diagrams/sketches as .excalidraw files; includes an explicit 'stop after one pass, don't thrash' rule.

In-app
ClaudeCodeProvider + TeammateManager (AI Teammates) · claude-agent-sdk

Lead-agent session driver built on claude-agent-sdk's query(); TeammateManager (2039 lines) spawns, messages, and manages the lifecycle (active/idle/resume) of sub-agent 'teammates' the lead agent creates at runtime.

In-app
nimbalyst-meta-agent (MetaAgentService) · mcp

MCP server + service letting a running agent session create/spawn/inspect/prompt sibling or child agent sessions across providers (Claude, Codex) and worktrees (tools: create_session, spawn_session, get_session_status/result, send_prompt, respond_to_prompt, list_spawned_sessions).

Travel Planner 9 agents Claude Code
Agent
backend-api-engineer

Implement, modify, or update server-side logic, API endpoints, database schemas, and business logic

Agent
code-review-security

Perform comprehensive security analysis on code changes, static code scanning, and dependency checks

Agent
devops-deployment-agent

Handle CI/CD pipelines, container builds, and cloud deployments with rollback capabilities

Agent
documentation-maintainer

Maintain, consolidate, and update documentation across repositories

Agent
front-end-engineer

Build, modify, and maintain React/TypeScript UI components with styling and state management

Agent
product-design-auditor

Champion usability for React/TypeScript web apps through heuristic audits and Calm Precision design

Agent
system-architect

Design and review overall architecture, API contracts, and data models for React/TypeScript applications

Agent
task-orchestrator

Coordinate multiple agents for complex tasks and ensure proper delegation across domains

Agent
testing-qa-automation

Run automated test suites including Jest, Playwright, and Lighthouse for quality assurance

NavGator 4 agents · 5 skills Claude CodeCodex
Agent
architecture-advisor

Specialized agent for architecture decisions, dependency management, and impact analysis. Use when planning significant changes to the tech stack.

Agent
architecture-investigator

Autonomous subagent for investigating architecture issues using NavGator MCP tools. Follows SRE-style read-only investigation before proposing changes.

Agent
architecture-planner

Reads NavGator's stored graph, decides whether data is stale, runs incremental scan if needed, then dispatches read-only analysis tools and aggregates structured reports.

Agent
external-resolver

Resolves external boundary nodes (npm/pip/spm/cargo/go/service/llm/infra) against upstream registries with canonical identity, version, and freshness verdict.

Skill
code-review

Architectural integrity review of system flow and connections using impact analysis, data flow tracing, and lessons tracking.

Skill
architecture-scan

Scan project architecture, check health, and monitor staleness using NavGator scan and status tools.

Skill
architecture-export

Generate architecture diagrams and export architecture summaries using mermaid diagrams and CLI exports.

Skill
navgator-setup

Install the plugin, update to latest version, or launch the web dashboard and UI maintenance.

Skill
impact-analysis

Analyze what's affected by changes and map component connections using impact analysis, connection viewing, and dataflow tracing.

market-research 9 agents Claude Code
Agent
backend-api-engineer

Implement/modify server-side logic, API endpoints, DB schemas, and business logic for the React/TS app; proactive when new data entities or API contracts change.

Agent
code-review-security

Security gate run after tests pass, before deploy: static analysis, dependency vulnerability scanning, blocks merges on critical issues.

Agent
devops-deployment-agent

Owns CI/CD, Docker builds, and cloud deployments (Vercel/Fly.io/AWS), including rollback procedures; runs proactively after code-review approval.

Agent
documentation-maintainer

Maintains/consolidates repo docs, prevents doc sprawl, merges LESSONS.md growth into docs, updates changelogs/API guides.

Agent
front-end-engineer

Builds/maintains React/TypeScript UI components, Tailwind styling, Calm Precision design tokens, Storybook docs, accessibility.

Agent
product-design-auditor

Heuristic UX audits, accessibility/WCAG review, proposes Calm Precision token tweaks, drafts BDD test scenarios.

Agent
system-architect

Owns end-to-end architecture and cross-agent coordination: API contracts, data models, tech-stack decisions across frontend/backend/infra.

Agent
testing-qa-automation

Runs Jest/Playwright/Lighthouse automated test suites proactively after merges or new test scaffolds, fails fast on regressions.

Agent
project-orchestrator

Coordinates substantial multi-system changes by engaging system-architect and other domain agents for planning and sequencing.

pptx-builder 5 agents · 2 skills Claude Code
Agent
persona-strategist

Owns the deck's argument before any layout decision. Turns a fuzzy ask into a Pyramid-shaped ghost deck with governing thought, key line, and action-titled outline with evidence placeholders.

Agent
persona-critic

Owns visual + structural quality. Audits the Builder's .pptx against hard-constraints, composition principles, and house-style instances. Returns findings with routing verdicts.

Agent
persona-researcher

Runs after Strategist and before storyline writing. Gathers and classifies information into a tiered evidence base (proven/directional/gap) that grounds downstream claims.

Agent
persona-builder

Owns structure.json → .pptx rendering. Reads Designer's layout decisions, calls python-pptx helpers, runs full QA gauntlet before signaling done.

Agent
persona-designer

Owns the visual system. Reads Strategist's ghost deck and auto-layout pipeline. Produces per-slide LayoutDecision objects the Builder consumes. Never calls python-pptx directly.

Skill
pptx-builder

Build PowerPoint (.pptx) decks with the calm-precision-pptx hybrid model. python-pptx is the primary structural engine; pptxgenjs polish layer optional for cover slides.

Skill
diagram-mapper

Turn any source artifact into a clean, audit-grade diagram with explicit node types, decision branches, and exception paths. Renders to Mermaid, SVG, HTML/interactive, or PowerPoint.

local-smartz 7 in-app App-embedded
In-app
Local Smartz Agent · langgraph

Main production research agent — DeepAgents single-agent flow with built-in write_todos planning, filesystem tools, and task-tool subagent delegation (planner/researcher/analyzer/writer/fact_checker/reviewer/reflector/worker role prompts)

In-app
Orchestrator pattern · custom

Lead agent decomposes task into worker subtasks, workers execute independently and register artifacts, lead synthesizes from artifact summaries only (telephone-game mitigation, one-level-only enforced)

In-app
Critic Loop pattern · custom

Evaluator-optimizer loop — writer agent generates content, critic agent scores/verdicts, writer revises until score threshold or max iterations

In-app
Reflection pattern · custom

Reflexion-style loop — same-model agent generates an answer, then self-critiques against a rubric and revises (bounded iterations, stagnation guard)

In-app
Parallel/Ensemble pattern · custom

Runs N sampled agent turns at varied temperature concurrently, then a synthesizer agent merges them into one answer (ensemble voting / hallucination catching)

In-app
Router pattern · custom

Classifier turn labels the query, then dispatches to one specialist agent turn scoped to that label

In-app
Chain pattern · custom

Linear staged agent pipeline (gather -> analyze -> write), each stage consuming the prior stage's output, optionally pinned to different models per stage

spectra 1 agents · 5 skills Claude CodeCodex
Agent
marketing-planner

Plans (and optionally produces) product marketing video for any software product; dispatches for launch strategies, audience-tailored content plans, and production.

Skill
demo

Transforms raw screen recordings into polished demo clips with spotlight focus, captions, and optional speed adjustment.

Skill
video-design

Tunes video craft — pacing, color, typography, sound, and emotion arc — and translates design choices into Spectra render parameters.

Skill
content-capture

Connects to and captures screenshots/video from running web, macOS, and iOS/watchOS simulator applications for marketing and documentation.

Skill
product-marketing

Plans product marketing content tailored to product type, audience, and funnel stage; routes to Spectra capture/polish tools for production.

Skill
host-walkthrough

LLM-driven sequential navigation through UI flows with capture; used by host agents (Claude Code, Codex) to plan from snapshots and drive Spectra execution.

agent-builder 1 skills · 4 in-app Claude CodeCodexApp-embedded
Skill
agent-builder

Design, evaluate, or rebuild agentic harnesses — tool use, permissions, workflow state, memory, evals, observability, multi-agent, framework selection. Activates on harness-gap symptoms too.

In-app
Chief of Staff Agent · custom

Multi-node orchestrator for schedule optimization, priority triage, time-block planning, decision logging, and follow-up tracking with role-based task assignment.

In-app
Local Knowledge Agent · custom

Document analysis and knowledge management agent with embeddings, retrieval, and local LLM summarization for meetings, documents, and research materials.

In-app
Plan Runner Agent · custom

Planning agent that constructs structured outlines, generates plan sections, and builds implementation roadmaps using local LLMs.

In-app
Meeting Transcript Agent · custom

Meeting analysis agent that extracts actions, decisions, and notes from meeting transcripts with semantic classification.

pyramid-principle 5 skills Claude CodeCodex
Skill
pyramid-principle-core

Canonical rule library for Pyramid Principle: SCQA, MECE, deductive/inductive logic, vertical/horizontal relationships, governing thought structure.

Skill
pyramid-short-form

Drafts answer-first short-form writing (emails, memos, exec summaries, one-pagers, BLUF notes) using Pyramid Principle structure.

Skill
pyramid-long-form

Structures long-form professional documents (reports, briefs, proposals, whitepapers) with full nested pyramid outlines before prose.

Skill
pyramid-presentation

Builds presentation storylines and slide decks with assertion-based headlines and logical flow following Pyramid Principle.

Skill
pyramid-audit

Audits existing writing against Pyramid Principle rules; produces structured violation findings and ranked fixes with source citations.

TruePace 4 in-app App-embedded
In-app
AIInsightService · custom

Orchestrates LLM requests between on-device (Apple Foundation Models) and cloud (Groq) providers with routing, caching, and state management for AI features

In-app
PlanCoachService · custom

Orchestrates plan critique generation by combining deterministic PlanCoachEngine logic with LLM inference and safety validation

In-app
WeeklyReviewService · custom

Orchestrates weekly review generation via AIInsightService with weekly caching, permission checks, and prompt construction

In-app
TaskDecomposer · custom

Routes task decomposition between Apple on-device Foundation Models and cloud providers for parsing raw text into structured tasks

replit-migrate 1 agents · 3 skills Claude CodeCodex
Agent
migration-analyst

Deep investigation agent that reads all source files in a Replit project, maps the full dependency graph, identifies hidden Replit dependencies, and produces a comprehensive migration readiness report.

Skill
migration-scan

Use to "scan this Replit app", "analyze for migration", "check Replit dependencies", or assess migration readiness; auto-triggers when a .replit file is present.

Skill
migrate-web

Use when the user asks to "migrate to web", "deploy to Vercel", "move to Cloudflare", "get off Replit", or convert a Replit app to standalone/self-hosted production.

Skill
migrate-ios

Use when the user asks to "make this an iOS app", "convert to native", "port to Swift", "SwiftUI version", or migrate a Replit app to native iOS/macOS.

agent-doe-engine 2 agents · 1 skills Claude CodeCodex
Agent
optimize-runner

Executes the agent-doe-engine autoresearch loop: generates one hypothesis per iteration, makes an atomic scoped change, measures every objective, and keeps/reverts by aggregate score until convergence or budget exhaustion.

Agent
overfitting-reviewer

Read-only adversarial reviewer that checks kept optimization commits for Goodhart violations, metric-gaming, safety removal, scope violations, and unvalidated-metric winners.

Skill
agent-doe-engine

Orchestrates DOE (Design of Experiments) and autoresearch optimization: plans/validates factors, sets multi-objective scoring (scalarize/desirability/pareto), runs the trial matrix or single-factor loop, and reviews results for overfitting.

api-registry 3 skills Claude Code
Skill
api-registry

Use BEFORE writing API config, env setup, auth, LLM client init, or debugging external services. Returns authoritative source URLs, latest version, and deprecation warnings. Triggers on any named library/API/tool.

Skill
build-loop-bridge

Contract for external plugins (build-loop, debugger, research) to consult api-registry during API setup or debug. Reads ~/.api-registry/registry.db; exits silently if absent.

Skill
docs-search

Use when the user asks how a library/API/tool works, requests docs for a named service, or asks about current models/versions/endpoints. Routes via api-registry to Context7 or WebFetch.

AI User Personas 2 agents · 1 skills Claude CodeCodex
Agent
persona-perspective-reviewer

Reviews a product, UI, workflow, or feature from one assigned persona perspective with evidence and measurement.

Agent
persona-panel-orchestrator

Selects task-specific persona perspectives, checks execution readiness, defines measurements, launches review passes, and synthesizes findings.

Skill
persona-lab

Turn user requests into focused persona panels for reviewing products, UIs, workflows, and features with task-specific perspectives, measurement, and evidence-based findings.

agent-rally-point 3 skills Claude CodeCodex
Skill
agent-rally-point

Use when working in a repository that uses Rally/Agent Rally Point for cross-agent coordination, especially at session start, before editing files, when deciding what to do next, handing work to another agent, recording facts/artifacts/decisions, resolving blockers, or coordinating with other coding agents through the `rally` CLI.

Skill
mini-loop

Ultra-light per-task quality loop for Rally Flow / dynamic-workflows fan-out. Wrap each fanned-out task in assess → plan → execute → mini-judge to raise accuracy, with zero tools or scripts.

Skill
rally-workflows

Use when fanning out work across multiple agents, running a dynamic workflow, coordinating parallel subagents, or splitting a workstream across hosts, terminals, or machines through Agent Rally Point.

atomize-ai 1 agents · 1 skills · 1 in-app Claude CodeApp-embedded
Agent
frontend-ui-engineer

Modifies/creates/fixes UI (HTML/CSS/frontend-backend integration), ensures data rendering and design-system consistency; invoked by the Chief orchestrator or directly for UI work.

Skill
db-investigation

Read-only Supabase/Postgres investigation for atomize-ai — pipeline freshness sweep and coverage-gap analysis (freshness.py, coverage_gap.py) used to diagnose stale/empty data before fixing.

In-app
Search Orchestrator V2 (runSearch) · custom

Multi-step intelligent-search pipeline — classifyIntent → planRetrieval → fan-out retrieval (vector+KG) → rerank → quality-guard (reformulate-and-retry on low relevance) → synthesize → numeric verification → normalize; feature-flagged, falls back to legacy search handler.

decision-doctor-cc 3 in-app App-embedded
In-app
Aida Chat Orchestrator · custom

Conversational multi-step router: detects specialty/intent, generates and adapts intake surveys, classifies via Stage 0, then dispatches to the Decision Engine or Recommendation Engine and streams the Groq completion back to the user

In-app
Decision Engine (runDecision, V1) · custom

Deterministic multi-stage decision pipeline (weights/AHP -> constraints -> weighting -> outranking -> ranking -> feasibility -> scaffold), with LLM calls at Stages 1, 5, 6 via Groq

In-app
Recommendation Engine / pain-to-AI recommender (runRecommendation, V2) · custom

Chains pain-path classification -> library retrieval -> candidate task generation -> 9-criteria scoring -> promotion (or the newer runWorkflowV2 decompose/score/lynchpin/horizon pipeline) to produce an AiTaskRecommendation

bookmark 1 agents · 1 skills Claude CodeCodex
Agent
snapshot-analyst

Agent for deep analysis of session context and snapshot comparison. Use for detailed context review, decision timelines, and coverage analysis.

Skill
context-continuity

Restores session via bookmark snapshots when user asks to continue from last session, restore context, or pick up where they left off.

atomize 1 agents · 1 in-app Claude CodeApp-embedded
Agent
frontend-ui-engineer

Frontend UI specialist handling HTML/CSS/TypeScript UI implementation, responsive design, accessibility, and frontend-backend integration.

In-app
SourceOrchestrator · custom

Multi-strategy content scraping orchestrator supporting RSS discovery, sitemap parsing, HTML scraping with fallback logic.

loop-builder 1 agents · 1 skills Claude CodeCodex
Agent
loop-builder

Turns a described workflow into a generated loop pack (loop.yaml, rubric.md, report template, validators, README.md).

Skill
loop-builder

Generates declarative focused-loop specs, presets, validators, and skill-chain plans for any host coding agent.

ProductPilot 1 in-app App-embedded
In-app
Adaptive Intake Controller · custom

Multi-step orchestrator that runs guided-conversation intake for a product spec: routes to a method (jtbd/qfd/pugh/agent) via a Haiku sub-call, scores unknowns 0-15 to decide ask-vs-infer, generates the next question or safe-default assumptions, then finalizes into a rendered spec draft.

atomize-news 1 agents Claude Code
Agent
frontend-ui-engineer

Handles UI/UX implementation (HTML/CSS/JS/TS), responsive layouts, accessibility, and frontend-backend data integration.

docx-builder 1 skills Claude Code
Skill
docx-builder

Build Word (.docx) files with locked typography, heading hierarchy (16/14/12pt), 1" margins, table header shading, callout boxes, auto-TOC. Pair with doc-structure for section planning.

blog-content-scraper 1 skills Claude CodeCodex
Skill
web-scraper

Extract blog and news content from any website via RSS, sitemap, or HTML scraping with LLM-ready output.

agent-builder-studio 1 skills Claude CodeCodex
Skill
agent-builder

Design, evaluate, or rebuild agentic harnesses — tool use, permissions, workflow state, memory, evals, observability, multi-agent, framework selection.

pptx-style-builder 1 skills Claude Code
Skill
pptx-style-builder

Capture PowerPoint .pptx visual style (theme colors, fonts, slide size, masters, layouts) into JSON spec, then generate new .pptx from Markdown/JSON content reusing that style; includes native chart support via R or python-pptx fallback.

prompt-builder 1 skills Claude CodeCodex
Skill
prompt-builder

Classify, diagnose, rewrite, and score prompts calibrated to model tier and deployment context.

research-plugin 1 skills Claude CodeCodex
Skill
research

Structured research methodology (5-phase general research, collection, synthesis, quantitative/database analysis, active-project ingestion) producing cited, verified findings persisted to ~/dev/research/.

mockup-gallery 1 skills Claude CodeCodex
Skill
mockup-review

Create scratch-first HTML mockups, rate mockups in gallery, check selections, align mockups to routes/screens, and implement approved designs from the mockup gallery.

threat-modeler 1 skills Claude CodeCodex
Skill
threat-modeler

Workshop assistant for producing threat-model artifacts — walks STRIDE decomposition, asset/actor enumeration, data-flow description, OWASP LLM/Agentic/ATLAS/NIST cross-mapping, mitigations, residual risk, decision log for agentic systems and risk-surface changes.

Generated from a cross-repo assessment; catalog stored in build-loop-memory and refreshed periodically.