Agents & Skills

276 built across 36 systems — 75 Claude Code agents, 163 skills, and 38 in-app agentic systems (LangGraph / LangChain / custom). 174 of them run on both Claude Code and Codex.

75agents 163skills 38in-app systems 20dual-host systems

build-loop 28 agents · 51 skills Claude CodeCodex

Agent

advisor

Frontier (Fable) role that authors and re-plans Phase 2 plan synthesis — decomposes work, builds dependency graph, MECE-partitions ownership, issues corrected instructions on planning misses.

Agent

alignment-checker

Sonnet advisory judge scoring autonomous-iterate-loop queue items (aligned/misaligned/uncertain) against intent.md/goal.md/constitution; never blocks, only routes.

Agent

api-assessor

Debugging domain assessor for API/REST/GraphQL errors, auth, rate limiting, and server-side route handlers.

Agent

build-loop:architecture-scout

Read-only architecture analyst dispatched by the orchestrator for baseline/chunk-impact/review-rules/iterate-subgraph/learn-sync tasks; decides native engine vs NavGator escalation.

Agent

assessment-orchestrator

Opus-tier coordinator that fans out parallel database/frontend/API/performance assessors for multi-domain debugging.

Agent

build-orchestrator

Top-level Opus orchestrator for the 5-phase build loop (Assess→Plan→Execute→Review→Iterate, optional Learn) with a 7-step Review sub-flow.

Agent

database-assessor

Debugging domain assessor for query/migration/schema/Prisma/Postgres/connection-pool/vector-index issues.

Agent

design-contract-specialist

Sole writer to .build-loop/app-contract/{ui.md,data.md,traceability.json}; chooses UI design direction and reconciles design/data deltas after implementation.

Agent

fact-checker

Fable-tier validator tracing all rendered data/claims/metrics to their sources before completion.

Agent

fix-critique

Fable-tier adversarial pressure-test of a proposed fix — root cause vs symptom, regression risk, evidence check.

Agent

frontend-assessor

Debugging domain assessor for React/hooks/rendering/hydration/client-state issues.

Agent

implementer

Applies a single ux-fix-plan.md or targeted fix plan from the Phase 5 work list; designed for parallel fan-out (≤4 in flight).

Agent

independent-auditor

Fable-tier LLM escalation for the deterministic commit-audit hook; renders a four-option verdict on large/boundary-crossing diffs.

Agent

mock-scanner

Haiku-tier fast scan for residual mock/placeholder/fake/private/secret data in production code paths.

Agent

optimize-runner

Executes the optimization loop — generates hypotheses, makes atomic changes, measures metrics, keeps or reverts, until convergence/budget exhaustion.

Agent

overfitting-reviewer

Fable-tier read-only adversarial review of optimization results for overfitting/Goodharting/test-gaming.

Agent

performance-assessor

Debugging domain assessor for slowness/latency/memory-leak/CPU/bottleneck symptoms.

Agent

plan-critic

Fable-tier adversarial critique of a Phase 2 plan for non-deterministic issues (alternatives, MECE quality, marker adequacy) after plan-verify runs.

Agent

promotion-reviewer

Fable-tier advisory judge for Phase 6 experimental-artifact promotion (approve/rethink/new_approach); never blocks, feeds user-confirmation gate.

Agent

recurring-pattern-detector

Haiku-tier pattern-matcher scanning state.json run logs for recurring failures/patterns across 3+ runs; no judgment, just detection.

Agent

retrospective-synthesizer

Post-push background synthesizer writing a 9-section retrospective + summary from the session transcript/state.json/intent/plan.

Agent

root-cause-investigator

Deep causal-tree investigator for debugging symptoms beyond surface diagnosis; flags when external/environmental research is needed.

Agent

scope-auditor

Fable-tier read-only Plan→Execute boundary check tracing callers of any changed public signature outside a commit's owned-files.

Agent

security-reviewer

Fable-tier adversarial security review of implementer output against OWASP LLM/Agentic/Web Top 10 and MITRE ATLAS, gated on riskSurfaceChange.

Agent

self-improvement-architect

Drafts a concrete experimental SKILL.md or agent definition from a recurring-pattern-detector proposal, written to .build-loop/skills/experimental/.

Agent

synthesis-critic

Sonnet-tier read-only critic for subjective synthesis dimensions (copy_tone, empty_state) on UI-touching commits; WARN-only, never blocks.

Agent

transcript-pattern-miner

Pure stdlib regex miner of local Claude Code transcripts for corrections/tool sequences/file patterns/command rituals/secret exposures; no LLM calls.

Agent

ui-validator

Deterministic UI scanner against the running dev server (layout collisions, touch targets, console errors, hydration, visual SSIM) with its own authed browser session.

Skill

agent-rally-point

Coordinate build-loop with peer coding agents via Rally Point (presence/inbox/handoffs).

Skill

agent-rally-watcher

Listen for and wire Rally Point change watchers.

Skill

api-registry-bridge

Consult the api-registry plugin when a new API dependency is detected; degrades gracefully if absent.

Skill

build-loop:architecture-dead

Scan for dead/orphaned code; native copy of NavGator's dead-code scan.

Skill

build-loop:architecture-impact

Blast-radius analysis tracing upstream/downstream dependencies before a refactor.

Skill

build-loop:architecture-review

Heavyweight architectural integrity review for builds crossing 2+ layers; native copy from NavGator.

Skill

build-loop:architecture-rules

Detects orphans, layer violations, circular deps, and hotspots; native rules engine from NavGator.

Skill

build-loop:architecture-scan

Refreshes native component/connection data in .build-loop/architecture/.

Skill

build-loop:architecture-trace

Follows a component's connections end-to-end; native trace copied from NavGator's CLI.

Skill

attribution-standard

Apply the four-layer Apache-2.0 attribution model (NOTICE, SPDX, REUSE.toml, canary markers) to a repo.

Skill

authentication

Wire/debug auth (Better Auth, Supabase Auth, Google OAuth, Resend) for a new or existing project.

Skill

auto-decision-capture

Signal taxonomy and extraction prompts for proactive in-session decision capture into build-loop-memory.

Skill

auto-finding-capture

Default-on auto-capture of severity-labeled findings/issues into the backlog regardless of source.

Skill

build-loop

Main orchestrated build-loop entrypoint for multi-step code work; verb/symptom trigger language.

Skill

building-with-deepagents

Build/refactor agents importing OSS deepagents (create_deep_agent) — SubAgent API, middleware, tool scoping.

Skill

build-loop:capabilities

Populates state.json.activeCapabilities with ≤8 relevant entries per phase via plugin-surface collapse.

Skill

debug-loop

Deep iterative debugging loop for when a fix didn't hold or root-cause analysis is needed.

Skill

debugging-memory

Memory-first debugging workflow; owns verdict-handling and delegates lookup to debugging-memory-search.

Skill

build-loop:debugging-assess

Parallel domain assessment fan-out (db/frontend/api/perf) for complex debugging symptoms.

Skill

build-loop:debugging-memory-search

Search primitive for local build-loop incidents plus optional standalone Coding Debugger memory.

Skill

build-loop:debugging-store

Stores a debugging incident record after a bug fix; native local storage with optional Coding Debugger mirror.

Skill

build-loop:defenseclaw-bridge

Bridges the orchestrator to the defenseclaw project's eval/test conventions when detected; otherwise no-op.

Skill

handoff

Compose a durable build-loop handoff document and optionally launch a fresh session with it injected.

Skill

build-loop:ibr-bridge

Routes UI visual verification to the IBR plugin when installed, else falls back to native-ax-driver/ui-validator.

Skill

knowledge-review

Read-only repo-local episodic memory review surface (decision rot, stale procedures, open conflicts).

Skill

knowledge

Canonical build-loop-memory framework for recording decisions/ADRs, regenerating indexes, and recall.

Skill

build-loop:logging-tracer-bridge

Optional escalation to standalone Coding Debugger for extended observability/tracer backends.

Skill

logging-tracer

Generates stack-appropriate logging/tracing with optional OpenTelemetry for silent failures.

Skill

loop-builder

Generates declarative focused-loop specs, presets, validators, and skill-chain plans.

Skill

mcp-builder

Build/package/debug an MCP server or add MCP tools to a plugin.

Skill

model-bakeoff

Runs a controlled multi-model bake-off across isolated git worktrees with deterministic re-scoring and best-of grafting.

Skill

model-tiering

Two-axis taxonomy (work-role segment × 7-rung capability ladder) for choosing a subagent's model tier.

Skill

build-loop:native-ax-driver

Drives running macOS apps via Accessibility API without touching the hardware cursor; self-contained Swift binary.

Skill

optimize

Design-of-Experiments test matrix (up to 6 variables) to find which variable moves a target metric.

Skill

plan-verify

Runs deterministic plan-verify rules on a Phase 2 plan and emits findings JSON.

Skill

plugin-builder

Create/scaffold/convert/migrate a Claude Code plugin — plugin.json, layout, hooks, MCP, agents, LSP.

Skill

plugin-tests

Static-analysis test harness for Claude Code plugins (namesake collisions, manifest drift, MCP misregistration).

Skill

build-loop:prd-bridge

Surfaces PRD principles and Navigation Map into Phase 1 Assess when a project PRD exists.

Skill

recursive-retrospective

Recursive-learning retrospective encoding build history into memory, agents, skills, evals, and gates.

Skill

research

Generates a repo-grounded pre-decision research packet with risks, best path, confidence, next action.

Skill

root-cause-analysis

Blameless RCA producing durable system levers, tiered L0/L1/L2 by impact; delegated-to by debug-loop.

Skill

runtime-parity-verification

Verifies a running app's core flow against the backing source-of-truth before claiming a fix works.

Skill

build-loop:security-methodology

Threat-modeling and scoped-permission methodology backing the security-reviewer agent.

Skill

build-loop:security-scan

Deterministic OWASP scanner over the repo (secrets, injection, rate limiting, headers) mapped to OWASP IDs.

Skill

build-loop:self-improve

Scans recent runs for recurring patterns and auto-drafts experimental skills/agents with A/B tracking.

Skill

spec-writing

Writes a build-loop-compatible plan/spec, walking a completeness checklist and running plan-critic.

Skill

build-loop:sync-skills

Read-only drift check recomputing source_hash for skills/architecture/ and skills/debugging/.

Skill

telemetry

OpenTelemetry-first guidance for instrumenting observability/tracing/metrics/logs in an app.

Skill

ui-design

Selects UI design direction for design-contract-specialist from tokens, UI Guidance, IBR artifacts, and research.

Skill

verify-dispatch

Checklist for independently verifying a dispatched agent's claimed commits/tests, since a solicited peer check is not independent.

interface-built-right 1 agents · 26 skills Claude CodeCodex

Agent

design-validator

Scans live UI page, compares against user intent, fixes mismatches, re-scans. Invoked for 'check my UI', 'verify the design', or after building components.

Skill

design-director

Primary design-agent planner — design intent, specialist passes, target roles, validation criteria for page, flow, app, dashboard work.

Skill

web-design-router

Web archetype classifier — routes to defaults for dashboards, research tools, workbenches, AI chat, checkout, content, admin screens.

Skill

data-visualization

Chart-worthiness gates, chart routing, hierarchy, attribution, accessibility, validation for charts, graphs, dashboards, KPIs, metrics.

Skill

design-guidance

Pre-build design direction, Calm Precision rules, component pattern selection, active design tokens, validation handoff.

Skill

component-patterns

Reusable component blueprints for cards, nav, forms, dashboards, modals, tables, lists with Calm Precision principles.

Skill

design-system

Design token extraction, validation, design system compliance, initialize tokens, view principles, add rules.

Skill

design-implementation

Building UI from user descriptions, implementing designs, matching mockups, aligning frontend edits with intent.

Skill

design-validation

Post-build verification pass — audit UI, validate build, check accessibility, find regressions, compare before/after.

Skill

design-reference

Capturing and comparing design references, screenshotting pages, saving design inspiration from URLs.

Skill

iterative-refinement

IBR refinement loop orchestration — scan → fix → re-scan iteration management until validation passes.

Skill

cli-reference

CLI command reference for inspecting, testing, validating UI via IBR for web and native platforms.

Skill

interactive-testing

Form testing, click-through flows, search/login interaction testing, browser interaction assertions.

Skill

native-testing

iOS, watchOS, macOS native app scanning — touch targets, a11y labels, watchOS constraints, Fix Guide.

Skill

auto-verify

Configuring/troubleshooting IBR automatic before/after scan workflow on UI file edits.

Skill

ui-brainstorm-preamble

Pre-build UI brainstorming — explore directions before implementing, capture platform/scope/archetype context.

Skill

ui-guidance-library

Reusable UI guidance patterns and decision aids, indexes central and project-local templates.

Skill

mockup-gallery-bridge

Bridge between mockup gallery reviews and IBR scan verification, enforces approved target roles.

Skill

mobile-web-ui

Mobile web UI patterns — responsive design, touch targets, viewport handling, Material 3, iOS Safari, WCAG 2.2.

Skill

ios-design

iOS HIG rules — SwiftUI conventions, safe areas, SF Symbols, Dynamic Type, haptics, Liquid Glass.

Skill

ios-design-router

iOS archetype classifier — routes to defaults for 6 iOS app archetypes (Utility, Content, Productivity, Consumer, Editorial, Tool/Pro).

Skill

apple-platform

How to build for iOS, watchOS, macOS — architecture patterns, SwiftData, Swift concurrency, CI/CD, TestFlight.

Skill

macos-ui

macOS UI patterns — AppKit/SwiftUI, menu bar, window chrome, materials, Liquid Glass, notarization.

Skill

design

Use when planning or implementing UI with IBR in Codex — compact routing skill.

Skill

native

Use when validating iOS, watchOS, macOS UI with IBR in Codex.

Skill

ui-ux-guidance

Compact IBR guidance for Codex — Calm Precision, web archetypes, interaction states, mobile, content states, data viz, Mockup Gallery.

Skill

validate

Use when scanning, auditing, comparing, or testing UI with IBR in Codex.

UI Guidance 22 skills Claude Code

Skill

audio

Design auditory UI channels — sounds, tones, notifications, sonic logos, voice confirmation, spatial audio.

Skill

audit-against-patterns

Audit URLs or images against tagged pattern library, preferences profile, and ui-guidance principles.

Skill

components

Design interactive components — buttons, icons, forms; cross-platform craft with state coverage.

Skill

data-viz

Design charts, graphs, tables, KPIs, sparklines; decision-first titling and ruthless decluttering.

Skill

designer

Build design systems by revealed preference — users pick live previews; emits platform-aware DESIGN.md (web/iOS/macOS).

Skill

evidence-policy

Route design evidence gathering between source-of-truth reads (tokens, palette) and live IBR capture.

Skill

feedback

Design action feedback — confirmations, button states, loading, success/error states, haptics, sounds.

Skill

hierarchy

Design visual hierarchy — title/description/metadata ladders, typography scale, emphasis through size-weight-contrast.

Skill

ingest-image

Extract design patterns from screenshots or mockups; maps observable structure to ingestion schema.

Skill

ingest-url

Extract design patterns from live URLs via IBR scan; maps DOM and computed styles to ingestion schema.

Skill

ios

Build or audit SwiftUI/UIKit iOS apps — HIG essentials, archetype classifier, SF Symbols, Dynamic Type, haptics.

Skill

macos

Build or audit native macOS apps — window chrome, toolbars, menu bar, NSVisualEffectView, keyboard-first interaction.

Skill

motion

Design animations and transitions — page transitions, hover, sequenced reveal, shimmer, micro-interactions.

Skill

navigation

Design navigation and wayfinding — tab bars, drawers, breadcrumbs, back stacks, sheets, modals.

Skill

preferences

Set defaults for token shape (typography, color, touch targets, motion, error UX) or audit against owned-app reference set.

Skill

principles

Cross-medium design doctrine applicable to UI, writing, images, docs, data — principle stable across primitives.

Skill

refine-preferences

Discover or narrow design preferences via mockup-gallery A/B rounds; aggregates picks into preferences profile.

Skill

responsive

Design for multiple form factors and screen sizes — breakpoints, container queries, density per surface.

Skill

states

Design element state lifecycles — rest, hover, focus, active, selected, disabled, loading; empty/error/success screens.

Skill

tag-pattern

Mark captures as liked or disliked patterns; cross-references ui-guidance principles followed or violated.

Skill

tokens

Design token systems shipping across form factors without forking — scale, ladder, condition, composite patterns.

Skill

web

Build or audit web apps — semantic HTML, ARIA, responsive design, forms, WCAG 2.1 AA, keyboard navigation.

market-research-platform 14 skills · 2 in-app Claude CodeCodexApp-embedded

Skill

pptx-builder

Build production PowerPoint decks with a python-pptx primary engine plus an opt-in pptxgenjs polish layer for the cover slide; runs a QA gauntlet (typography, WCAG contrast, drift check) before delivery.

Skill

diagram-mapper

Turn a codebase, workflow, process doc, or org/financial model into an audit-grade diagram (Mermaid, SVG, HTML, or PPTX).

Skill

pyramid-principle-core

Canonical rule library for Barbara Minto's Pyramid Principle (SCQA, MECE, governing thought) that sibling skills depend on.

Skill

pyramid-audit

Audit existing writing against Pyramid Principle / MECE / SCQA rules and produce ranked, cited findings.

Skill

pyramid-short-form

Draft answer-first short-form prose (email, memo, exec summary) under ~500 words using SCQA and a governing-thought key line.

Skill

pyramid-long-form

Produce a full nested key-line outline for long-form reports, briefs, and whitepapers before prose drafting.

Skill

pyramid-presentation

Produce a slide-by-slide ghost-deck storyline (assertion headlines and key lines) for presentations before slide design.

Skill

docx-builder

Build Word (.docx) files to a locked typography spec (Arial 12pt, heading hierarchy, TOC, callouts) via python-docx mechanics.

Skill

tyrone-conversational-voice

Draft or rewrite short-form conversational writing (Slack, casual email, chat) in the author's voice.

Skill

tyrone-writing-core

Canonical style library (punch-then-unpack, em-dash hygiene, register map) underlying the other voice skills.

Skill

deck-structure

Owns argument/structure logic for PowerPoint decks (Minto Pyramid, MECE, Duarte, action titles) prior to handoff to pptx-builder.

Skill

tyrone-writing-reviewer

Pre-ship logic/fact review that flags contradictions, unsupported claims, and overreach with severity-tagged findings.

Skill

linkedin-writing-style

Draft or rewrite LinkedIn posts/comments in the author's analytical, research-backed voice.

Skill

doc-structure

Section architecture and narrative logic for Word documents and long-form reports, prior to handoff to docx-builder.

Skill

tyrone-professional-writing

Draft or rewrite structured professional writing (cover letters, memos, executive emails) in the author's voice.

In-app

Research pipeline (research.py) · custom

3-node linear pipeline (retrieve_semantic -> retrieve_graph -> analyze) that does semantic + graph retrieval then a single LLM call to produce a cited analysis (general/SWOT/competitive/trend). Self-labeled 'Research agent' with an 'A-RAG adaptive' docstring, but the implementation is a fixed linear sequence, not dynamic tool-choice.

In-app

Reporting pipeline (reporting.py) · custom

3-node pipeline (gather_findings -> extract_brief -> generate_outputs) that runs an LLM map-reduce extraction into a StrategyBrief, then fans out to format generators (pptx/docx/email/notes/outline).

stratagem 13 in-app App-embedded

In-app

stratagem-control-agent · claude-agent-sdk

Top-level orchestrator (Opus): plans via research-planner, delegates to specialist subagents, tracks progress, enforces Pyramid Principle output and artifact verification

In-app

research-planner · claude-agent-sdk

Decomposes research questions into structured task plans before data gathering

In-app

data-extractor · claude-agent-sdk

Extracts and verifies data from PDFs, websites, spreadsheets, and presentations

In-app

financial-analyst · claude-agent-sdk

Analyzes SEC filings, earnings reports, and financial statements; runs Python calculations

In-app

research-synthesizer · claude-agent-sdk

Synthesizes multi-source extracted data into MECE/Pyramid-structured narratives with citations

In-app

executive-synthesizer · claude-agent-sdk

Produces executive-ready SCQA briefs from synthesis output

In-app

flowchart-architect · claude-agent-sdk

Designs process architectures and flowchart visualizations as PowerPoint decks

In-app

design-agent · claude-agent-sdk

Designs visual layout/structure for deliverables applying Calm Precision principles

In-app

prompt-optimizer · claude-agent-sdk

Refines weak/ambiguous research prompts for accuracy and specificity

In-app

plan-validator · claude-agent-sdk

Monitors execution for scope/quality/source/goal drift against the research plan

In-app

source-verifier · claude-agent-sdk

Verifies synthesized claims against cited sources; flags unsupported claims and stale data

In-app

report-critic · claude-agent-sdk

Scores report quality (completeness, accuracy, structure, depth, actionability) against a 4.0/5.0 threshold

In-app

after-action-analyst · claude-agent-sdk

Post-run debrief that diagnoses execution quality and records lessons/memory updates for future runs

claude-code-debugger 7 agents · 3 skills Claude CodeCodex

Agent

api-assessor

Domain assessor for API/REST/GraphQL, auth, and route-handler debugging symptoms.

Agent

assessment-orchestrator

Coordinates parallel domain-specific assessors (db/frontend/api/performance) when the debugging symptom spans multiple domains.

Agent

database-assessor

Domain assessor for database/query/migration/Prisma/PostgreSQL debugging symptoms.

Agent

fix-critique

Pressure-tests a proposed fix before it's declared resolved (root cause vs symptom, regressions, evidence).

Agent

frontend-assessor

Domain assessor for React/hooks/rendering/state/hydration debugging symptoms.

Agent

performance-assessor

Domain assessor for latency/memory/CPU/bottleneck debugging symptoms.

Agent

root-cause-investigator

Builds a causal tree (not a linear 5-whys chain) to find true root causes across multiple branches.

Skill

debug-loop

7-phase iterative root-cause debugging loop (investigate, hypothesize, fix, verify, score, critique, report), up to 5 iterations.

Skill

debugging-memory

Memory-first debugging workflow that checks past incidents via the debugger MCP search tool before investigating a bug.

Skill

logging-tracer

Generates stack-appropriate structured logging/tracing code (optionally OpenTelemetry) with a zero-dependency-first tiered approach.

nimbalyst 1 agents · 7 skills · 2 in-app Claude CodeApp-embedded

Agent

e2e-runner

Runs Playwright/E2E tests inside a Docker devcontainer for isolated, reproducible execution; manages the full container lifecycle (build, run, cleanup). Uses haiku model.

Skill

extension-development

Build, install, and hot-reload Nimbalyst extensions using in-app MCP tools.

Skill

feedback-intake

Entry point for Nimbalyst's in-app bug-report/feature-request flow; gathers evidence, anonymizes, and posts to GitHub Issues.

Skill

git-commit

Drives Nimbalyst's interactive 'Commit with AI' widget; explicitly NOT for generic commit requests.

Skill

datamodellm

Create visual Prisma-format database schema / ER diagrams using Nimbalyst's DataModelLM editor.

Skill

mockuplm

Create .mockup.html UX mockups/wireframes for visual planning inside Nimbalyst.

Skill

planning

Create/track structured plan documents and work items (bugs/tasks/ideas) via YAML frontmatter in nimbalyst-local/plans/.

Skill

excalidraw

Create flowcharts/architecture diagrams/sketches as .excalidraw files; includes an explicit 'stop after one pass, don't thrash' rule.

In-app

ClaudeCodeProvider + TeammateManager (AI Teammates) · claude-agent-sdk

Lead-agent session driver built on claude-agent-sdk's query(); TeammateManager (2039 lines) spawns, messages, and manages the lifecycle (active/idle/resume) of sub-agent 'teammates' the lead agent creates at runtime.

In-app

nimbalyst-meta-agent (MetaAgentService) · mcp

MCP server + service letting a running agent session create/spawn/inspect/prompt sibling or child agent sessions across providers (Claude, Codex) and worktrees (tools: create_session, spawn_session, get_session_status/result, send_prompt, respond_to_prompt, list_spawned_sessions).

Travel Planner 9 agents Claude Code

Agent

backend-api-engineer

Implement, modify, or update server-side logic, API endpoints, database schemas, and business logic

Agent

code-review-security

Perform comprehensive security analysis on code changes, static code scanning, and dependency checks

Agent

devops-deployment-agent

Handle CI/CD pipelines, container builds, and cloud deployments with rollback capabilities

Agent

documentation-maintainer

Maintain, consolidate, and update documentation across repositories

Agent

front-end-engineer

Build, modify, and maintain React/TypeScript UI components with styling and state management

Agent

product-design-auditor

Champion usability for React/TypeScript web apps through heuristic audits and Calm Precision design

Agent

system-architect

Design and review overall architecture, API contracts, and data models for React/TypeScript applications

Agent

task-orchestrator

Coordinate multiple agents for complex tasks and ensure proper delegation across domains

Agent

testing-qa-automation

Run automated test suites including Jest, Playwright, and Lighthouse for quality assurance

NavGator 4 agents · 5 skills Claude CodeCodex

Agent

architecture-advisor

Specialized agent for architecture decisions, dependency management, and impact analysis. Use when planning significant changes to the tech stack.

Agent

architecture-investigator

Autonomous subagent for investigating architecture issues using NavGator MCP tools. Follows SRE-style read-only investigation before proposing changes.

Agent

architecture-planner

Reads NavGator's stored graph, decides whether data is stale, runs incremental scan if needed, then dispatches read-only analysis tools and aggregates structured reports.

Agent

external-resolver

Resolves external boundary nodes (npm/pip/spm/cargo/go/service/llm/infra) against upstream registries with canonical identity, version, and freshness verdict.

Skill

code-review

Architectural integrity review of system flow and connections using impact analysis, data flow tracing, and lessons tracking.

Skill

architecture-scan

Scan project architecture, check health, and monitor staleness using NavGator scan and status tools.

Skill

architecture-export

Generate architecture diagrams and export architecture summaries using mermaid diagrams and CLI exports.

Skill

navgator-setup

Install the plugin, update to latest version, or launch the web dashboard and UI maintenance.

Skill

impact-analysis

Analyze what's affected by changes and map component connections using impact analysis, connection viewing, and dataflow tracing.

market-research 9 agents Claude Code

Agent

backend-api-engineer

Implement/modify server-side logic, API endpoints, DB schemas, and business logic for the React/TS app; proactive when new data entities or API contracts change.

Agent

code-review-security

Security gate run after tests pass, before deploy: static analysis, dependency vulnerability scanning, blocks merges on critical issues.

Agent

devops-deployment-agent

Owns CI/CD, Docker builds, and cloud deployments (Vercel/Fly.io/AWS), including rollback procedures; runs proactively after code-review approval.

Agent

documentation-maintainer

Maintains/consolidates repo docs, prevents doc sprawl, merges LESSONS.md growth into docs, updates changelogs/API guides.

Agent

front-end-engineer

Builds/maintains React/TypeScript UI components, Tailwind styling, Calm Precision design tokens, Storybook docs, accessibility.

Agent

product-design-auditor

Heuristic UX audits, accessibility/WCAG review, proposes Calm Precision token tweaks, drafts BDD test scenarios.

Agent

system-architect

Owns end-to-end architecture and cross-agent coordination: API contracts, data models, tech-stack decisions across frontend/backend/infra.

Agent

testing-qa-automation

Runs Jest/Playwright/Lighthouse automated test suites proactively after merges or new test scaffolds, fails fast on regressions.

Agent

project-orchestrator

Coordinates substantial multi-system changes by engaging system-architect and other domain agents for planning and sequencing.

pptx-builder 5 agents · 2 skills Claude Code

Agent

persona-strategist

Owns the deck's argument before any layout decision. Turns a fuzzy ask into a Pyramid-shaped ghost deck with governing thought, key line, and action-titled outline with evidence placeholders.

Agent

persona-critic

Owns visual + structural quality. Audits the Builder's .pptx against hard-constraints, composition principles, and house-style instances. Returns findings with routing verdicts.

Agent

persona-researcher

Runs after Strategist and before storyline writing. Gathers and classifies information into a tiered evidence base (proven/directional/gap) that grounds downstream claims.

Agent

persona-builder

Owns structure.json → .pptx rendering. Reads Designer's layout decisions, calls python-pptx helpers, runs full QA gauntlet before signaling done.

Agent

persona-designer

Owns the visual system. Reads Strategist's ghost deck and auto-layout pipeline. Produces per-slide LayoutDecision objects the Builder consumes. Never calls python-pptx directly.

Skill

pptx-builder

Build PowerPoint (.pptx) decks with the calm-precision-pptx hybrid model. python-pptx is the primary structural engine; pptxgenjs polish layer optional for cover slides.

Skill

diagram-mapper

Turn any source artifact into a clean, audit-grade diagram with explicit node types, decision branches, and exception paths. Renders to Mermaid, SVG, HTML/interactive, or PowerPoint.

local-smartz 7 in-app App-embedded

In-app

Local Smartz Agent · langgraph

Main production research agent — DeepAgents single-agent flow with built-in write_todos planning, filesystem tools, and task-tool subagent delegation (planner/researcher/analyzer/writer/fact_checker/reviewer/reflector/worker role prompts)

In-app

Orchestrator pattern · custom

Lead agent decomposes task into worker subtasks, workers execute independently and register artifacts, lead synthesizes from artifact summaries only (telephone-game mitigation, one-level-only enforced)

In-app

Critic Loop pattern · custom

Evaluator-optimizer loop — writer agent generates content, critic agent scores/verdicts, writer revises until score threshold or max iterations

In-app

Reflection pattern · custom

Reflexion-style loop — same-model agent generates an answer, then self-critiques against a rubric and revises (bounded iterations, stagnation guard)

In-app

Parallel/Ensemble pattern · custom

Runs N sampled agent turns at varied temperature concurrently, then a synthesizer agent merges them into one answer (ensemble voting / hallucination catching)

In-app

Router pattern · custom

Classifier turn labels the query, then dispatches to one specialist agent turn scoped to that label

In-app

Chain pattern · custom

Linear staged agent pipeline (gather -> analyze -> write), each stage consuming the prior stage's output, optionally pinned to different models per stage

spectra 1 agents · 5 skills Claude CodeCodex

Agent

marketing-planner

Plans (and optionally produces) product marketing video for any software product; dispatches for launch strategies, audience-tailored content plans, and production.

Skill

demo

Transforms raw screen recordings into polished demo clips with spotlight focus, captions, and optional speed adjustment.

Skill

video-design

Tunes video craft — pacing, color, typography, sound, and emotion arc — and translates design choices into Spectra render parameters.

Skill

content-capture

Connects to and captures screenshots/video from running web, macOS, and iOS/watchOS simulator applications for marketing and documentation.

Skill

product-marketing

Plans product marketing content tailored to product type, audience, and funnel stage; routes to Spectra capture/polish tools for production.

Skill

host-walkthrough

LLM-driven sequential navigation through UI flows with capture; used by host agents (Claude Code, Codex) to plan from snapshots and drive Spectra execution.

agent-builder 1 skills · 4 in-app Claude CodeCodexApp-embedded

Skill

agent-builder

Design, evaluate, or rebuild agentic harnesses — tool use, permissions, workflow state, memory, evals, observability, multi-agent, framework selection. Activates on harness-gap symptoms too.

In-app

Chief of Staff Agent · custom

Multi-node orchestrator for schedule optimization, priority triage, time-block planning, decision logging, and follow-up tracking with role-based task assignment.

In-app

Local Knowledge Agent · custom

Document analysis and knowledge management agent with embeddings, retrieval, and local LLM summarization for meetings, documents, and research materials.

In-app

Plan Runner Agent · custom

Planning agent that constructs structured outlines, generates plan sections, and builds implementation roadmaps using local LLMs.

In-app

Meeting Transcript Agent · custom

Meeting analysis agent that extracts actions, decisions, and notes from meeting transcripts with semantic classification.

pyramid-principle 5 skills Claude CodeCodex

Skill

pyramid-principle-core

Canonical rule library for Pyramid Principle: SCQA, MECE, deductive/inductive logic, vertical/horizontal relationships, governing thought structure.

Skill

pyramid-short-form

Drafts answer-first short-form writing (emails, memos, exec summaries, one-pagers, BLUF notes) using Pyramid Principle structure.

Skill

pyramid-long-form

Structures long-form professional documents (reports, briefs, proposals, whitepapers) with full nested pyramid outlines before prose.

Skill

pyramid-presentation

Builds presentation storylines and slide decks with assertion-based headlines and logical flow following Pyramid Principle.

Skill

pyramid-audit

Audits existing writing against Pyramid Principle rules; produces structured violation findings and ranked fixes with source citations.

TruePace 4 in-app App-embedded

In-app

AIInsightService · custom

Orchestrates LLM requests between on-device (Apple Foundation Models) and cloud (Groq) providers with routing, caching, and state management for AI features

In-app

PlanCoachService · custom

Orchestrates plan critique generation by combining deterministic PlanCoachEngine logic with LLM inference and safety validation

In-app

WeeklyReviewService · custom

Orchestrates weekly review generation via AIInsightService with weekly caching, permission checks, and prompt construction

In-app

TaskDecomposer · custom

Routes task decomposition between Apple on-device Foundation Models and cloud providers for parsing raw text into structured tasks

replit-migrate 1 agents · 3 skills Claude CodeCodex

Agent

migration-analyst

Deep investigation agent that reads all source files in a Replit project, maps the full dependency graph, identifies hidden Replit dependencies, and produces a comprehensive migration readiness report.

Skill

migration-scan

Use to "scan this Replit app", "analyze for migration", "check Replit dependencies", or assess migration readiness; auto-triggers when a .replit file is present.

Skill

migrate-web

Use when the user asks to "migrate to web", "deploy to Vercel", "move to Cloudflare", "get off Replit", or convert a Replit app to standalone/self-hosted production.

Skill

migrate-ios

Use when the user asks to "make this an iOS app", "convert to native", "port to Swift", "SwiftUI version", or migrate a Replit app to native iOS/macOS.

agent-doe-engine 2 agents · 1 skills Claude CodeCodex

Agent

optimize-runner

Executes the agent-doe-engine autoresearch loop: generates one hypothesis per iteration, makes an atomic scoped change, measures every objective, and keeps/reverts by aggregate score until convergence or budget exhaustion.

Agent

overfitting-reviewer

Read-only adversarial reviewer that checks kept optimization commits for Goodhart violations, metric-gaming, safety removal, scope violations, and unvalidated-metric winners.

Skill

agent-doe-engine

Orchestrates DOE (Design of Experiments) and autoresearch optimization: plans/validates factors, sets multi-objective scoring (scalarize/desirability/pareto), runs the trial matrix or single-factor loop, and reviews results for overfitting.

api-registry 3 skills Claude Code

Skill

api-registry

Use BEFORE writing API config, env setup, auth, LLM client init, or debugging external services. Returns authoritative source URLs, latest version, and deprecation warnings. Triggers on any named library/API/tool.

Skill

build-loop-bridge

Contract for external plugins (build-loop, debugger, research) to consult api-registry during API setup or debug. Reads ~/.api-registry/registry.db; exits silently if absent.

Skill

docs-search

Use when the user asks how a library/API/tool works, requests docs for a named service, or asks about current models/versions/endpoints. Routes via api-registry to Context7 or WebFetch.

AI User Personas 2 agents · 1 skills Claude CodeCodex

Agent

persona-perspective-reviewer

Reviews a product, UI, workflow, or feature from one assigned persona perspective with evidence and measurement.

Agent

persona-panel-orchestrator

Selects task-specific persona perspectives, checks execution readiness, defines measurements, launches review passes, and synthesizes findings.

Skill

persona-lab

Turn user requests into focused persona panels for reviewing products, UIs, workflows, and features with task-specific perspectives, measurement, and evidence-based findings.

agent-rally-point 3 skills Claude CodeCodex

Skill

agent-rally-point

Use when working in a repository that uses Rally/Agent Rally Point for cross-agent coordination, especially at session start, before editing files, when deciding what to do next, handing work to another agent, recording facts/artifacts/decisions, resolving blockers, or coordinating with other coding agents through the `rally` CLI.

Skill

mini-loop

Ultra-light per-task quality loop for Rally Flow / dynamic-workflows fan-out. Wrap each fanned-out task in assess → plan → execute → mini-judge to raise accuracy, with zero tools or scripts.

Skill

rally-workflows

Use when fanning out work across multiple agents, running a dynamic workflow, coordinating parallel subagents, or splitting a workstream across hosts, terminals, or machines through Agent Rally Point.

atomize-ai 1 agents · 1 skills · 1 in-app Claude CodeApp-embedded

Agent

frontend-ui-engineer

Modifies/creates/fixes UI (HTML/CSS/frontend-backend integration), ensures data rendering and design-system consistency; invoked by the Chief orchestrator or directly for UI work.

Skill

db-investigation

Read-only Supabase/Postgres investigation for atomize-ai — pipeline freshness sweep and coverage-gap analysis (freshness.py, coverage_gap.py) used to diagnose stale/empty data before fixing.

In-app

Search Orchestrator V2 (runSearch) · custom

Multi-step intelligent-search pipeline — classifyIntent → planRetrieval → fan-out retrieval (vector+KG) → rerank → quality-guard (reformulate-and-retry on low relevance) → synthesize → numeric verification → normalize; feature-flagged, falls back to legacy search handler.

decision-doctor-cc 3 in-app App-embedded

In-app

Aida Chat Orchestrator · custom

Conversational multi-step router: detects specialty/intent, generates and adapts intake surveys, classifies via Stage 0, then dispatches to the Decision Engine or Recommendation Engine and streams the Groq completion back to the user

In-app

Decision Engine (runDecision, V1) · custom

Deterministic multi-stage decision pipeline (weights/AHP -> constraints -> weighting -> outranking -> ranking -> feasibility -> scaffold), with LLM calls at Stages 1, 5, 6 via Groq

In-app

Recommendation Engine / pain-to-AI recommender (runRecommendation, V2) · custom

Chains pain-path classification -> library retrieval -> candidate task generation -> 9-criteria scoring -> promotion (or the newer runWorkflowV2 decompose/score/lynchpin/horizon pipeline) to produce an AiTaskRecommendation

bookmark 1 agents · 1 skills Claude CodeCodex

Agent

snapshot-analyst

Agent for deep analysis of session context and snapshot comparison. Use for detailed context review, decision timelines, and coverage analysis.

Skill

context-continuity

Restores session via bookmark snapshots when user asks to continue from last session, restore context, or pick up where they left off.

atomize 1 agents · 1 in-app Claude CodeApp-embedded

Agent

frontend-ui-engineer

Frontend UI specialist handling HTML/CSS/TypeScript UI implementation, responsive design, accessibility, and frontend-backend integration.

In-app

SourceOrchestrator · custom

Multi-strategy content scraping orchestrator supporting RSS discovery, sitemap parsing, HTML scraping with fallback logic.

loop-builder 1 agents · 1 skills Claude CodeCodex

Agent

loop-builder

Turns a described workflow into a generated loop pack (loop.yaml, rubric.md, report template, validators, README.md).

Skill

loop-builder

Generates declarative focused-loop specs, presets, validators, and skill-chain plans for any host coding agent.

ProductPilot 1 in-app App-embedded

In-app

Adaptive Intake Controller · custom

Multi-step orchestrator that runs guided-conversation intake for a product spec: routes to a method (jtbd/qfd/pugh/agent) via a Haiku sub-call, scores unknowns 0-15 to decide ask-vs-infer, generates the next question or safe-default assumptions, then finalizes into a rendered spec draft.

atomize-news 1 agents Claude Code

Agent

frontend-ui-engineer

Handles UI/UX implementation (HTML/CSS/JS/TS), responsive layouts, accessibility, and frontend-backend data integration.

docx-builder 1 skills Claude Code

Skill

docx-builder

Build Word (.docx) files with locked typography, heading hierarchy (16/14/12pt), 1" margins, table header shading, callout boxes, auto-TOC. Pair with doc-structure for section planning.

blog-content-scraper 1 skills Claude CodeCodex

Skill

web-scraper

Extract blog and news content from any website via RSS, sitemap, or HTML scraping with LLM-ready output.

agent-builder-studio 1 skills Claude CodeCodex

Skill

agent-builder

Design, evaluate, or rebuild agentic harnesses — tool use, permissions, workflow state, memory, evals, observability, multi-agent, framework selection.

pptx-style-builder 1 skills Claude Code

Skill

pptx-style-builder

Capture PowerPoint .pptx visual style (theme colors, fonts, slide size, masters, layouts) into JSON spec, then generate new .pptx from Markdown/JSON content reusing that style; includes native chart support via R or python-pptx fallback.

prompt-builder 1 skills Claude CodeCodex

Skill

prompt-builder

Classify, diagnose, rewrite, and score prompts calibrated to model tier and deployment context.

research-plugin 1 skills Claude CodeCodex

Skill

research

Structured research methodology (5-phase general research, collection, synthesis, quantitative/database analysis, active-project ingestion) producing cited, verified findings persisted to ~/dev/research/.

mockup-gallery 1 skills Claude CodeCodex

Skill

mockup-review

Create scratch-first HTML mockups, rate mockups in gallery, check selections, align mockups to routes/screens, and implement approved designs from the mockup gallery.

threat-modeler 1 skills Claude CodeCodex

Skill

threat-modeler

Workshop assistant for producing threat-model artifacts — walks STRIDE decomposition, asset/actor enumeration, data-flow description, OWASP LLM/Agentic/ATLAS/NIST cross-mapping, mitigations, residual risk, decision log for agentic systems and risk-surface changes.

Generated from a cross-repo assessment; catalog stored in build-loop-memory and refreshed periodically.