When to Use a Claude Skill vs a Claude Sub-Agent

You're staring at your .claude/ directory wondering: should this be a skill or a sub-agent? You've read the docs. You still aren't sure.

Here's the thing: skills and sub-agents sound similar but work completely differently. Pick the wrong one and you'll end up with context bloat or tasks that never finish. Pick the right one and Claude becomes genuinely better at your specific workflows.

This guide shows you exactly when to use each, with real examples and a decision framework you can actually use.

The Core Distinction

Skills: Prompt Templates That Inject Instructions

Skills are specialized prompt templates that inject domain-specific instructions into Claude's current context when relevant.

Think of them like this: you're working on a React component. Claude detects "React" in your files and loads a skill with patterns like "use explicit props over spreading" or "prefer function components." Those instructions guide Claude's responses for that specific task.

Key characteristics:

Model-invoked: Claude decides when to load them based on context
Lightweight: Start at ~few dozen tokens
Progressive disclosure: Load only relevant information as needed
No direct execution: They guide responses, not run code
Personal or project-scoped: ~/.claude/skills/ or .claude/skills/

What they contain:

Specialized instructions
Design patterns
Coding conventions
Templates and examples
Domain-specific knowledge
Documentation references

Sub-Agents: Isolated AI Assistants for Complex Tasks

Sub-agents are specialized AI assistants that run in separate context windows with their own system prompts and tool access.

Think of them like this: you ask Claude to review your entire codebase for security issues. That's not a quick task—it needs multiple steps, careful analysis, and its own context. A sub-agent spins up, does the work, and reports back with findings.

Key characteristics:

User-invoked or automatic: Ask Claude to "use the security-reviewer agent" or Claude invokes them automatically when appropriate
Full context: Complete context window for complex work
Tool access: Can read files, run commands, execute code
Autonomous: Complete multi-step tasks independently
Maintains expertise: Specialized system prompts for specific domains

What they do:

Multi-step code reviews
Architecture analysis
Complex refactoring across multiple files
Documentation generation
Test suite creation
Specialized problem-solving

When to Use Skills

Scenario 1: Reusable Coding Patterns

You have a house style for TypeScript that you want applied consistently: named exports only, no default exports, explicit return types.

Use a skill:

---
name: typescript-conventions
description: TypeScript coding standards for our codebase
---

# TypeScript Conventions

## Exports
- Use named exports only
- Never use default exports
- Group related exports at end of file

## Type Annotations
- Explicit return types on all functions
- No implicit any
- Prefer interfaces over types for objects

## Code Organization
- One component per file
- Co-locate tests with implementation
- Use barrel exports for directories

Save this as .claude/skills/typescript-conventions/SKILL.md. Claude loads it when working on TypeScript files and applies these patterns automatically.

Why this works: The patterns are simple, contextual, and apply across many files. You don't need a separate context window—you just need Claude to remember your preferences.

Scenario 2: Framework-Specific Knowledge

You're migrating to Next.js App Router. There are dozens of gotchas: async components, client boundaries, route segments, metadata API.

Use a skill:

prpm install @nextjs/app-router

This skill contains:

Server component patterns
When to use 'use client'
Metadata API examples
Loading/error state conventions
Route group patterns

Claude loads this when it detects Next.js and App Router in your project. You get framework-specific guidance without constantly looking up docs.

Why this works: The knowledge is specific to Next.js App Router but applies to many files. Skills enable progressive disclosure—Claude loads route group patterns only when you're actually working on route organization.

Scenario 3: Domain-Specific Vocabulary

You're working on a fintech app with specific terminology: "settlements," "clearing," "merchant accounts," "payment rails."

Use a skill:

---
name: fintech-glossary
description: Domain terminology for payment processing
---

# Payment Processing Terms

## Settlement
The process of transferring funds from acquiring bank to merchant account.
- Batch settlement: Once per day
- Real-time settlement: Within minutes (premium)

## Clearing
Reconciliation between issuing and acquiring banks.
- Timeline: T+1 to T+3
- Handles chargebacks and disputes

## Merchant Account
Bank account that holds funds before settlement.
- Reserve requirements: 5-20% typical
- Rolling reserve: Held for 6 months

[...]

Now when you discuss payments, Claude uses correct terminology and understands domain constraints.

Why this works: Domain knowledge improves all conversations about this topic. It doesn't need isolated execution—it needs consistent context.

Scenario 4: Testing Standards

You want every PR to include specific test types: unit tests, integration tests, edge case coverage.

Use a skill:

---
name: testing-standards
description: Testing requirements for all code changes
---

# Testing Standards

## Required Coverage
- Unit tests for all public functions
- Integration tests for API endpoints
- Edge case tests for error handling
- No tests for trivial getters/setters

## Test Organization
- Co-locate tests with implementation
- Use descriptive test names (no "test1", "test2")
- One assertion per test when possible
- Mock external dependencies

## Edge Cases to Cover
- Null/undefined inputs
- Empty arrays/objects
- Boundary values (0, -1, max)
- Concurrent operations
- Network failures

Claude references this when writing or reviewing code, ensuring consistent test coverage.

Why this works: Testing standards apply to many files and many conversations. Skills make them available without repeating yourself.

When to Use Sub-Agents

Scenario 1: Comprehensive Code Review

You need a security audit of your authentication flow across multiple files.

Use a sub-agent:

Ask Claude: "Use the security-reviewer agent to audit my auth flow"

The sub-agent:

Reads all authentication-related files
Traces token flow from login to protected routes
Checks for common vulnerabilities (CSRF, XSS, SQL injection)
Tests edge cases (expired tokens, concurrent logins)
Reports findings with severity levels and file locations

Why this works: This requires reading dozens of files, maintaining context across the entire auth system, and applying specialized security knowledge. A sub-agent maintains its own context window and can spend hundreds of tokens analyzing each file.

You can't do this with a skill—skills don't execute tasks, they guide responses.

Scenario 2: Large-Scale Refactoring

You're renaming a core abstraction used in 50+ files with inconsistent patterns.

Use a sub-agent:

Ask Claude: "Use the refactoring-assistant agent to help rename this abstraction"

The sub-agent:

Searches codebase for all uses of the old name
Analyzes context to understand each usage pattern
Generates refactoring plan with risk assessment
Updates files incrementally with safety checks
Runs tests after each batch to catch breaks early

Why this works: This is a complex multi-step task requiring careful coordination and error handling. Sub-agents can maintain state across steps and adapt their approach based on results.

A skill would just tell Claude how to refactor. A sub-agent actually does the refactoring.

Scenario 3: Architecture Analysis

You need to document the architecture of an unfamiliar codebase you just inherited.

Use a sub-agent:

Ask Claude: "Use the architecture-documenter agent to analyze this codebase"

The sub-agent:

Scans directory structure for entry points
Traces dependencies to map data flow
Identifies design patterns and architectural decisions
Documents module responsibilities
Generates diagrams (Mermaid format)
Flags potential issues (tight coupling, circular deps)

Why this works: Understanding architecture requires reading the entire codebase systematically, maintaining a mental model across hundreds of files, and synthesizing high-level insights. That's exactly what sub-agents excel at.

Scenario 4: Specialized Problem Domains

You're debugging a complex distributed systems issue with race conditions.

Use a sub-agent:

Ask Claude: "Use the distributed-systems-expert agent to analyze this race condition"

The sub-agent has a specialized system prompt with deep knowledge of:

CAP theorem implications
Consistency models (eventual, strong, causal)
Common race conditions in distributed systems
Debugging techniques for non-deterministic bugs

It analyzes your logs, traces, and code to identify the specific race condition and suggest fixes.

Why this works: Distributed systems expertise requires deep specialized knowledge applied systematically across multiple files and logs. The sub-agent's isolated context lets it maintain the mental model needed for this complexity.

The Decision Framework

Use this flowchart:

Is this a complex multi-step task?
├─ YES → Does it need to read/write many files?
│         ├─ YES → Use sub-agent
│         └─ NO → Use sub-agent (context isolation still helpful)
└─ NO → Is this guidance that applies to many situations?
          ├─ YES → Use skill
          └─ NO → Is this specialized knowledge for specific contexts?
                  ├─ YES → Use skill
                  └─ NO → Maybe just ask Claude directly?

Or use this checklist:

Use a skill if:

It's guidance, not execution
It applies to many files/conversations
Context size is small (< 500 tokens)
Claude should decide when to load it
It's patterns, conventions, or domain knowledge

Use a sub-agent if:

It's a complex task with multiple steps
It needs to read/analyze many files
It requires specialized expertise start-to-finish
It needs its own context window
You want it isolated from main conversation

Real-world examples:

Scenario	Skills	Sub-Agent
API migration	✅ Migration patterns and gotchas	❌ Overkill for pattern guidance
Security audit	❌ Can't analyze entire codebase	✅ Needs systematic analysis
Code style	✅ Conventions applied per-file	❌ No multi-step work needed
Generate test suite	❌ Can't coordinate across files	✅ Needs context across all tests
Domain glossary	✅ Terminology used everywhere	❌ No execution required
Refactor architecture	❌ Can't coordinate changes	✅ Complex multi-file changes

Skills vs Sub-Agents: Technical Deep Dive

How Skills Work Under the Hood

Skills use Claude's tool-use capability but in a clever way. They're "meta-tools"—prompt templates that inject specialized instructions into Claude's context.

Simon Willison calls them "lightweight and practical." Lee Hanchung explains they "enable progressive disclosure of context, allowing agents to load only relevant information as needed."

The loading mechanism:

You start a task involving React components
Claude's model sees "React" in file paths and content
It autonomously invokes the get_skill tool: get_skill("react-patterns")
The skill content gets injected into current context
Claude now has React-specific guidance for this conversation

Token efficiency:

Initial injection: ~50-200 tokens (just metadata)
Full load: ~500-2000 tokens (if Claude needs details)
Progressive: More context loaded only if needed

No code execution: Skills cannot run scripts, read files, or execute commands. They provide instructions. Claude still does the work.

How Sub-Agents Work Under the Hood

Sub-agents run in completely isolated context windows with their own system prompts.

The isolation mechanism:

You ask: "Use the security-reviewer agent"
Claude spawns new context window
Loads sub-agent's custom system prompt
Sub-agent has full tool access (read files, run commands)
Maintains conversation state independently
Reports back to main context when done

Token budget:

Separate allocation from main conversation
Can use thousands of tokens without affecting your main work
Context preserved across sub-agent's entire task

Full autonomy: Sub-agents can execute complex workflows without constant user input. They make decisions, adapt based on results, and complete multi-step processes.

Why the Distinction Matters

Anthropic explains: "Skills feel closer to the spirit of LLMs"—they're about guiding the model with context, not programming task automation.

Sub-agents are for when guidance isn't enough. When you need actual execution, systematic analysis, or isolated expertise.

Getting this wrong:

Using sub-agent for code style → waste of context window
Using skill for code review → can't analyze multiple files
Using sub-agent for glossary → unnecessary complexity
Using skill for refactoring → no execution capability

PRPM: Managing Skills and Sub-Agents

PRPM packages work for both skills and sub-agents. The format is the same—only the subtype differs.

Installing Skills from PRPM

# Install a skill
prpm install @nextjs/app-router
prpm install @stripe/migration-guide
prpm install @testing/jest-patterns

These go to .claude/skills/ and Claude loads them automatically based on context.

Installing Sub-Agents from PRPM

# Install a sub-agent
prpm install @code-review/security-auditor
prpm install @refactoring/architecture-analyzer
prpm install @documentation/api-documenter

These go to .claude/agents/ and you invoke them by asking: "Use the security-auditor agent"

Publishing Your Own

Package structure is identical:

Skill:

my-skill/
├── SKILL.md           # Main content with metadata
└── examples/          # Optional supporting files

Sub-agent:

my-agent/
├── agent.md           # Agent definition with system prompt and metadata
└── tools/             # Optional tool configs

The registry handles both. Install command is the same. Only difference is where they land and how Claude uses them.

Why PRPM matters here:

Cross-editor support: Skills work in Claude Code today. Tomorrow you might use Cursor. PRPM converts formats automatically.
Versioning: Lock skills and sub-agents to specific versions. Prevent breaking changes.

Collections: Install related skills as bundles:

prpm install collection/nextjs-full-stack  # Gets app-router, api-routes, deployment skills

Discovery: Browse 6,000+ packages in the registry. Find skills for your framework, language, or domain.

The Mental Model

Think of it this way:

Skills = Reference cards in Claude's pocket

Claude pulls them out when relevant
They remind Claude of patterns and conventions
Lightweight, always available
Guide responses but don't execute tasks

Sub-agents = Specialists you call in

You explicitly request their help
They work independently with full context
Complete complex tasks autonomously
Report back with results

Together:

Skills make Claude better at routine work
Sub-agents handle specialized complex tasks
Both integrate seamlessly with PRPM

Quick Reference

Question	Skills	Sub-Agents
Who invokes it?	Claude (automatically)	You (conversationally) or Claude (automatically)
Can it execute code?	No	Yes
Context window?	Shares main context	Isolated context
Token efficiency?	Very efficient	Separate budget
Best for?	Patterns, conventions, knowledge	Complex multi-step tasks
How to install?	`prpm install @vendor/skill-name`	`prpm install @vendor/agent-name`
Where installed?	`.claude/skills/`	`.claude/agents/`
How to use?	Automatic when relevant	"Use the X agent" or automatic

Try It Yourself

Start simple:

Install a skill:

prpm install @auser/jest-patterns

Open a test file. Notice how Claude references testing patterns automatically.

Try a sub-agent:

Ask Claude: "Use the code-reviewer agent to review this module"

Watch it analyze multiple files and synthesize findings.

See the difference? Skills enhance Claude's general responses. Sub-agents complete specific complex tasks.

Next Steps

Browse the registry: https://prpm.dev
- Filter by subtype: "skill" or "agent"
- See what others have published

Install relevant packages:

prpm install @prpm/react-testing-patterns
prpm install @auser/security-auditor

Create your own:
- Package your team's conventions as skills
- Build specialized sub-agents for your domain
- Publish to share with others

The Bottom Line

Choose skills when you want Claude to know your patterns and apply them consistently across many contexts. Choose sub-agents when you need Claude to complete a complex task independently with its own context and tools.

Get it right and Claude becomes significantly more useful. Get it wrong and you'll fight with context limits or wonder why your "skill" doesn't actually do anything.

The good news: you can experiment freely. Install a package, try it, uninstall if it's not right. PRPM makes this trivial.

Now you know when to use each. Go build something.

Questions?

Open an issue if you're still not sure whether something should be a skill or sub-agent. We're here to help.