Skill Authoring Best Practices

Practical guidance for writing skills that work reliably and adapt gracefully. These patterns apply to agents, workflows, and utilities alike.

Core Principle: Informed Autonomy

Give the executing agent enough context to make good judgment calls — not just enough to follow steps. The test for every piece of content: “Would the agent make better decisions with this context?” If yes, keep it. If it is genuinely redundant, cut it.

Simple utilities need minimal context — input/output is self-explanatory. Interactive workflows need domain understanding, user perspective, and rationale for non-obvious choices. When in doubt, explain why — an agent that understands the mission improvises better than one following blind steps.

Freedom Levels

Match specificity to task fragility.

Freedom	When to Use	Example
High (text instructions)	Multiple valid approaches, context-dependent	“Analyze structure, check for issues, suggest improvements”
Medium (pseudocode/templates)	Preferred pattern exists, some variation OK	`def generate_report(data, format="markdown"):`
Low (exact scripts)	Fragile operations, consistency critical	`python scripts/migrate.py --verify --backup` (do not modify)

Analogy: Narrow bridge with cliffs = low freedom. Open field = high freedom.

Quality Dimensions

Six dimensions to keep in mind during the build phase. The quality scanners check these automatically during optimization.

Dimension	What It Means
Informed Autonomy	Overview establishes domain framing, theory of mind, and design rationale — enough for judgment calls
Intelligence Placement	Scripts handle plumbing (fetch, transform, validate). Prompts handle judgment (interpret, classify, decide). If a script contains an `if` that decides what content means, intelligence has leaked
Progressive Disclosure	SKILL.md stays focused; stage instructions go in `prompts/`, reference data in `resources/`
Description Format	Two parts: `[5-8 word summary]. [Use when user says 'X' or 'Y'.]` — default to conservative triggering
Path Construction	Never use `{skill-root}`. Use `{project-root}` only for `_bmad` paths. Use config variables directly; they already contain `{project-root}`
Token Efficiency	Remove genuine waste (repetition, defensive padding). Preserve context that enables judgment (domain framing, rationale)
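
The Intelligence Placement and Description Format rows can be illustrated together: a script may check the structural shape of a description (two parts, word count, a "Use when" opener), while judging whether the trigger phrases are appropriate stays with the prompt. This is a minimal sketch, not an existing scanner; `check_description` is a hypothetical name, and splitting on the first ". " is a simplifying assumption.

```python
def check_description(description: str) -> list[str]:
    """Return structural problems only; an empty list means the shape is valid.

    Hypothetical helper: it checks form (plumbing), never what the text means.
    """
    summary, sep, trigger = description.partition(". ")
    if not sep:
        return ["expected two parts: '<summary>. Use when ...'"]

    problems = []
    word_count = len(summary.split())
    if not 5 <= word_count <= 8:
        problems.append(f"summary has {word_count} words; expected 5-8")
    if not trigger.startswith("Use when"):
        problems.append("second part should start with 'Use when'")
    return problems
```

Every `if` here tests structure. Deciding whether the triggers are too aggressive is a judgment call and belongs in a prompt.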

Common Patterns

Soft Gate Elicitation

For guided workflows, use “anything else?” soft gates at natural transition points instead of hard menus.

Present what you've captured so far, then:
"Anything else you'd like to add, or shall we move on?"

Users almost always remember one more thing when given a graceful exit ramp rather than a hard stop. This consistently produces richer artifacts than rigid section-by-section questioning. Use at every natural transition in collaborative discovery workflows. Skip in autonomous/headless execution.

Intent-Before-Ingestion

Never scan artifacts or project context until you understand WHY the user is here. Without knowing intent, you cannot judge what is relevant in a 100-page document.

1. Greet and understand intent
2. Accept whatever inputs the user offers
3. Ask if they have additional context
4. ONLY THEN scan artifacts, scoped to relevance

Capture-Don’t-Interrupt

When users provide information beyond the current scope — dropping requirements during a product brief, mentioning platforms during vision discovery — capture it silently for later use rather than redirecting them.

Users in creative flow share their best insights unprompted. Interrupting to say “we’ll cover that later” kills momentum and may lose the insight entirely.

Dual-Output: Human Artifact + LLM Distillate

Any artifact-producing workflow can output two complementary documents: a polished human-facing artifact AND a token-conscious, structured distillate optimized for downstream LLM consumption.

Output	Purpose
Primary	Human-facing document — concise, well-structured
Distillate	Dense, structured summary for downstream LLM workflows — captures overflow, rejected ideas (so downstream does not re-propose them), detail bullets with enough context to stand alone

The distillate bridges the gap between what belongs in the human document and what downstream workflows need. Always offer it to the user; never force it.
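
One possible shape for a distillate, sketched as a small data structure. The class name, field names, and rendered layout are all illustrative assumptions; the source only specifies that a distillate carries overflow, rejected ideas, and self-contained detail bullets.

```python
from dataclasses import dataclass, field


@dataclass
class Distillate:
    """Hypothetical container for the LLM-facing companion document."""

    source_artifact: str
    details: list[str] = field(default_factory=list)   # bullets that stand alone
    overflow: list[str] = field(default_factory=list)  # captured but out of scope
    rejected: list[str] = field(default_factory=list)  # ideas already ruled out

    def render(self) -> str:
        """Render non-empty sections as terse plain-text bullets."""
        sections = [
            ("Details", self.details),
            ("Overflow", self.overflow),
            ("Rejected (do not re-propose)", self.rejected),
        ]
        lines = [f"Distillate for: {self.source_artifact}"]
        for title, items in sections:
            if items:
                lines.append(f"{title}:")
                lines.extend(f"- {item}" for item in items)
        return "\n".join(lines)
```

Empty sections are omitted from the rendered output to keep the token cost low.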

Three-Mode Architecture

Interactive workflows can offer three execution modes matching different user contexts.

Mode	Trigger	Behavior
Guided	Default	Section-by-section with soft gates; drafts from what it knows, questions what it doesn’t
YOLO	`--yolo` or “just draft it”	Ingests everything, drafts complete artifact upfront, then walks user through refinement
Headless (Autonomous)	`--headless` / `-H`	Takes inputs, produces artifact, no interaction

Not every workflow needs all three — but considering them during design prevents painting yourself into a single interaction model.
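
Where a workflow has a script entry point, the flag-based triggers in the table could be resolved as sketched below. This is an assumption-laden illustration: `select_mode` and the `workflow` program name are hypothetical, and treating the two flags as mutually exclusive is a design choice the source does not state. Natural-language triggers such as “just draft it” are left to the prompt, not the script.

```python
import argparse


def select_mode(argv: list[str]) -> str:
    """Map command-line flags to one of the three modes (hypothetical helper)."""
    parser = argparse.ArgumentParser(prog="workflow")  # hypothetical entry point
    group = parser.add_mutually_exclusive_group()      # assumption: one mode at a time
    group.add_argument("--yolo", action="store_true",
                       help="draft the complete artifact upfront, then refine")
    group.add_argument("-H", "--headless", action="store_true",
                       help="take inputs, produce artifact, no interaction")
    args = parser.parse_args(argv)

    if args.headless:
        return "headless"
    if args.yolo:
        return "yolo"
    return "guided"  # default mode per the table above
```

For example, `select_mode(["-H"])` returns `"headless"` and `select_mode([])` returns `"guided"`.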

Parallel Review Lenses

Before finalizing any significant artifact, fan out multiple reviewers with different perspectives.

Reviewer	Focus
Skeptic	What is missing? What assumptions are untested?
Opportunity Spotter	What adjacent value is being overlooked? What angles are unexplored?
Contextual	LLM picks the best third lens for the domain (regulatory risk for healthtech, DX critic for devtools)

Graceful degradation: if subagents are unavailable, the main agent does a single critical self-review pass.

Graceful Degradation

Every subagent-dependent feature should have a fallback path. Skills run across different platforms, models, and configurations. A skill that hard-fails without subagents is fragile. A skill that gracefully falls back to sequential processing is robust everywhere.
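
The fallback idea can be sketched as a function that prefers a parallel dispatcher but still works without one. Everything here is hypothetical: `review`, the `Lens` and `Dispatcher` types, and the use of `RuntimeError` to signal a failed fan-out are assumptions for illustration, not a real platform API.

```python
from typing import Callable, Optional

Lens = Callable[[str], str]  # a reviewer: artifact text in, feedback out
Dispatcher = Callable[[str, dict[str, Lens]], dict[str, str]]  # hypothetical fan-out


def review(artifact: str,
           lenses: dict[str, Lens],
           dispatch: Optional[Dispatcher] = None) -> dict[str, str]:
    """Use the subagent dispatcher when available; otherwise run lenses in turn."""
    if dispatch is not None:
        try:
            return dispatch(artifact, lenses)
        except RuntimeError:
            pass  # assumed failure signal; fall through to the sequential path
    # Sequential fallback: same lenses, same result shape, no subagents required.
    return {name: lens(artifact) for name, lens in lenses.items()}
```

Because both paths return the same shape, callers do not need to know which one ran.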

Verifiable Intermediate Outputs

For complex tasks: plan, validate, execute, verify.

1. Analyze inputs
2. Create changes.json with planned updates
3. Validate with script before executing
4. Execute changes
5. Verify output

This sequence catches errors early, is machine-verifiable, and makes planning reversible: the plan can be revised before anything is executed.
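
The plan, validate, execute, verify sequence can be sketched as follows. The plan format (a list of `key`/`old`/`new` records, as might be stored in changes.json) and all function names are assumptions made for this example; the source does not define a schema.

```python
import json


def validate_plan(plan: list[dict], current: dict) -> list[str]:
    """Check the plan's shape and that each 'old' value matches current state."""
    errors = []
    for i, change in enumerate(plan):
        if set(change) != {"key", "old", "new"}:
            errors.append(f"change {i}: expected keys key/old/new")
        elif current.get(change["key"]) != change["old"]:
            errors.append(f"change {i}: stale 'old' value for {change['key']!r}")
    return errors


def apply_plan(plan: list[dict], current: dict) -> dict:
    """Execute on a copy so the original state is untouched."""
    updated = dict(current)
    for change in plan:
        updated[change["key"]] = change["new"]
    return updated


def verify(plan: list[dict], updated: dict) -> bool:
    """Confirm every planned value is present in the result."""
    return all(updated.get(c["key"]) == c["new"] for c in plan)


def run(current: dict, plan_json: str) -> dict:
    plan = json.loads(plan_json)             # contents of a changes.json file
    errors = validate_plan(plan, current)
    if errors:
        raise ValueError("; ".join(errors))  # nothing has been modified yet
    updated = apply_plan(plan, current)
    if not verify(plan, updated):
        raise RuntimeError("verification failed")
    return updated
```

Validation runs before any state changes, so a bad plan costs nothing to discard.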

Writing Guidelines

Do	Avoid
Consistent terminology — one term per concept	Switching between “workflow” and “process” for the same thing
Third person in descriptions — “Processes files”	First person — “I help process files”
Descriptive file names — `form_validation_rules.md`	Sequence names — `doc2.md`
Forward slashes in all paths	Backslashes or platform-specific paths
One level deep for references — SKILL.md → resource.md	Nested references — SKILL.md → A.md → B.md
Table of contents for files over 100 lines	Long files without navigation
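
Two of the rules above (forward slashes, one level of references) are structural enough to check with a script. This is a minimal sketch that assumes references are written as markdown links to `.md` files and that file contents are supplied as an in-memory mapping; `lint_references` and the `LINK` pattern are hypothetical, not part of any existing tooling.

```python
import re

# Assumption: references look like markdown links, e.g. [rules](resources/rules.md)
LINK = re.compile(r"\]\(([^)\s]+\.md)\)")


def lint_references(files: dict[str, str], root: str = "SKILL.md") -> list[str]:
    """Flag backslash paths and references made from any file other than root."""
    problems = []
    for name, text in files.items():
        for target in LINK.findall(text):
            if "\\" in target:
                problems.append(f"{name}: backslash in path {target}")
            if name != root:
                problems.append(f"{name}: nested reference to {target}")
    return problems
```

A file other than SKILL.md that links onward to another `.md` file is reported as a nested reference.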

Anti-Patterns

Anti-Pattern	Fix
Too many options upfront	One default with escape hatch for edge cases
Deep reference nesting (A→B→C)	Keep references one level from SKILL.md
Inconsistent terminology	Choose one term per concept
Vague file names	Name by content, not sequence
Scripts that classify meaning via regex	Intelligence belongs in prompts, not scripts
Over-optimization that flattens personality	Preserve phrasing that captures the intended voice
Hard-failing when subagents are unavailable	Always include a sequential fallback path