Assessment Chapter 1 v1

AI Agent Anti-Patterns & Best Practices Workbook

From Ch. 1: Cognitive Architecture and Design Principles for AI Agents

The Agentic Enterprise Strategy · Excel Workbook

📋 What It Is

An 8-tab enterprise risk assessment workbook that catalogs 45 known failure modes of AI agent projects, grouped into 14 categories. It gives your team a structured, repeatable process to score those failure modes, mitigate them, and prove readiness at every phase gate from initial feasibility through post-production monitoring. This isn't a checklist you glance at once. It's an operational instrument with 257 live formulas that connect your risk scores to mitigation ownership, phase gate readiness, and an auto-generated executive Dashboard, so the assessment stays current as your project evolves.

Most agent initiatives don't fail because the team lacked talent. They fail because predictable anti-patterns — hallucination normalization, gold-plated reasoning, evaluation theater, context bloat from eager skill loading — show up late, after the prototype is praised and the budget is committed. This workbook forces early confrontation with those risks, when the cost of correction is still low.

The workbook covers the full lifecycle with 6 phase gates (from Discovery/Feasibility through Post-Production Review) and treats the emerging Skills & Capability Architecture domain in depth, including progressive skill loading, skill version control, and Agent Development Lifecycle (ADLC) integration patterns that reflect how enterprise agent platforms are actually being built in 2026.

👥 Who It's For

  • AI/ML engineering leads evaluating architectural risk before decisions become structural and irreversible
  • AI program managers overseeing agent initiative portfolios across multiple teams and use cases
  • Security, GRC, and compliance reviewers conducting go/no-go assessments before production deployments
  • AgentOps / SRE teams establishing operational baselines and monitoring thresholds for agent systems
  • Platform and infrastructure architects validating that agent designs don't create hidden infrastructure debt
  • Business stakeholders and sponsors who need a shared vocabulary for what can go wrong — and evidence that the team is managing it

Best used cross-functionally. Score as a team — disagreements about a score are where hidden risks surface.

When to Use It

The workbook maps to a 6-gate lifecycle. Use it at the gate that matches where your project stands, and again after incidents and at quarterly health checks:

  • Discovery/Feasibility — before committing to an agent approach. Validate the use case, assess technical feasibility, and identify skill gaps before architecture work begins
  • Architecture Review — when design patterns become structural and hard to reverse. Score early, while you can still change course
  • Scope Expansion — before adding new tools, workflows, models, or agent capabilities. Each expansion reintroduces risk
  • Pre-Production — the final quality gate before deployment. Surface unknown unknowns before they become production incidents
  • Go-Live — deployment authorization. Verify all Critical mitigations are complete, rollback plans are tested, and monitoring is operational
  • Post-Production Review — 30/60/90-day validation. Audit drift detection, cost actuals vs. projections, and make the scale/continue/retire decision with data
  • Post-incident retrospectives — map incidents back to the anti-pattern registry. Was this a known failure mode you scored low? Recalibrate
  • Quarterly health checks — agent systems drift. Scores change. New anti-patterns emerge. Re-assess periodically

📦 What It Produces

A completed assessment generates:

  • Executive Dashboard — auto-calculated risk summary with overall risk profile, top 5 risks by score (deduplicated), phase gate readiness across all 6 gates, and mitigation progress tracking across P1/P2/P3 priorities
  • Scored Risk Register — 45 anti-patterns scored by likelihood × impact, auto-classified into Critical (≥20), High (12–19), Medium (6–11), and Low (1–5) risk levels with conditional formatting
  • Prioritized Mitigation Plan — 122 pre-built mitigations with owner assignments (8 RACI roles via dropdown), due dates, verification methods, and status tracking
  • Phase Gate Readiness Report — 57 checkpoints across 6 gates with pass rates that auto-feed the Dashboard
  • Best Practice Roadmap — 38 implementation patterns across Basic, Intermediate, and Advanced maturity tiers with distribution chart
  • Shared Risk Vocabulary — a 27-term glossary covering agent-specific concepts (MCP, A2A, ADLC, Skill Composition, Progressive Loading) plus calibrated scoring scales

Together, these outputs function as a repeatable risk discipline for agent engineering: not a one-time audit, but a living document that tracks your project's risk posture from feasibility through post-production.
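As a rough illustration of the Dashboard's "top 5 risks by score (deduplicated)" output, the sketch below ranks scored rows and keeps each anti-pattern once. The workbook's actual formulas are not shown in this description, so the deduplication rule (keep the highest score per anti-pattern name), the tie-break (alphabetical), and the sample scores are all assumptions made for illustration.

```python
from typing import Iterable


def top_risks(rows: Iterable[tuple[str, int, int]], n: int = 5) -> list[tuple[str, int]]:
    """Return the n highest-scoring anti-patterns, one entry per name.

    Each row is (anti_pattern_name, likelihood, impact) with both values on a 1-5 scale.
    Assumption: "deduplicated" means a name appears once, at its highest score.
    """
    best: dict[str, int] = {}
    for name, likelihood, impact in rows:
        score = likelihood * impact  # Risk Score = likelihood x impact
        if name not in best or score > best[name]:
            best[name] = score
    # Sort by score descending, then by name so ties come out in a stable order.
    ranked = sorted(best.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:n]


# Hypothetical scores; only the anti-pattern names come from the workbook.
sample_rows = [
    ("Prompt Injection Vulnerability", 4, 5),
    ("No Observability", 4, 4),
    ("Hallucination Normalization", 3, 5),
    ("No Observability", 3, 4),  # duplicate entry with a lower score
    ("Framework Churn", 2, 3),
    ("No Cost Tracking", 3, 4),
    ("Demo-Driven Development", 4, 3),
    ("Data Silo Blindness", 2, 2),
]

for name, score in top_risks(sample_rows):
    print(f"{score:>2}  {name}")
```

Keeping the tie-break explicit matters in practice: several anti-patterns often share a score of 12, and an unstable ordering would make the executive summary change between recalculations.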

🚀 How to Use It — Quickstart

  • Step 1. Open the Scoring Guide tab. Walk through the Likelihood Scale (1–5), Impact Scale (1–5), and Risk Matrix with your team before anyone starts scoring. Agreeing up front on what each scale value means prevents the single biggest source of scoring inconsistency: team members reading the scales differently.
  • Step 2. Switch to Risk Assessment. Fill in project metadata (Project, Agent, Assessed By, Date, Review Stage), then walk through all 45 anti-patterns as a team. For each: set Applicable (Yes/No/Partial), score Likelihood (1–5) and Impact (1–5). Risk Score and Risk Level auto-calculate.
  • Step 3. Return to the Dashboard tab. Your overall risk profile, top 5 risks, phase gate readiness across all 6 gates, and mitigation progress are auto-generated. This is your executive summary.
  • Step 4. Open Mitigation Actions. For your Critical and High risks, 122 pre-built mitigations are already mapped with verification methods. Assign owners using the 8-role dropdown, set due dates, and confirm P1/P2/P3 priority.
  • Step 5. Complete the Phase Gate Checklist for your current gate. Mark each checkpoint as Pass, Fail, Conditional, or N/A. The Dashboard's Phase Gate Readiness auto-updates: ≥80% = Ready, 50% to <80% = Conditional, <50% = Not Ready.
  • Step 6. Set a review cadence. Re-score after scope changes, incidents, or quarterly milestones. Use the Post-Production Review gate for 30/60/90-day validation cycles.
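
The readiness rule in Step 5 can be sketched in a few lines of Python. This is a minimal illustration, not the workbook's formula: it assumes N/A checkpoints are excluded from the denominator and that only Pass counts toward the pass rate (Conditional does not). The function name `gate_readiness` is hypothetical.

```python
from collections import Counter

VALID_STATUSES = {"Pass", "Fail", "Conditional", "N/A"}


def gate_readiness(statuses: list[str]) -> tuple[float, str]:
    """Return (pass rate in percent, readiness label) for one phase gate.

    Assumptions (not confirmed by the workbook description):
      - N/A checkpoints are left out of the denominator.
      - Only "Pass" counts as passed; "Conditional" does not.
      - A gate with no scored checkpoints is reported as Not Ready.
    """
    unknown = set(statuses) - VALID_STATUSES
    if unknown:
        raise ValueError(f"unknown checkpoint status: {sorted(unknown)}")

    counts = Counter(statuses)
    scored = len(statuses) - counts["N/A"]
    if scored == 0:
        return 0.0, "Not Ready"

    passed = counts["Pass"]
    # Compare with integers so the 80% and 50% boundaries are exact.
    if passed * 100 >= 80 * scored:
        label = "Ready"
    elif passed * 100 >= 50 * scored:
        label = "Conditional"
    else:
        label = "Not Ready"
    return 100 * passed / scored, label


# Example: a 14-checkpoint Pre-Production gate with hypothetical results.
pre_production = ["Pass"] * 10 + ["Fail"] * 2 + ["Conditional"] + ["N/A"]
rate, label = gate_readiness(pre_production)
print(f"{rate:.1f}% -> {label}")
```

Under these assumptions, 10 passes out of 13 scored checkpoints is about 76.9%, which lands in the Conditional band. If your team decides that Conditional should count as a partial pass, that choice belongs in the Scoring Guide discussion from Step 1.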

👁 Preview — What's Inside

8 Tabs, 257 Live Formulas

| Tab | Type | What It Does |
|---|---|---|
| Dashboard | CORE | Auto-generated risk summary, top 5 risks, phase gate readiness across 6 gates, mitigation progress |
| Scoring Guide | REF | Likelihood/impact scales, risk matrix, protocol layer reference, skill architecture reference, 27-term glossary |
| Anti-Pattern Registry | REF | Complete catalog of all 45 anti-patterns with descriptions, root causes, warning signs, and real-world examples |
| Risk Assessment | CORE ★ | Primary input tab. Score all 45 anti-patterns by likelihood and impact with pre-populated default risk values |
| Mitigation Actions | CORE | 122 pre-built mitigations with 8-role RACI dropdown, verification methods, and status tracking |
| Phase Gate Checklist | CORE | 57 checkpoints across 6 gates with evidence requirements and readiness scoring |
| Best Practices | REF | 38 implementation patterns across Basic, Intermediate, and Advanced maturity tiers |
| Workshop Guide | REF | Facilitation guide for running team assessment sessions: agenda, timing, tips |

45 Anti-Patterns Across 14 Categories

  • Over-Reliance on Prompting — Prompt-as-Architecture, Single-Model Dependency, Hallucination Normalization
  • Poor System Integration — API-as-Afterthought, Data Silo Blindness
  • Over-Engineering — Premature Multi-Agent Orchestration, Gold-Plated Reasoning, Framework Churn
  • Governance Neglect — Missing Human-in-the-Loop, Audit Trail Absence, Compliance Avoidance
  • AgentOps Absence — No Observability, No Cost Tracking, No Drift Detection
  • Tool Sprawl & Misuse — Uncurated Tool Inventory, No Tool Authentication, Missing Tool Verification
  • Memory & Context Mismanagement — Unbounded Context Windows, RAG Without Strategy, No Memory Architecture
  • Evaluation Theater — Demo-Driven Development, Vibes-Based Assessment, No Production Metrics, No Multi-Turn Evaluation
  • Scope Creep & Agent Proliferation — Agent for Everything, No Retirement Criteria, Capability Creep Without Guardrails
  • Security & Trust Gaps — Prompt Injection Vulnerability, Excessive Agent Permissions, No Agent Identity Framework, Unguarded Agent Commerce
  • Data Quality & Pipeline Fragility — Garbage-In-Garbage-Out Blindness, Brittle ETL-to-Agent Pipelines
  • Organizational & Change Management — Pilot-to-Production Gap, Stakeholder Misalignment, Change Management Neglect
  • Protocol & Interoperability Gaps — Protocol Ignorance, Protocol Fragmentation, MCP Security Neglect
  • Skills & Capability Architecture — Monolithic Agent Capabilities, No Skill Governance, Monolithic Skill Design, No Skill Testing or Evaluation, Context Bloat from Eager Loading, Skill Drift without Version Control
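
If you want to track the catalog outside Excel, the list above maps naturally onto a plain dictionary. The sketch below copies the category and anti-pattern names from the list and checks the headline counts; the variable and function names are illustrative, not part of the workbook.

```python
# Category -> anti-pattern names, copied from the catalog listed above.
ANTI_PATTERNS: dict[str, list[str]] = {
    "Over-Reliance on Prompting": [
        "Prompt-as-Architecture", "Single-Model Dependency", "Hallucination Normalization"],
    "Poor System Integration": ["API-as-Afterthought", "Data Silo Blindness"],
    "Over-Engineering": [
        "Premature Multi-Agent Orchestration", "Gold-Plated Reasoning", "Framework Churn"],
    "Governance Neglect": [
        "Missing Human-in-the-Loop", "Audit Trail Absence", "Compliance Avoidance"],
    "AgentOps Absence": ["No Observability", "No Cost Tracking", "No Drift Detection"],
    "Tool Sprawl & Misuse": [
        "Uncurated Tool Inventory", "No Tool Authentication", "Missing Tool Verification"],
    "Memory & Context Mismanagement": [
        "Unbounded Context Windows", "RAG Without Strategy", "No Memory Architecture"],
    "Evaluation Theater": [
        "Demo-Driven Development", "Vibes-Based Assessment",
        "No Production Metrics", "No Multi-Turn Evaluation"],
    "Scope Creep & Agent Proliferation": [
        "Agent for Everything", "No Retirement Criteria",
        "Capability Creep Without Guardrails"],
    "Security & Trust Gaps": [
        "Prompt Injection Vulnerability", "Excessive Agent Permissions",
        "No Agent Identity Framework", "Unguarded Agent Commerce"],
    "Data Quality & Pipeline Fragility": [
        "Garbage-In-Garbage-Out Blindness", "Brittle ETL-to-Agent Pipelines"],
    "Organizational & Change Management": [
        "Pilot-to-Production Gap", "Stakeholder Misalignment", "Change Management Neglect"],
    "Protocol & Interoperability Gaps": [
        "Protocol Ignorance", "Protocol Fragmentation", "MCP Security Neglect"],
    "Skills & Capability Architecture": [
        "Monolithic Agent Capabilities", "No Skill Governance", "Monolithic Skill Design",
        "No Skill Testing or Evaluation", "Context Bloat from Eager Loading",
        "Skill Drift without Version Control"],
}


def catalog_summary(catalog: dict[str, list[str]]) -> tuple[int, int]:
    """Return (number of categories, total number of anti-patterns)."""
    return len(catalog), sum(len(names) for names in catalog.values())


categories, total = catalog_summary(ANTI_PATTERNS)
print(f"{total} anti-patterns across {categories} categories")
```

A structure like this is also a convenient key for post-incident retrospectives, where each incident is mapped back to one named anti-pattern.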

Built-In Scoring Framework

  • Likelihood Scale (1–5): Rare → Unlikely → Possible → Likely → Almost Certain — with calibration examples
  • Impact Scale (1–5): Negligible → Minor → Moderate → Major → Severe — mapped to business consequences
  • Risk Matrix: 5×5 grid with auto-classification into Critical, High, Medium, Low
  • Thresholds: Critical ≥20 (block deployment), High 12–19 (action required), Medium 6–11 (address in sprint), Low 1–5 (accept or address opportunistically)
  • Default Risk Values: Every anti-pattern includes a pre-populated default score based on industry prevalence
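
The scoring arithmetic is simple enough to sketch in Python. The thresholds below are the ones stated above; the function names are hypothetical, and the workbook itself implements this logic with spreadsheet formulas that are not reproduced here.

```python
def risk_score(likelihood: int, impact: int) -> int:
    """Risk Score = likelihood x impact, with both inputs on the 1-5 scales."""
    for label, value in (("likelihood", likelihood), ("impact", impact)):
        if not 1 <= value <= 5:
            raise ValueError(f"{label} must be between 1 and 5, got {value}")
    return likelihood * impact


def risk_level(score: int) -> str:
    """Classify a 1-25 score using the thresholds stated in this section."""
    if not 1 <= score <= 25:
        raise ValueError(f"score must be between 1 and 25, got {score}")
    if score >= 20:
        return "Critical"  # block deployment
    if score >= 12:
        return "High"      # action required
    if score >= 6:
        return "Medium"    # address in sprint
    return "Low"           # accept or address opportunistically


# Build the 5x5 risk matrix: rows are likelihood 1-5, columns are impact 1-5.
matrix = [
    [risk_level(risk_score(likelihood, impact)) for impact in range(1, 6)]
    for likelihood in range(1, 6)
]

for likelihood, row in enumerate(matrix, start=1):
    print(likelihood, " ".join(f"{level:<8}" for level in row))
```

Because the score is a product of two 1–5 values, only three cells reach Critical: 4×5, 5×4, and 5×5. That is worth knowing when calibrating, since a single step down on either scale moves a risk out of the deployment-blocking band.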

Phase Gate Checkpoints — 6-Gate Lifecycle

  • Discovery/Feasibility (8 checkpoints) — Use case validation, feasibility POC, data assessment, skill inventory and gap analysis
  • Architecture Review (10 checkpoints) — Component boundaries, hallucination controls, memory architecture, skill decomposition patterns
  • Scope Expansion (7 checkpoints) — Risk re-assessment, tool governance, stakeholder alignment, protocol and skill impact assessment
  • Pre-Production (14 checkpoints) — Evaluation suite, multi-turn testing, observability, compliance review, skill evaluation suite passing
  • Go-Live (9 checkpoints) — Critical mitigations complete, rollback tested, monitoring active, skill monitoring and rollback verified
  • Post-Production Review (9 checkpoints) — 30-day metrics, drift detection audit, cost actuals review, skill performance audit, scale/continue/retire decision

📝 Version History

v1 · March 2026

  • 8-tab enterprise risk assessment workbook with 257 live formulas.
  • 45 anti-patterns across 14 categories including Skills & Capability Architecture (Monolithic Skill Design, No Skill Testing or Evaluation, Context Bloat from Eager Loading, Skill Drift without Version Control).
  • 122 pre-built mitigations with verification methods and 8-role RACI dropdown.
  • 57 phase gate checkpoints across 6-gate lifecycle (Discovery/Feasibility, Architecture Review, Scope Expansion, Pre-Production, Go-Live, Post-Production Review).
  • 38 best practices across Basic, Intermediate, and Advanced maturity tiers with distribution chart.
  • Executive Dashboard with auto-calculated risk summary, top 5 risks, phase gate readiness, and mitigation progress tracking.
  • Protocol Layer Reference (MCP, A2A, ACP, AG-UI, ANP), Skill Architecture Reference, and 27-term glossary.
  • Workshop facilitation guide included.

Free with email registration. No password needed.

Details

  • Type: Assessment
  • Chapter: 1
  • Format: Excel Workbook
  • Version: 1
  • License: Personal Use
