New section for org-level Claude Code governance — fills the gap between individual dev security (security-hardening.md) and what engineering managers actually need when deploying at scale. New files: - guide/security/enterprise-governance.md (1117 lines) 6 sections: local/shared split, usage charter, MCP approval workflow, 4 guardrail tiers (Starter/Standard/Strict/Regulated), policy enforcement at scale, SOC2/ISO27001 compliance guide - examples/scripts/mcp-registry-template.yaml Org-level MCP registry with approved/pending/denied tracking - examples/hooks/bash/governance-enforcement-hook.sh SessionStart hook validating MCPs against approved registry - examples/scripts/ai-usage-charter-template.md Full charter template with data classification, use case rules, compliance mapping (SOC2/ISO27001/HIPAA/PCI DSS/GDPR) Enriched sections: - adoption-approaches.md: enterprise rollout (50+ devs) with 3-phase approach and common mistakes - observability.md: manager audit checklist, compliance reporting - ai-traceability.md: evidence collection table for auditors - production-safety.md + security-hardening.md: cross-references with explicit scope boundaries Integration: guide/README.md, reference.yaml (22 new entries), CHANGELOG.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
980 lines
26 KiB
Markdown
980 lines
26 KiB
Markdown
---
|
|
title: "Production Safety Rules"
|
|
description: "Non-negotiable safety rules for teams deploying Claude Code in production environments"
|
|
tags: [security, guide, devops]
|
|
---
|
|
|
|
# Production Safety Rules
|
|
|
|
> **Audience**: Teams deploying Claude Code in production environments.
|
|
> **For solo learners**: See [Getting Started](./ultimate-guide.md#1-getting-started) instead.
|
|
|
|
---
|
|
|
|
## TL;DR (30 seconds)
|
|
|
|
**6 non-negotiable rules for production teams**:
|
|
|
|
1. ✅ **Port Stability**: Never change backend/frontend ports
|
|
2. ✅ **Database Safety**: Always backup before destructive ops
|
|
3. ✅ **Feature Completeness**: Never ship half-implemented features
|
|
4. ✅ **Infrastructure Lock**: Docker/env changes require permission
|
|
5. ✅ **Dependency Safety**: No new dependencies without approval
|
|
6. ✅ **Pattern Following**: Conform to existing codebase conventions
|
|
|
|
---
|
|
|
|
## When to Use These Rules
|
|
|
|
| Project Type | Use These Rules? | Why |
|
|
|--------------|------------------|-----|
|
|
| Learning / Tutorials | ❌ No | Too restrictive for exploration |
|
|
| Solo prototypes | ❌ No | Overhead not worth it |
|
|
| Small teams (2-3), staging env | ⚠️ Partial | Rules 1, 3, 6 only |
|
|
| Production apps, multi-dev teams | ✅ Yes | All 6 rules |
|
|
| Regulated industries (HIPAA, SOC2) | ✅ Yes + add compliance rules | Critical safety |
|
|
|
|
---
|
|
|
|
## Rule 1: Port Stability
|
|
|
|
### The Problem
|
|
|
|
Changing ports breaks:
|
|
- Local development environments
|
|
- Docker Compose configurations
|
|
- Deployed service configs
|
|
- Team member setups
|
|
|
|
**Real incident**: Backend port changed from 3000 → 8080 during refactor. All developers lost a day re-configuring local envs. Staging deployment failed silently because nginx proxy still pointed to 3000.
|
|
|
|
### The Rule
|
|
|
|
**Never modify backend/frontend ports without explicit team permission.**
|
|
|
|
### Implementation
|
|
|
|
**Option A: Permission deny in `settings.json`**
|
|
|
|
```json
|
|
{
|
|
"permissions": {
|
|
"deny": [
|
|
"Edit(docker-compose.yml:*ports*)",
|
|
"Edit(package.json:*PORT*)",
|
|
"Edit(.env.example:*PORT*)",
|
|
"Edit(vite.config.ts:*port*)"
|
|
]
|
|
}
|
|
}
|
|
```
|
|
|
|
**Option B: Pre-commit hook**
|
|
|
|
```bash
|
|
# .claude/hooks/PreToolUse.sh
|
|
if [[ "$TOOL" == "Edit" ]]; then
|
|
FILE=$(echo "$INPUT" | jq -r '.tool.input.file_path')
|
|
CONTENT=$(echo "$INPUT" | jq -r '.tool.input.new_string')
|
|
|
|
if [[ "$FILE" =~ (docker-compose|vite.config|package.json) ]] && \
|
|
[[ "$CONTENT" =~ (port|PORT):[[:space:]]*[0-9] ]]; then
|
|
echo "⚠️ BLOCKED: Port modification detected in $FILE"
|
|
echo "Ports must remain stable across team. Request permission first."
|
|
exit 2
|
|
fi
|
|
fi
|
|
```
|
|
|
|
**Option C: CLAUDE.md constraint**
|
|
|
|
```markdown
|
|
## Port Configuration
|
|
|
|
**CRITICAL**: Ports are locked for team coordination.
|
|
|
|
Current ports:
|
|
- Frontend (Vite): 5173
|
|
- Backend (Express): 3000
|
|
- Database: 5432
|
|
|
|
To change ports:
|
|
1. Create RFC document in `/docs/rfcs/`
|
|
2. Get team approval (3+ reviewers)
|
|
3. Update all environments simultaneously
|
|
4. Notify team 48h in advance
|
|
```
|
|
|
|
### Edge Cases
|
|
|
|
| Scenario | Behavior |
|
|
|----------|----------|
|
|
| Adding NEW service | OK (doesn't break existing) |
|
|
| Changing test env port | OK (isolated from dev/prod) |
|
|
| Port conflict on machine | Ask user to resolve locally (.env.local) |
|
|
|
|
---
|
|
|
|
## Rule 2: Database Safety
|
|
|
|
### The Problem
|
|
|
|
Accidental deletions in production = data loss.
|
|
|
|
**Real incidents**:
|
|
- `DELETE FROM users WHERE id = 123` → Forgot `WHERE` → All users deleted
|
|
- `DROP TABLE sessions` during cleanup → Production table dropped
|
|
- Migration rollback → Data loss because no backup
|
|
|
|
### The Rule
|
|
|
|
**Always backup before destructive operations.**
|
|
|
|
Destructive operations:
|
|
- `DELETE FROM` (without `LIMIT 1`)
|
|
- `DROP TABLE`
|
|
- `TRUNCATE`
|
|
- `ALTER TABLE ... DROP COLUMN`
|
|
- Database migrations that can't rollback
|
|
|
|
### Implementation
|
|
|
|
**Option A: Pre-tool hook with backup enforcement**
|
|
|
|
```bash
|
|
# .claude/hooks/PreToolUse.sh
|
|
#!/bin/bash
|
|
INPUT=$(cat)
|
|
TOOL=$(echo "$INPUT" | jq -r '.tool.name')
|
|
|
|
if [[ "$TOOL" == "Bash" ]]; then
|
|
COMMAND=$(echo "$INPUT" | jq -r '.tool.input.command')
|
|
|
|
# Detect destructive database operations
|
|
if [[ "$COMMAND" =~ (DROP TABLE|DELETE FROM|TRUNCATE|ALTER.*DROP) ]]; then
|
|
echo "🚨 BLOCKED: Destructive database operation detected"
|
|
echo ""
|
|
echo "Required steps:"
|
|
echo "1. Create backup: pg_dump -U user dbname > backup_\$(date +%Y%m%d_%H%M%S).sql"
|
|
echo "2. Verify backup size is reasonable"
|
|
echo "3. Re-run after backup confirmation"
|
|
exit 2
|
|
fi
|
|
fi
|
|
|
|
exit 0
|
|
```
|
|
|
|
**Option B: Migration safety wrapper**
|
|
|
|
```bash
|
|
# scripts/safe-migrate.sh
|
|
#!/bin/bash
|
|
set -e
|
|
|
|
echo "🔍 Pre-migration checks..."
|
|
|
|
# 1. Check environment
|
|
if [[ "$NODE_ENV" == "production" ]]; then
|
|
echo "❌ BLOCKED: Use migration service for production"
|
|
exit 1
|
|
fi
|
|
|
|
# 2. Create backup
|
|
BACKUP_FILE="backups/pre-migration-$(date +%Y%m%d_%H%M%S).sql"
|
|
mkdir -p backups
|
|
pg_dump $DATABASE_URL > "$BACKUP_FILE"
|
|
echo "✅ Backup created: $BACKUP_FILE"
|
|
|
|
# 3. Run migration
|
|
echo "🚀 Running migration..."
|
|
npm run prisma:migrate:dev
|
|
|
|
# 4. Verify
|
|
echo "🔍 Verifying database state..."
|
|
npm run prisma:validate
|
|
|
|
echo "✅ Migration complete. Backup: $BACKUP_FILE"
|
|
```
|
|
|
|
**Option C: CLAUDE.md protocol**
|
|
|
|
```markdown
|
|
## Database Operations
|
|
|
|
### Destructive Operations Protocol
|
|
|
|
**NEVER run these without backup**:
|
|
- DELETE, DROP, TRUNCATE, ALTER...DROP
|
|
|
|
**Required steps**:
|
|
1. Announce in #dev-ops Slack channel
|
|
2. Create backup: `./scripts/backup-db.sh`
|
|
3. Verify backup: `ls -lh backups/` (should be >0 bytes)
|
|
4. Execute in staging FIRST
|
|
5. Wait 24h for issues
|
|
6. Execute in production with on-call engineer present
|
|
|
|
**Emergency rollback**:
|
|
```bash
|
|
psql $DATABASE_URL < backups/[latest].sql
|
|
```
|
|
```
|
|
|
|
### MCP Database Safety
|
|
|
|
If using MCP database servers (Postgres, MySQL, etc.):
|
|
|
|
```json
|
|
{
|
|
"mcpServers": {
|
|
"database": {
|
|
"command": "npx",
|
|
"args": ["@modelcontextprotocol/server-postgres"],
|
|
"env": {
|
|
"POSTGRES_URL": "postgres://readonly:***@dev-db.example.com:5432/appdb"
|
|
},
|
|
"comment": "READ-ONLY user for safety"
|
|
}
|
|
}
|
|
}
|
|
```
|
|
|
|
**Critical**: Use read-only database users for MCP. See [Data Privacy Guide](./data-privacy.md#risk-2-mcp-database-access).
|
|
|
|
---
|
|
|
|
## Rule 3: Feature Completeness
|
|
|
|
### The Problem
|
|
|
|
Claude Code sometimes "half-asses" features when context runs low:
|
|
- Deletes existing functionality instead of fixing bugs
|
|
- Adds `TODO` comments for core features
|
|
- Leaves error states unhandled
|
|
- Creates mock implementations
|
|
|
|
**Real incidents**:
|
|
- Payment validation "fixed" by removing validation entirely
|
|
- Error handling "added" with `throw new Error("Not implemented")`
|
|
- Feature "completed" with `// TODO: Add actual logic here`
|
|
|
|
### The Rule
|
|
|
|
**Never ship half-implemented features. If you start, you finish to working state.**
|
|
|
|
### Implementation
|
|
|
|
**Option A: CLAUDE.md constraint**
|
|
|
|
```markdown
|
|
## Feature Implementation Standards
|
|
|
|
### NON-NEGOTIABLE
|
|
|
|
1. **No TODOs for core functionality**
|
|
- TODOs allowed ONLY for future enhancements
|
|
- Core features must be complete and working
|
|
|
|
2. **No mock implementations**
|
|
- No `throw new Error("Not implemented")`
|
|
- No fake data generators in production code paths
|
|
|
|
3. **Complete error handling**
|
|
- Every async call has try/catch
|
|
- Every user input is validated
|
|
- Every API call has timeout and retry logic
|
|
|
|
4. **Downgrade = Delete the feature entirely**
|
|
- If you can't fix properly, remove the feature
|
|
- Document why in commit message
|
|
- Create issue for proper implementation
|
|
|
|
### Validation
|
|
|
|
Before accepting changes, verify:
|
|
- [ ] No `TODO` in modified files (except future enhancements)
|
|
- [ ] No `throw new Error("Not implemented")`
|
|
- [ ] No commented-out code without explanation
|
|
- [ ] All new functions have error handling
|
|
```
|
|
|
|
**Option B: Pre-commit git hook**
|
|
|
|
```bash
|
|
# .git/hooks/pre-commit
|
|
#!/bin/bash
|
|
|
|
# Check staged files for "half-assing" patterns
|
|
STAGED=$(git diff --cached --name-only --diff-filter=ACM)
|
|
|
|
for FILE in $STAGED; do
|
|
if [[ "$FILE" =~ \.(ts|tsx|js|jsx|py)$ ]]; then
|
|
# Check for TODOs in core logic (not tests)
|
|
if ! [[ "$FILE" =~ test|spec ]]; then
|
|
if git diff --cached "$FILE" | grep -E "^\+.*TODO.*implement|^\+.*Not implemented"; then
|
|
echo "❌ COMMIT BLOCKED: TODO/Not implemented in $FILE"
|
|
echo " Complete the feature or remove it entirely."
|
|
exit 1
|
|
fi
|
|
fi
|
|
|
|
# Check for mock placeholders
|
|
if git diff --cached "$FILE" | grep -E "^\+.*(MOCK_DATA|fakeData|placeholder)"; then
|
|
echo "⚠️ WARNING: Mock data detected in $FILE"
|
|
echo " Ensure this is intentional for staging/dev only."
|
|
fi
|
|
fi
|
|
done
|
|
|
|
exit 0
|
|
```
|
|
|
|
**Option C: Output evaluator command**
|
|
|
|
```bash
|
|
# Before committing
|
|
/validate-changes
|
|
|
|
# This runs the output-evaluator agent (see examples/agents/output-evaluator.md)
|
|
# which scores changes on:
|
|
# - Correctness (10/10)
|
|
# - Completeness (10/10) ← Detects half-assing
|
|
# - Safety (10/10)
|
|
```
|
|
|
|
---
|
|
|
|
## Rule 4: Infrastructure Lock
|
|
|
|
### The Problem
|
|
|
|
Claude might modify infrastructure configs without understanding production implications:
|
|
- Changes Docker Compose volumes → data loss
|
|
- Modifies `.env.example` → breaks onboarding
|
|
- Updates Terraform → unintended resource changes
|
|
- Tweaks Kubernetes manifests → downtime
|
|
|
|
### The Rule
|
|
|
|
**Infrastructure modifications require explicit team permission.**
|
|
|
|
Files to protect:
|
|
- `docker-compose.yml`, `Dockerfile`
|
|
- `.env.example` (templates, NOT personal .env.local)
|
|
- `kubernetes/`, `k8s/`, `terraform/`, `helm/`
|
|
- CI/CD configs (`.github/workflows/`, `.gitlab-ci.yml`)
|
|
- Database schemas (requires migration review)
|
|
|
|
### Implementation
|
|
|
|
**Option A: Permission deny**
|
|
|
|
```json
|
|
{
|
|
"permissions": {
|
|
"deny": [
|
|
"Edit(docker-compose.yml)",
|
|
"Edit(Dockerfile)",
|
|
"Edit(.env.example)",
|
|
"Edit(terraform/**)",
|
|
"Edit(kubernetes/**)",
|
|
"Edit(.github/workflows/**)",
|
|
"Edit(prisma/schema.prisma)"
|
|
]
|
|
}
|
|
}
|
|
```
|
|
|
|
**Option B: CLAUDE.md rule**
|
|
|
|
```markdown
|
|
## Infrastructure Changes
|
|
|
|
You are **FORBIDDEN** from modifying these without explicit permission:
|
|
|
|
- `docker-compose.yml`, `Dockerfile`
|
|
- `.env.example` (template for new developers)
|
|
- `terraform/`, `kubernetes/` (infrastructure as code)
|
|
- `.github/workflows/` (CI/CD pipelines)
|
|
- `prisma/schema.prisma` (database schema)
|
|
|
|
**If infrastructure change is needed**:
|
|
1. Ask user: "This requires infrastructure change. Should I create an RFC?"
|
|
2. Create RFC document in `docs/rfcs/YYYYMMDD-<title>.md`
|
|
3. Do NOT modify files until RFC approved
|
|
```
|
|
|
|
**Note**: Personal `.env.local` files are OK to modify (they're gitignored).
|
|
|
|
---
|
|
|
|
## Rule 5: Dependency Safety
|
|
|
|
### The Problem
|
|
|
|
Adding dependencies without team approval:
|
|
- Increases bundle size (performance)
|
|
- Introduces security vulnerabilities
|
|
- Creates license compliance issues
|
|
- Adds maintenance burden
|
|
|
|
**Real incidents**:
|
|
- Added `moment.js` (200KB) when `date-fns` (tiny) already in project
|
|
- Installed `lodash` when project uses `ramda`
|
|
- Added GPL library → license violation for proprietary codebase
|
|
|
|
### The Rule
|
|
|
|
**No new dependencies without explicit approval.**
|
|
|
|
### Implementation
|
|
|
|
**Option A: Permission deny on package managers**
|
|
|
|
```json
|
|
{
|
|
"permissions": {
|
|
"deny": [
|
|
"Bash(npm install *)",
|
|
"Bash(npm i *)",
|
|
"Bash(pnpm add *)",
|
|
"Bash(yarn add *)",
|
|
"Bash(pip install *)",
|
|
"Bash(poetry add *)"
|
|
],
|
|
"allow": [
|
|
"Bash(npm install)",
|
|
"Bash(pnpm install)",
|
|
"Bash(pip install -r requirements.txt)"
|
|
]
|
|
}
|
|
}
|
|
```
|
|
|
|
**Option B: CLAUDE.md protocol**
|
|
|
|
```markdown
|
|
## Dependency Management
|
|
|
|
### Immutable Stack Rule
|
|
|
|
**You are FORBIDDEN from adding new dependencies** (`npm install <package>`).
|
|
|
|
**If new dependency is needed**:
|
|
1. Check if existing dependency solves it:
|
|
- Date manipulation? Use existing `date-fns`
|
|
- HTTP requests? Use existing `axios`
|
|
- State management? Use existing `zustand`
|
|
2. If genuinely needed, ASK:
|
|
- "I need [package] for [reason]. Existing alternatives: [X, Y]. Should I add it?"
|
|
3. Wait for explicit approval
|
|
4. User will run: `npm install <package>` manually
|
|
|
|
**Allowed without asking**:
|
|
- `npm install` (installs existing package.json deps)
|
|
- Dev dependencies for testing (`-D` flag after approval)
|
|
```
|
|
|
|
**Option C: Pre-tool hook**
|
|
|
|
```bash
|
|
# .claude/hooks/PreToolUse.sh
|
|
if [[ "$TOOL" == "Bash" ]]; then
|
|
COMMAND=$(echo "$INPUT" | jq -r '.tool.input.command')
|
|
|
|
# Block dependency installation
|
|
if [[ "$COMMAND" =~ (npm|pnpm|yarn)[[:space:]]+(install|add|i)[[:space:]]+[a-zA-Z] ]]; then
|
|
echo "🚨 BLOCKED: New dependency installation"
|
|
echo ""
|
|
echo "Dependencies must be approved by team lead."
|
|
echo "Create PR with RFC explaining:"
|
|
echo "1. Why this dependency is needed"
|
|
echo "2. Alternatives considered"
|
|
echo "3. Bundle size impact"
|
|
echo "4. License compatibility"
|
|
exit 2
|
|
fi
|
|
|
|
# Allow: npm install (no args), npm install -g, pnpm install
|
|
if [[ "$COMMAND" =~ ^(npm|pnpm|yarn)[[:space:]]+install$ ]]; then
|
|
exit 0
|
|
fi
|
|
fi
|
|
```
|
|
|
|
---
|
|
|
|
## Rule 6: Pattern Following
|
|
|
|
### The Problem
|
|
|
|
Claude introduces new patterns inconsistent with codebase:
|
|
- Uses `class` components when project is functional React
|
|
- Imports `lodash` when project uses `ramda`
|
|
- Writes REST endpoints when project is GraphQL
|
|
- Uses `fetch` when project standardized on `axios`
|
|
|
|
### The Rule
|
|
|
|
**Conform to existing codebase conventions. Check before implementing.**
|
|
|
|
### Implementation
|
|
|
|
**Option A: CLAUDE.md conventions**
|
|
|
|
```markdown
|
|
## Code Conventions
|
|
|
|
### Tech Stack (DO NOT DEVIATE)
|
|
|
|
**Frontend**:
|
|
- React 18 with **function components + hooks** (NO class components)
|
|
- State: Zustand (NOT Redux, Context)
|
|
- HTTP: axios (NOT fetch)
|
|
- Styling: Tailwind CSS (NOT styled-components, emotion)
|
|
- Forms: React Hook Form + Zod
|
|
|
|
**Backend**:
|
|
- Node.js + Express
|
|
- Database: Prisma ORM (NOT raw SQL, TypeORM)
|
|
- Auth: JWT via jose library
|
|
- Validation: Zod schemas
|
|
|
|
**Testing**:
|
|
- Unit: Vitest (NOT Jest)
|
|
- E2E: Playwright (NOT Cypress)
|
|
|
|
### Import Patterns
|
|
|
|
**Always use**:
|
|
```typescript
|
|
import { useState } from 'react' // ✅ Named imports
|
|
import axios from 'axios' // ✅ Default import
|
|
```
|
|
|
|
**Never use**:
|
|
```typescript
|
|
import React from 'react' // ❌ Deprecated pattern
|
|
import * as axios from 'axios' // ❌ Namespace import
|
|
```
|
|
|
|
### File Structure
|
|
|
|
```
|
|
src/
|
|
features/ ← Group by feature (NOT by type)
|
|
auth/
|
|
components/
|
|
hooks/
|
|
api/
|
|
shared/ ← Shared utilities
|
|
components/
|
|
hooks/
|
|
```
|
|
|
|
### Design System
|
|
|
|
UI changes MUST use existing design system:
|
|
- Check `src/shared/components/` for existing components
|
|
- Use Tailwind utility classes from `tailwind.config.js`
|
|
- Colors from `colors.ts` palette ONLY
|
|
- Typography from `typography.config.js`
|
|
|
|
**Before creating new component**:
|
|
1. Search: `rg "Button" src/shared/components/`
|
|
2. If exists, use it
|
|
3. If doesn't exist, ask: "Should I create new Button component or use existing primitive?"
|
|
```
|
|
|
|
**Option B: Pre-implementation analysis**
|
|
|
|
```markdown
|
|
## Before Implementing
|
|
|
|
**ALWAYS** run these checks:
|
|
|
|
1. **Pattern check**:
|
|
```bash
|
|
# How does codebase handle X?
|
|
rg "import.*useState" src/ # Check React patterns
|
|
rg "axios\." src/ # Check HTTP patterns
|
|
rg "prisma\." src/ # Check DB patterns
|
|
```
|
|
|
|
2. **Existing components**:
|
|
```bash
|
|
# Does component already exist?
|
|
find src/shared/components -name "*Button*"
|
|
find src/shared/components -name "*Modal*"
|
|
```
|
|
|
|
3. **Ask user if unclear**:
|
|
- "I see project uses [X]. Should I follow this pattern or use [Y]?"
|
|
```
|
|
|
|
**Option C: Automated validation**
|
|
|
|
```bash
|
|
# .claude/hooks/PostToolUse.sh
|
|
#!/bin/bash
|
|
if [[ "$TOOL" == "Write" ]] || [[ "$TOOL" == "Edit" ]]; then
|
|
FILE=$(echo "$INPUT" | jq -r '.tool.input.file_path')
|
|
|
|
# Check for pattern violations
|
|
if [[ "$FILE" =~ \.(tsx?)$ ]]; then
|
|
CONTENT=$(cat "$FILE")
|
|
|
|
# Violation: class components in React
|
|
if echo "$CONTENT" | grep -q "class.*extends.*Component"; then
|
|
echo "⚠️ WARNING: Class component detected in $FILE"
|
|
echo " Project uses function components. Consider refactoring."
|
|
fi
|
|
|
|
# Violation: wrong HTTP library
|
|
if echo "$CONTENT" | grep -q "import.*fetch\|window.fetch"; then
|
|
echo "⚠️ WARNING: fetch() detected in $FILE"
|
|
echo " Project uses axios. Use: import axios from 'axios'"
|
|
fi
|
|
fi
|
|
fi
|
|
```
|
|
|
|
---
|
|
|
|
## Rule 7: The Verification Paradox
|
|
|
|
### The Problem
|
|
|
|
When AI succeeds 99% of the time, traditional human verification becomes fragile:
|
|
|
|
**The paradox**: As AI reliability increases, human review quality decreases.
|
|
|
|
- **Vigilance fatigue**: Rare errors (1%) slip through when humans unconsciously trust patterns that usually work
|
|
- **Pattern-trusting behavior**: Manual review degrades as reviewers stop expecting errors
|
|
- **False confidence**: "It worked last 50 times" creates blind spots for the 51st failure
|
|
- **Cognitive load**: Humans aren't optimized to catch 1-in-100 errors consistently
|
|
|
|
**Real incidents**:
|
|
- Payment validation bypassed after 200 successful transactions → fraud on transaction #201
|
|
- Security check skipped because "AI always gets auth right" → credentials leaked
|
|
- Test suite passing 99% → production bug from the 1% case that wasn't tested
|
|
|
|
**Source**: [Alan Engineering Team (Charles Gorintin, Maxime Le Bras), Feb 2026](https://www.linkedin.com/pulse/le-principe-de-la-tour-eiffel-et-ralph-wiggum-maxime-le-bras-psmxe/)
|
|
|
|
### The Rule
|
|
|
|
**Build automated safety systems instead of relying on human vigilance.**
|
|
|
|
When AI reliability crosses ~95%, shift from manual review to automated guardrails.
|
|
|
|
### Anti-Patterns vs Better Approaches
|
|
|
|
| Anti-Pattern | Better Approach |
|
|
|--------------|-----------------|
|
|
| Manual review for every AI output | Automated test suites + selective review |
|
|
| Trust because "it worked last time" | Verification contracts (tests, types, lints) |
|
|
| Human as sole error detector | Guardrails that fail fast (CI/CD gates) |
|
|
| "Spot-check" strategy for high-frequency AI ops | Comprehensive automated validation |
|
|
| Reviewer fatigue = lower standards over time | Consistent automated quality bars |
|
|
|
|
### Implementation
|
|
|
|
**Option A: Automated Guardrail Stack**
|
|
|
|
```yaml
|
|
# .github/workflows/ai-safety.yml
|
|
name: AI Output Validation
|
|
|
|
on: [pull_request]
|
|
|
|
jobs:
|
|
validate:
|
|
runs-on: ubuntu-latest
|
|
steps:
|
|
- name: Type safety
|
|
run: npm run typecheck # Catch type errors AI missed
|
|
|
|
- name: Lint rules
|
|
run: npm run lint # Enforce code standards
|
|
|
|
- name: Unit tests
|
|
run: npm run test # Verify behavior contracts
|
|
|
|
- name: E2E tests
|
|
run: npm run test:e2e # Catch integration failures
|
|
|
|
- name: Security audit
|
|
run: npm audit # Detect vulnerable dependencies
|
|
|
|
- name: Bundle analysis
|
|
run: npm run analyze # Catch bloat/regressions
|
|
|
|
# Human review ONLY after all automation passes
|
|
```
|
|
|
|
**Option B: Verification Contracts in CLAUDE.md**
|
|
|
|
```markdown
|
|
## Verification Protocol
|
|
|
|
### NEVER rely on human review alone
|
|
|
|
**Automated verification required**:
|
|
1. **Type safety**: `npm run typecheck` must pass (zero errors)
|
|
2. **Tests**: `npm run test` coverage ≥ 80% for new code
|
|
3. **Lint**: `npm run lint` must pass (zero warnings)
|
|
4. **Security**: `npm audit` must show zero high/critical vulnerabilities
|
|
5. **Performance**: Lighthouse score ≥ 90 for affected pages
|
|
|
|
**Human review is for**:
|
|
- Architecture decisions
|
|
- UX/design choices
|
|
- Business logic validation
|
|
- Edge cases automation can't catch
|
|
|
|
**Human review is NOT for**:
|
|
- Syntax errors (use linters)
|
|
- Type errors (use TypeScript)
|
|
- Performance regressions (use benchmarks)
|
|
- Security issues (use automated scanners)
|
|
```
|
|
|
|
**Option C: Pre-merge checklist (automated)**
|
|
|
|
```bash
|
|
# .claude/hooks/PreCommit.sh
|
|
#!/bin/bash
|
|
|
|
echo "🔍 Running automated verification (Verification Paradox defense)..."
|
|
|
|
# 1. Type safety
|
|
npm run typecheck || { echo "❌ Type errors detected"; exit 1; }
|
|
|
|
# 2. Lint
|
|
npm run lint || { echo "❌ Lint errors detected"; exit 1; }
|
|
|
|
# 3. Tests
|
|
npm run test || { echo "❌ Tests failing"; exit 1; }
|
|
|
|
# 4. Security
|
|
npm audit --audit-level=high || { echo "❌ Security vulnerabilities detected"; exit 1; }
|
|
|
|
echo "✅ All automated checks passed"
|
|
echo "💡 Human review can now focus on architecture/UX/business logic"
|
|
```
|
|
|
|
### Edge Cases
|
|
|
|
| Scenario | Behavior |
|
|
|----------|----------|
|
|
| AI writes perfect code 99.9% | STILL run automation (paradox applies even at 99.9%) |
|
|
| Time pressure, "just ship it" | Automation is non-negotiable (fast ≠ skip safety) |
|
|
| Trivial changes (typo fix) | Run automation (typos can break prod) |
|
|
| Emergency hotfix | Automation REQUIRED (stress = higher error rate) |
|
|
|
|
### Why This Matters
|
|
|
|
**Old model (pre-AI)**:
|
|
- Code quality = human expertise + careful review
|
|
- Errors caught by experienced developers
|
|
- Review quality stays consistent
|
|
|
|
**New model (AI-assisted)**:
|
|
- AI produces high-quality code 95%+ of the time
|
|
- Humans become complacent ("AI usually gets it right")
|
|
- 5% error rate slips through fatigued review
|
|
|
|
**Solution**: Automate the boring verification (syntax, types, tests), reserve human attention for creative/strategic review.
|
|
|
|
### Integration with Other Rules
|
|
|
|
- **Rule 3 (Feature Completeness)**: Automated tests verify features are actually complete
|
|
- **Rule 2 (Database Safety)**: Migration tests catch destructive operations
|
|
- **Rule 6 (Pattern Following)**: Linters enforce project conventions automatically
|
|
|
|
---
|
|
|
|
## Integration with Existing Workflows
|
|
|
|
### With Plan Mode
|
|
|
|
```bash
|
|
# Before multi-file changes
|
|
/plan
|
|
|
|
# Claude enters read-only mode, explores codebase
|
|
# Identifies patterns, conventions, existing implementations
|
|
# Proposes plan following project conventions
|
|
# You review before execution
|
|
```
|
|
|
|
### With Git Hooks
|
|
|
|
These rules integrate with existing git workflows:
|
|
|
|
```bash
|
|
# .git/hooks/pre-commit
|
|
#!/bin/bash
|
|
|
|
# Run safety checks
|
|
./.claude/hooks/production-safety-check.sh
|
|
|
|
# If blocked, commit fails
|
|
exit $?
|
|
```
|
|
|
|
### With CI/CD
|
|
|
|
Add validation step:
|
|
|
|
```yaml
|
|
# .github/workflows/pr-validation.yml
|
|
name: PR Validation
|
|
on: [pull_request]
|
|
|
|
jobs:
|
|
validate:
|
|
runs-on: ubuntu-latest
|
|
steps:
|
|
- uses: actions/checkout@v3
|
|
|
|
- name: Check for half-assing
|
|
run: |
|
|
if git diff origin/main...HEAD | grep -E "TODO.*implement|Not implemented"; then
|
|
echo "❌ PR contains incomplete features"
|
|
exit 1
|
|
fi
|
|
|
|
- name: Check for unauthorized deps
|
|
run: |
|
|
git diff origin/main...HEAD -- package.json | grep -E '^\+.*"[^"]+": "[^"]+"' || exit 0
|
|
echo "⚠️ New dependencies detected. Review required."
|
|
```
|
|
|
|
---
|
|
|
|
## Troubleshooting
|
|
|
|
### "These rules are too restrictive"
|
|
|
|
**Solution**: Adapt based on your team size and stage.
|
|
|
|
| Team Size | Recommended Rules |
|
|
|-----------|-------------------|
|
|
| 1-2 devs | Rules 1, 3, 6 only |
|
|
| 3-10 devs | Rules 1, 3, 5, 6 |
|
|
| 10+ devs or production | All 6 rules |
|
|
|
|
### "Claude keeps getting blocked"
|
|
|
|
**Solution**: Rules are working! Options:
|
|
|
|
1. **Grant temporary permission**:
|
|
```bash
|
|
# In CLAUDE.md
|
|
## Temporary Override (expires 2026-01-25)
|
|
For this feature only: infrastructure changes allowed.
|
|
Reason: Setting up new microservice.
|
|
```
|
|
|
|
2. **Create exception**:
|
|
```json
|
|
{
|
|
"permissions": {
|
|
"allow": ["Edit(docker-compose.dev.yml)"],
|
|
"deny": ["Edit(docker-compose.prod.yml)"]
|
|
}
|
|
}
|
|
```
|
|
|
|
3. **Review if rule is appropriate**:
|
|
- Solo dev blocking themselves? → Remove rule
|
|
- Team needs flexibility? → Use "ask" instead of "deny"
|
|
|
|
### "How do I enforce rules across team?"
|
|
|
|
**Solution**: Commit to repo, not personal config.
|
|
|
|
```bash
|
|
# Shared team rules
|
|
/project/.claude/settings.json # Committed
|
|
/project/CLAUDE.md # Committed
|
|
|
|
# Personal overrides
|
|
/project/.claude/settings.local.json # Gitignored
|
|
/project/.claude/CLAUDE.md # Gitignored
|
|
```
|
|
|
|
Team settings take precedence, but individuals can opt-in to stricter rules.
|
|
|
|
---
|
|
|
|
## See Also
|
|
|
|
- [Ultimate Guide §9.12 Git Best Practices](./ultimate-guide.md#912-git-best-practices--workflows) — Commit workflow, Plan → Act pattern
|
|
- [Security Hardening Guide](./security-hardening.md) — MCP security, secret protection, hook stack
|
|
- [Data Privacy Guide](./data-privacy.md) — MCP database risks, retention policies
|
|
- [Enterprise AI Governance](./enterprise-governance.md) — Org-level governance: usage charters, MCP approval workflow, guardrail tiers, compliance
|
|
- [Adoption Approaches](../roles/adoption-approaches.md) — Team setup, shared conventions, enterprise rollout
|
|
- [Plan Mode](./ultimate-guide.md#25-plan-mode) — Safe exploration before execution
|
|
- [Permissions System](./ultimate-guide.md#33-settings--permissions) — Allow/deny rules, hooks
|
|
|
|
---
|
|
|
|
## Quick Reference
|
|
|
|
### Rule Severity
|
|
|
|
| Rule | Severity | Breaking this causes |
|
|
|------|----------|----------------------|
|
|
| 1. Port Stability | 🔴 Critical | Team downtime, deployment failures |
|
|
| 2. Database Safety | 🔴 Critical | Data loss, customer impact |
|
|
| 3. Feature Completeness | 🟡 High | Production bugs, tech debt |
|
|
| 4. Infrastructure Lock | 🟠 High | Downtime, security issues |
|
|
| 5. Dependency Safety | 🟡 Medium | Bundle bloat, license issues |
|
|
| 6. Pattern Following | 🟢 Low | Code inconsistency, maintenance burden |
|
|
|
|
### Enforcement Methods
|
|
|
|
| Method | Strictness | Setup Time | Best For |
|
|
|--------|------------|------------|----------|
|
|
| **Permission deny** | 100% (blocks) | 2 min | Critical rules (1, 2, 4) |
|
|
| **Pre-tool hooks** | 100% (blocks) | 10 min | Custom logic, team-specific |
|
|
| **CLAUDE.md rules** | ~70% (Claude respects) | 5 min | Conventions, guidelines |
|
|
| **Post-tool warnings** | ~30% (warns only) | 5 min | Best practices, suggestions |
|
|
| **Git hooks** | 100% (blocks commits) | 15 min | Final safety net before push |
|
|
|
|
### Common Patterns
|
|
|
|
**Allow staging changes, block production**:
|
|
```json
|
|
{
|
|
"permissions": {
|
|
"allow": ["Edit(docker-compose.dev.yml)"],
|
|
"deny": ["Edit(docker-compose.prod.yml)"]
|
|
}
|
|
}
|
|
```
|
|
|
|
**Require confirmation for sensitive ops**:
|
|
```json
|
|
{
|
|
"permissions": {
|
|
"ask": ["Bash(rm -rf *)", "Bash(DROP TABLE *)"]
|
|
}
|
|
}
|
|
```
|
|
|
|
**Temporary override with expiry**:
|
|
```markdown
|
|
## Temporary Override (expires 2026-02-01)
|
|
Infrastructure changes allowed for migration project.
|
|
After expiry: revert to standard rules.
|
|
```
|
|
|
|
---
|
|
|
|
**Version**: 1.0.0
|
|
**Last Updated**: 2026-01-21
|
|
**Changelog**: Initial version based on community-validated production patterns
|