SH Consulting
Skills·Skill

Harness GC (Garbage Collection)

4 에이전트 병렬 분석으로 코드베이스 엔트로피를 GC하고 12원칙 하네스 성숙도(L1~L5)를 평가.

#harness#gc#audit#maturity#agents#knip#remediate
Install

설치

/harness-gc [--full|--doc-only|--arch-only|--quality-only|--quick|--remediate]
Requirements
  • Claude Code (CLI)
  • 프로젝트에 AGENTS.md 또는 agent.md (하네스 진입 문서)
  • (권장) Knip 설치 + knip.json
  • (선택) docs/harness/principles.md + maturity-framework.md + fix-catalog.md
Capabilities

주요 기능

  • 013 감사 에이전트 병렬 — doc-auditor (문서 신선도) · arch-inspector (아키텍처 드리프트) · quality-scorer (도메인 품질 + 12원칙)
  • 02gc-synthesizer 결과 통합 + 종합 보고서 + 자동 수정 분류 (즉시/준자동/수동)
  • 0312원칙 하네스 성숙도 (L1 Initial → L5 Optimizing) 정량 측정
  • 04도메인별 품질 등급 A~F (코드·문서·데이터·UI·문체·GEO/AEO 6축)
  • 05Knip strict 기반 미사용 코드(파일·export·의존성·타입) 권위 탐지
  • 06GC 히스토리 추적 (gc-history.md) + 반복 드리프트 감지 + ⚠️ CHRONIC 태깅
  • 07예방 프로토콜 — verify-docs 스크립트 + 팩트 마커(`<!-- FACT:key=value -->`) 자동 생성
  • 08Remediate 루프 — Sprint Contract 협상(quality-scorer ↔ 수정 에이전트) → Auto/Semi 액션 적용 → 재진단
  • 09하네스 자체 메타 검증 — 에이전트가 3회 연속 무이슈면 💡 SIMPLIFY 후보 태깅
  • 10회의적 채점 원칙 (Skeptical Evaluation) — 'Works'와 'Works well' 구분, 0점에서 가산
About

About

harness-gc applies OpenAI's 'Harness Engineering' garbage-collection model to your codebase. Three audit agents run in parallel — doc-auditor (doc freshness vs. reality), arch-inspector (layer drift, cycles, dead code), quality-scorer (domain quality A–F + 12-principle maturity L1–L5) — then gc-synthesizer consolidates findings into a single report with auto-fix, semi-auto, and manual buckets. Maturity scoring follows the 12 harness principles (Entry Point, Map-not-Manual, Invariant Enforcement, Convention, Progressive Disclosure, Layered Arch + Coverage, GC Automation, Observability, Knowledge in Repo, Reproducibility, Modularity, Self-Documentation). Each principle is anchored with 0-10 few-shot rubrics; the evaluator stays skeptical by default ('prove it's good' instead of 'prove it's bad') and adds rather than subtracts from zero. Knip strict is the authoritative source for dead code so the report doesn't disagree with the linter. The `--remediate` mode closes the loop: quality-scorer hands the fixer agent concrete diagnostic findings, the two negotiate a Sprint Contract with expected score deltas, Auto/Semi actions are applied, and the suite re-audits. If actual deltas underperform expectations twice in a row, the loop reports `approach-exhausted` rather than wasting another iteration on a strategy that isn't working. A harness self-audit (Part F) flags agents that haven't found a real issue in three consecutive runs — a `💡 SIMPLIFY` tag suggesting that part of the harness may have outlived its usefulness as the underlying model improves.