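What is AX (AI Transformation)?

AX is not simply adopting AI; it is redesigning the way an organization works on the premise of AI. SH Consulting does not stop at delivering a system. It runs Vibe Coding training alongside the delivery, so that employees can extend and maintain the system themselves after the consultants leave.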

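What is Vibe Coding?

It is a way of working in which you collaborate in natural language with an LLM such as Claude to build the tools you need quickly. Using a stack of Claude, Next.js, and Supabase, SH Consulting ships in-house automation, MCP, and content automation tools in a matter of days.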

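Which industries and domains do you work with?

A wide range: healthcare (pharmacies, pharmaceuticals, dermatology), legal (law firms), insurance (GA, consent forms), trade (Astros Group), interior design (RIA), marketing, GEO for small businesses, and more. What they have in common is heavy information asymmetry, repetitive work, and document processing.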

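Are the lectures free?

They are paid by default. Free lectures are offered only as volunteer work, limited to personal acquaintances. As a matter of policy, the phrase 'free lecture' is not used on its own.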

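What is the core of the AX methodology?

The principle is DRY (Don't Repeat Yourself): freeing people from repetitive work. The journey is DRI: Deploy (results right now), Reshape (redesigning the organization), and Invent (creating value that did not exist before). In the flagship case, AstroECCOUNT, the system handles 60% of the Astros Group's accounting work.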

How do I start an engagement?

Please get in touch through LinkedIn (https://www.linkedin.com/in/hoshin) or the Contact section of the official site. AX consulting, AI training, and Vibe Coding builds are all welcome.

What Is the Advisor Strategy, and How Does It Differ from opusplan

What Is the Advisor Strategy

The Advisor Strategy is a pattern Anthropic has officially announced. A low-cost model such as Sonnet or Haiku handles execution and calls Opus through the API on demand, only when it judges itself stuck. Opus is not standing by at all times; it appears only when summoned.

The analogy is an organization. An intern handles the day-to-day work and asks a manager only when genuinely stuck. The manager is not seated at the desk the whole time -- they step in, read the situation, and advise only when called.

Structurally, the executor and the advisor share the same context -- conversation history and tool history. Advice the advisor gives is written back into that shared context, and the executor reads it before its next move. Because everything happens inside a single API call, the whole loop stays simple to manage, and the advisor's tokens are reported separately, so cost tracking stays split too.
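The loop described above can be illustrated with a toy simulation. This is only a sketch under stated assumptions: it does not call any real API, and `SharedContext`, `advisor`, `executor_step`, and the word-count token estimate are all hypothetical names invented here to show the shape of the idea (one shared history, advice written back into it, separate token accounting).

```python
from dataclasses import dataclass, field


@dataclass
class SharedContext:
    """Conversation and tool history visible to both roles (hypothetical)."""
    messages: list[str] = field(default_factory=list)
    executor_tokens: int = 0
    advisor_tokens: int = 0  # tracked separately from executor usage


def count_tokens(text: str) -> int:
    # Crude stand-in for a tokenizer: one token per whitespace-separated word.
    return len(text.split())


def advisor(ctx: SharedContext) -> None:
    """Stand-in for the stronger model: reads the history, writes advice back."""
    advice = f"ADVICE: review step {len(ctx.messages)} and narrow the search"
    ctx.messages.append(advice)
    ctx.advisor_tokens += count_tokens(advice)


def executor_step(ctx: SharedContext, task: str, stuck: bool) -> None:
    """Stand-in for the low-cost model: works alone unless it judges itself stuck."""
    if stuck:
        advisor(ctx)  # on-demand call; the advisor is idle otherwise
    last = ctx.messages[-1] if ctx.messages else ""
    note = "following advice" if last.startswith("ADVICE") else "working alone"
    output = f"EXEC: {task} ({note})"
    ctx.messages.append(output)
    ctx.executor_tokens += count_tokens(output)


ctx = SharedContext()
steps = [("parse input", False), ("fix failing test", True), ("write report", False)]
for task, stuck in steps:
    executor_step(ctx, task, stuck)

for message in ctx.messages:
    print(message)
print("executor tokens:", ctx.executor_tokens, "| advisor tokens:", ctx.advisor_tokens)
```

The advisor writes into the same `messages` list the executor reads from, and its usage lands in its own counter, which is what allows the two costs to be reported separately.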

Why This Announcement Matters Now

The cost gap is real. Opus costs 25 dollars per million output tokens; Haiku costs 5 dollars -- a fivefold difference. Running a service that handles thousands of requests a day entirely on Opus carries a real cost burden.
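To make the gap concrete, here is a small worked calculation. The two prices come from the paragraph above; the request volume and tokens per request are hypothetical values chosen only for illustration, and input-token costs are ignored.

```python
# Output-token prices from the text, in US dollars per million tokens.
PRICE_PER_MTOK = {"opus": 25.0, "haiku": 5.0}

# Hypothetical workload, chosen only for illustration.
REQUESTS_PER_DAY = 5_000
OUTPUT_TOKENS_PER_REQUEST = 2_000


def daily_output_cost(model: str) -> float:
    """Daily spend on output tokens alone for the hypothetical workload."""
    tokens = REQUESTS_PER_DAY * OUTPUT_TOKENS_PER_REQUEST
    return tokens / 1_000_000 * PRICE_PER_MTOK[model]


opus_cost = daily_output_cost("opus")
haiku_cost = daily_output_cost("haiku")
print(f"Opus:  ${opus_cost:,.2f} per day")
print(f"Haiku: ${haiku_cost:,.2f} per day")
print(f"Ratio: {opus_cost / haiku_cost:.1f}x")
```

With these made-up volumes the totals come to 250 dollars versus 50 dollars per day; the fivefold ratio holds at any volume because it depends only on the two prices.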

But running everything on the cheapest model alone hurts performance. The point of this strategy is finding the spot that lowers cost while holding performance, or improves both at once.

What Does the Benchmark Show

These are results Anthropic itself published on April 9, 2026, in the official blog post "The advisor strategy: Give Sonnet an intelligence boost with Opus." In that launch, the executor models were Sonnet 4.6 and Haiku 4.5, and the advisor was Opus 4.6.

On the SWE-bench Multilingual benchmark, adding an Opus advisor to Sonnet raised performance by 2.7 percentage points over Sonnet alone, while cutting cost per agentic task by 11.9 percent. Performance rose while cost fell.

On BrowseComp, Haiku alone scored 19.7 percent; adding an Opus advisor more than doubled that to 41.2 percent. That pairing trails Sonnet alone by 29 percentage points in score, but costs 85 percent less per task. The table below summarizes Anthropic's published figures.

Benchmark	Configuration	Performance change	Cost change
SWE-bench Multilingual	Sonnet alone -> Sonnet + Opus advisor	+2.7pp	-11.9%
BrowseComp	Haiku alone -> Haiku + Opus advisor	19.7% -> 41.2%	-
BrowseComp	Haiku + advisor vs Sonnet alone	-29pp	-85%
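
The BrowseComp rows can be restated with a few lines of arithmetic. The inputs are the figures quoted above; the variable names are only illustrative, and nothing here goes beyond rearranging those published numbers.

```python
# Figures as quoted in the text above.
haiku_alone = 19.7            # BrowseComp score, percent
haiku_with_advisor = 41.2     # BrowseComp score, percent
cost_saving_vs_sonnet = 0.85  # Haiku + advisor costs 85% less per task than Sonnet alone

gain_pp = round(haiku_with_advisor - haiku_alone, 1)      # absolute gain, percentage points
gain_factor = round(haiku_with_advisor / haiku_alone, 2)  # relative gain
relative_cost = round(1 - cost_saving_vs_sonnet, 2)       # share of Sonnet-alone cost per task

print(f"Gain: +{gain_pp}pp ({gain_factor}x the Haiku-alone score)")
print(f"Cost per task: {relative_cost:.0%} of Sonnet alone")
```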

Comparing Four Strategies

The practical choices break down into four: running Opus alone from start to finish, the Advisor Strategy just described, opusplan on Claude Code CLI, and leaving a low-cost executor alone without an advisor.

What actually separates the four isn't model tier -- it's who holds the wheel on judgment, and whether re-verification happens mid-execution.

Strategy	Who drives judgment	When the stronger model steps in	Re-check mid-execution	Best fit
Opus alone	Opus	Always	Not needed (already top tier)	Ambiguous, complex, one-off judgment
Advisor Strategy	Executor model	On demand, only when stuck	Yes, every call	Repeated work at scale, improving cost and performance together
opusplan (CLI)	Plan = Opus / Execution = Sonnet	Once, at planning	No	Work where a precise plan is enough
Executor alone	Executor model	Never	No	Simple tasks; risk of runaway trial-and-error once complexity rises

How Is opusplan Different from the Advisor Strategy

The way to apply the same idea inside Claude Code CLI is /model opusplan. In plan mode, Opus handles the design; once execution mode starts, Sonnet automatically takes over.

The decisive difference is this: with the Advisor Strategy, Opus can be called back in at any point execution stalls, and each call has the advisor re-review the executor's work. With opusplan, Opus steps in once at the planning stage and does not look again until execution finishes. The re-verification loop is structurally absent.
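
The contrast can be sketched as a schedule of when the stronger model is consulted. This is a toy model, not how Claude Code or the API is implemented: `opus_interventions`, the step numbering, and the `stuck` set are hypothetical.

```python
def opus_interventions(strategy: str, stuck_steps: set[int], total_steps: int) -> list[int]:
    """Return the step indices at which the stronger model is consulted.

    Step 0 stands for planning; steps 1..total_steps are execution.
    """
    if strategy == "opusplan":
        return [0]  # once, at planning; never again during execution
    if strategy == "advisor":
        # Only at execution steps where the executor reports being stuck.
        return sorted(s for s in stuck_steps if 1 <= s <= total_steps)
    raise ValueError(f"unknown strategy: {strategy}")


stuck = {3, 7}  # hypothetical steps where the executor stalls
print(opus_interventions("opusplan", stuck, total_steps=8))  # [0]
print(opus_interventions("advisor", stuck, total_steps=8))   # [3, 7]
```

Under opusplan the stalls at steps 3 and 7 get no second look from Opus, which is the missing re-verification loop described above.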

So opusplan is enough when the plan is already tight and the risk of drifting off course during execution is low. When the required judgment keeps shifting mid-execution, the missing re-check remains a real risk.

How Should You Choose in Practice

Two questions decide it. First, is the task within what a low-cost model can actually handle? Second, is it a one-off, or a long multi-step job where work accumulates?

If ambiguous, complex judgment keeps coming up, it's safer to let Opus hold the wheel the whole way. If the task is well-defined labor at scale over a long run, the Advisor Strategy can cut cost substantially. If all that's needed is one precise plan, opusplan is enough.
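
These rules of thumb can be written down as a small decision function. It is a sketch only: the three boolean inputs and the order in which they are checked are assumptions made here for illustration, not a rule published by Anthropic.

```python
def choose_strategy(
    ambiguous_judgment: bool,
    one_plan_suffices: bool,
    long_running_at_scale: bool,
) -> str:
    """Map the rules of thumb in the text to a strategy name (illustrative only)."""
    if ambiguous_judgment:
        return "Opus alone"        # let the strongest model hold the wheel
    if one_plan_suffices:
        return "opusplan"          # plan once with Opus, execute with Sonnet
    if long_running_at_scale:
        return "Advisor Strategy"  # low-cost executor, Opus on demand
    return "Executor alone"        # simple task, no advisor needed


print(choose_strategy(ambiguous_judgment=True, one_plan_suffices=False, long_running_at_scale=True))
print(choose_strategy(ambiguous_judgment=False, one_plan_suffices=True, long_running_at_scale=False))
print(choose_strategy(ambiguous_judgment=False, one_plan_suffices=False, long_running_at_scale=True))
```

Checking for ambiguity first reflects the advice that Opus should drive whenever complex judgment keeps coming up, regardless of how long the job is.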

What to avoid is leaving a low-cost executor alone on a complex task with no advisor. It doesn't know when to stop on its own, trial and error can spiral, and the paradox is that total cost can end up higher than running Opus alone.
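
A rough break-even calculation shows how that paradox can arise. It uses the Opus and Haiku output prices quoted earlier; the token counts and the trial-and-error multipliers are hypothetical, and input tokens are ignored.

```python
OPUS_PRICE = 25.0   # dollars per million output tokens (from the text)
HAIKU_PRICE = 5.0   # dollars per million output tokens (from the text)


def output_cost(tokens: int, price_per_mtok: float) -> float:
    """Cost in dollars of producing `tokens` output tokens."""
    return tokens / 1_000_000 * price_per_mtok


opus_tokens = 40_000  # hypothetical: Opus finishes the task in one pass
opus_cost = output_cost(opus_tokens, OPUS_PRICE)

# Trial and error multiplies the executor's output volume.
break_even = OPUS_PRICE / HAIKU_PRICE
results = {}
for multiplier in (2, 5, 8):  # hypothetical amounts of rework
    haiku_cost = output_cost(opus_tokens * multiplier, HAIKU_PRICE)
    results[multiplier] = haiku_cost
    verdict = "cheaper than" if haiku_cost < opus_cost else "at or above"
    print(f"{multiplier}x tokens: ${haiku_cost:.2f} ({verdict} Opus at ${opus_cost:.2f})")
print(f"Break-even multiplier: {break_even:.0f}x")
```

On output tokens alone, the low-cost executor stops being cheaper once its trial and error burns roughly five times the tokens Opus would have needed.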

What This Means from an AX Perspective

This structure isn't really new technology -- it's a familiar organizational design. A practitioner asks a superior when stuck, and the superior isn't standing by at all times, stepping in only when called. It's the same delegation structure organizations have used for decades, moved into an AI pipeline.

What's different is that the boundary of that delegation, and the rule for when to call, can now be written into code. How many times it can be called, under what condition, and how to track the cost of each call -- all of it becomes a configurable setting.
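
As a minimal sketch of what "written into code" could look like, the settings named above (call limit, call condition, per-call cost tracking) fit in one small policy object. `AdvisorPolicy`, its fields, and the 0.05 dollar per-call cost are hypothetical; a real deployment would map them onto whatever controls the API actually exposes.

```python
from dataclasses import dataclass, field


@dataclass
class AdvisorPolicy:
    """Hypothetical delegation rule: when the advisor may be called, and at what cost."""
    max_calls: int = 3            # how many times the advisor may be called
    min_failed_attempts: int = 2  # condition: call only after this many failures
    calls_made: int = 0
    cost_log: list[float] = field(default_factory=list)  # dollars per call

    def should_call(self, failed_attempts: int) -> bool:
        within_budget = self.calls_made < self.max_calls
        stuck_enough = failed_attempts >= self.min_failed_attempts
        return within_budget and stuck_enough

    def record_call(self, cost_usd: float) -> None:
        self.calls_made += 1
        self.cost_log.append(cost_usd)


policy = AdvisorPolicy(max_calls=2)
decisions = []
for failed_attempts in [0, 2, 3, 4]:
    called = policy.should_call(failed_attempts)
    if called:
        policy.record_call(cost_usd=0.05)  # hypothetical per-call cost
    decisions.append(called)

print(decisions)                # [False, True, True, False]
print(len(policy.cost_log))     # 2
```

The last request is refused even though the executor is stuck, because the call budget is spent; that boundary is exactly the kind of delegation rule an organization can now set explicitly.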

For any organization evaluating AI adoption, the question should shift from 'which model do we use' to 'where do we place this judgment.' Role design, more than model choice, is what now decides cost and quality together.