AX(AI Transformation)란 무엇인가요?

AX는 단순한 AI 도입이 아니라, 조직이 일하는 방식 자체를 AI를 전제로 재설계하는 일입니다. SH Consulting은 시스템 납품으로 끝내지 않고 Vibe Coding 교육을 함께 진행해, 컨설턴트가 떠난 뒤에도 임직원이 직접 시스템을 확장·유지보수할 수 있게 합니다.

Vibe Coding이란 무엇인가요?

Claude 같은 LLM과 자연어로 협업하며 필요한 도구를 빠르게 만드는 작업 방식입니다. SH Consulting은 Claude · Next.js · Supabase 스택으로 사내 자동화 · MCP · 콘텐츠 자동화 도구를 며칠 단위로 출시합니다.

어떤 산업·도메인과 일하나요?

헬스케어(약국·제약·피부과), 법무(법무법인), 보험(GA·동의서), 무역(Astros 그룹), 인테리어(RIA), 마케팅, 소상공인 GEO 등 다양합니다. 공통점은 정보 비대칭·반복 업무·문서 처리가 많은 분야입니다.

강의는 무료인가요?

기본 유료입니다. 무료 강의는 봉사 차원에서 지인 한정으로만 진행합니다. '무료 강의' 단독 표현은 정책상 사용하지 않습니다.

AX 방법론의 핵심은 무엇인가요?

원칙은 DRY (Don't Repeat Yourself) — 인간을 반복적인 업무에서 해방. 여정은 DRI — Deploy(지금 바로 성과), Reshape(조직 재설계), Invent(전에 없던 가치 창출). 대표 사례 AstroECCOUNT는 Astros 그룹 회계 업무의 60%를 시스템이 대신합니다.

의뢰는 어떻게 시작하나요?

LinkedIn(https://www.linkedin.com/in/hoshin) 또는 공식 사이트의 Contact 섹션으로 문의해 주세요. AX 컨설팅 · AI 교육 · Vibe Coding 구축 모두 환영합니다.

Why Doesn't ChatGPT Remember Your Conversations?

Why Does the Conversation Seem to Continue?

The secret is in the messages array. The core of an API call is a message list made up of three roles: system, user, and assistant. To build a multi-turn conversation, every question and answer so far must be bundled and resent on every turn. It is a structure where you retell the whole conversation from the beginning to someone you just met, every single time. That is why, as conversations grow longer, history management becomes a core design task in practice, in terms of both context window and cost.

What Do You See When You Open the Tokenizer?

You see Korean's structural disadvantage. 'Annyeong' splits into two tokens and 'Annyeonghaseyo, eotteoke jinaeseyo?' into eight, while 'How are you?' takes only six. Measuring the same text as a token-to-character ratio, Korean lands around 0.47-0.75 while English sits around 0.13-0.26. Even with the same context window size, Korean can hold less content. If you are planning a Korean-language AI service, this is a constraint you must build in from the starting line.

Why Does the Model Plausibly Describe a Paper That Doesn't Exist?

Ask about a fictional 2019 journal paper on Korean sentiment analysis, and the model says it cannot name the authors yet plausibly invents the paper's main contributions. For a next-token predictor, continuing with plausible tokens comes more naturally than admitting it does not know. The knowledge cutoff shows up in the same place: ask for today's date and the exchange rate, and the answer reveals knowledge frozen at October 2023. A service that needs real-time information cannot rely on the model alone; it needs complementary structures like RAG or tool use.

Same Model, Same Input, So Why Do the Answers Diverge?

Because temperature reshapes the probability distribution. Run the same sentence three times at 0.1 and you get nearly identical answers; raise it to 1.8 and a completely different sentence begins every time. You can control this directly: keep it low when consistency matters, as in code generation, and higher when you need ideas. System prompts carry the same weight. Swap a single line among 'friendly science teacher,' 'physics PhD,' and 'Socratic educator,' and the same black-hole question returns an analogy, equations, and counter-questions respectively. One line of prompt redefines the model's entire behavior.

Why Do Non-Developers Need Code Demos Too?

Because hearing an explanation and seeing it on screen carry different weight. Someone who has watched hallucination happen attaches verification steps instead of blaming the model, and someone who has seen the full history retransmitted every turn starts treating context management in long conversations as a design task. This is why SH Consulting insists on showing the tokenizer and live API calls even to practitioners with no programming experience in its AX training. A person who has seen how the machine works once handles the tool at a different depth than one who has only heard about it.