AI Trend2026.05.25

Claude 4 Can Think for 10 Hours Straight — What That Means for AI Work

Claude 4, 10시간 연속 추론이 가능해졌어요 — AI 업무의 새 기준

English

한국어

Hook

An AI That Thinks Overnight

밤새 생각하는 AI

Within a year, you may ask an AI to spend an entire weekend reading every contract your company has ever signed — and come back Monday with a ranked list of risks your legal team had missed. That future moved significantly closer on May 24th, when Anthropic Claude 4 with an extended thinking mode that lets the model reason continuously for up to ten hours.

1년 안에 여러분은 AI에게 회사가 지금까지 체결한 모든 계약서를 주말 내내 읽어달라고 요청할 수 있을지도 몰라요 — 그리고 월요일 아침에는 법무팀이 놓쳤던 위험 항목을 순위별로 정리해서 받을 수 있을지도요. 그 미래가 훨씬 가까워진 건 5월 24일이에요. Anthropic이 모델이 최대 10시간 연속으로 추론할 수 있는 extended thinking 모드를 탑재한 Claude 4를 출시했거든요.

What's new

What Claude 4 Actually Does

Claude 4가 실제로 달라진 점

Previous AI models — including earlier Claude versions — could think deeply for a few minutes before producing a response, which was already an improvement over instant answers. Claude 4's extended thinking mode removes that ceiling — it can work through multi-day research projects, review an entire , or hundreds of scientific papers in a session. Extended thinking is opt-in and priced separately — usage is in "thinking tokens" rather than standard output tokens, meaning you pay only when you activate the deeper . On performance, Claude 4 achieved top scores on FrontierMath — a frontier benchmark of genuinely unsolved mathematical problems — and on GPQA Diamond, which tests PhD-level science across biology, chemistry, and physics.

이전 AI 추론 모델들은, 이전 버전의 Claude를 포함해서, 응답을 내기 전 몇 분 정도 깊게 생각할 수 있었어요. 그것만으로도 즉각적인 답변보다는 나은 개선이었지만요. Claude 4의 extended thinking 모드는 그 한계를 없앴어요. 며칠 걸리는 리서치 프로젝트, 전체 코드베이스 검토, 수백 편의 논문 합성을 단 한 번의 요청으로 처리할 수 있거든요. Extended thinking은 opt-in 방식이고 별도로 과금돼요. 사용량은 일반 output 토큰이 아닌 "thinking 토큰"으로 측정되기 때문에, 더 깊은 추론을 활성화할 때만 요금이 발생해요. 성능 측면에서 Claude 4는 FrontierMath(실제 미해결 수학 문제로 구성된 frontier benchmark)와 생물·화학·물리 분야의 박사급 과학 지식을 평가하는 GPQA Diamond에서 최고 점수를 기록했어요.

Why it matters

From Tool to Independent Analyst

도구에서 독립적 분석가로

The difference between "a few minutes of thinking" and "ten hours of thinking" is not just quantitative — it crosses a where AI moves from answering questions to conducting the kind of investigation humans normally spend days on. Consider the industries where this matters most: a lawyer who currently spends three days reviewing transaction documents, a financial analyst building a due diligence report from dozens of filings, or a researcher mapping the literature on a new drug target. For those professionals, extended thinking is not an improvement — it is a potential of what counts as a day's work.

"몇 분의 추론"과 "10시간의 추론" 사이의 차이는 단순한 양의 문제가 아니에요. 이는 AI가 질문에 답하는 수준을 넘어, 사람이 며칠을 써야 하는 지속적인 조사를 수행하는 영역으로 넘어가는 임계점이에요. 이게 가장 중요하게 작용하는 분야들을 떠올려보세요. 현재 거래 서류 검토에 사흘을 쓰는 변호사, 수십 개의 공시 자료로 실사 보고서를 만드는 금융 애널리스트, 새로운 신약 타겟에 대한 연구 문헌을 정리하는 연구자들이죠. 그런 전문가들에게 extended thinking은 점진적인 개선이 아니에요. 하루치 업무가 무엇인지를 다시 정의할 수도 있는 변화예요.

Korea angle

What This Means in Korea

한국에서 이 변화가 갖는 의미

Claude's user base in Korea has been growing steadily as companies move beyond ChatGPT to explore models with stronger multilingual , and Anthropic has been incrementally improving Korean language quality across model versions. Korean knowledge workers in law, consulting, finance, and research are among the most likely early adopters — these are roles where deep-dive analysis is time-consuming and on-demand compute can directly substitute for hours of human effort. One practical note on cost: extended thinking sessions can thinking tokens quickly on complex tasks, so Korean teams evaluating Claude 4 should build a budget estimate into any pilot before scaling up.

국내 기업들이 ChatGPT를 넘어 다국어 추론 능력이 더 뛰어난 모델을 찾기 시작하면서, 한국에서의 Claude 사용자 기반은 꾸준히 성장하고 있어요. Anthropic도 버전을 거치며 한국어 품질을 지속적으로 개선해 왔고요. 법률, 컨설팅, 금융, 연구 분야의 한국 지식 노동자들이 가장 빠르게 도입할 가능성이 높아요. 이 직군들은 심층 분석에 시간이 많이 들고, on-demand compute가 사람의 수 시간 노력을 직접적으로 대체할 수 있거든요. 비용에 대한 실질적인 조언도 필요해요. 복잡한 작업에서 extended thinking 세션은 thinking 토큰을 빠르게 소모할 수 있어서, Claude 4를 도입하려는 국내 팀이라면 규모를 키우기 전에 파일럿 단계에서 예산 예측부터 해두는 게 좋아요.

What you can do

Starting With Extended Thinking

Extended Thinking 시작하는 법

The best use cases to start with are tasks that have a clearly defined scope and a verifiable output — "summarize this set of documents" or "audit this for security issues" rather than open-ended questions that could spiral indefinitely. It is also worth knowing when not to use extended thinking — for quick factual lookups, draft emails, or simple Q&A, the standard mode is faster and cheaper. Think of extended thinking as a specialist you hire for the deep work, not the intern you ask to check a quick fact — matching the tool to the task determines whether you get value or just a large invoice. Before you hand off your next complex analysis to an AI, try activating Extended Thinking in Claude 4 and giving it a full hour — you may find that what used to take your team two days now arrives in your inbox before lunch.

처음 시도하기 좋은 작업은 명확하게 정의된 범위와 검증 가능한 결과물이 있는 것들이에요. 끝없이 이어질 수 있는 열린 질문보다는 "이 문서들을 요약해줘"나 "이 코드베이스의 보안 취약점을 검토해줘" 같은 작업이요. 언제 쓰지 말아야 하는지 아는 것도 중요해요. 빠른 사실 확인, 이메일 초안 작성, 간단한 질답에는 기본 모드가 더 빠르고 저렴하거든요. Extended thinking을 깊이 있는 작업을 위해 고용하는 전문가라고 생각하세요, 빠른 확인을 부탁하는 인턴이 아니라요. 도구와 작업을 얼마나 잘 맞추느냐가 가치를 얻는지, 아니면 큰 청구서만 받는지를 결정해요. 다음에 복잡한 분석 작업을 AI에 맡기기 전에, Claude 4에서 Extended Thinking을 활성화하고 충분히 시간을 줘보세요. 여러분 팀이 이틀 걸리던 작업이 점심 전에 메일함에 도착할 수도 있어요.

KEY TERMS

reasoningn.추론, 사고 과정, (AI에서) 입력을 받아 논리적으로 답을 도출하는 연산 과정
synthesizev.통합하다, 종합하다, 여러 정보·자료를 하나의 결론으로 통합하다
thresholdn.임계점, 한계점, 어떤 변화나 효과가 시작되는 기준점
incrementaladj.점진적인, 단계적인, 한 번에 크게 바뀌는 것이 아니라 조금씩 늘어나는
sustainedadj.지속적인, 끊임없는, 일정 기간 동안 중단 없이 유지되는
due diligencephr.실사, 사전 조사, 투자·계약 전 대상을 철저히 검토하는 과정
accumulatev.축적하다, 쌓아올리다, 점점 늘어나 특정 수준에 도달하다
codebasen.코드베이스, 소프트웨어 프로젝트를 구성하는 전체 소스 코드의 집합
meteredv. (past part.)계량되다, 측정·과금되다, 사용량을 단위별로 측정하여 요금을 부과하는 방식
redefinitionn.재정의, 기존의 개념이나 기준을 새롭게 바꾸는 것

0 / 16 pairs explored