ChatGPT, Claude and Productivity - What Anthropic’s Field Evidence Really Says

ChatGPT, Claude and Productivity - What Anthropic’s Field Evidence Really Says

Anthropic

ChatGPT

AI

19 ene 2026

A diverse group of three professionals in casual attire discuss productivity strategies at a wooden table in a modern office, with laptops and tablets in use, overlooking a city view from large windows, emphasizing collaboration and technology.
A diverse group of three professionals in casual attire discuss productivity strategies at a wooden table in a modern office, with laptops and tablets in use, overlooking a city view from large windows, emphasizing collaboration and technology.

Not sure what to do next with AI?
Assess readiness, risk, and priorities in under an hour.

Not sure what to do next with AI?
Assess readiness, risk, and priorities in under an hour.

➔ Reserva una Consulta

Anthropic analysed ~100k Claude conversations and estimates that AI reduces task time by ~80% on work that otherwise takes ~1.4 hours (~$55 of labour). Adjusting for task reliability trims economy-wide gains to ~+1.0pp annual labour-productivity growth over a decade—still material, but realistic.

Why it matters: Rather than lab prompts, this is field evidence from organic usage. The headline finding: AI cut task time by ~80% on typical work that would otherwise take ~1.4 hours and cost ~$55 in labour.

How to read it: Anthropic also published macro updates—incorporating task reliability halves headline economy-wide gains from +1.8pp to ~+1.0pp annual labour-productivity growth over the next decade. Treat the task-level 80% as an upper bound; your realised gains depend on reliability and fit.

👉 Download & read the research: Estimating AI productivity gains from Claude conversations (Anthropic).
https://www.anthropic.com/research/estimating-productivity-gains

Key findings

  • Big task acceleration: Across a large, mixed workload, Claude reduced completion time by ~80%. Typical tasks were complex and non-trivial (median ~1.4h baseline).

  • Meaningful cost proxies: Mapping tasks to occupations/wages implies ~$55 in human labour saved per task at baseline—directional, not a billing rate.

  • Macro realism matters: When Anthropic adjust for reliability (how often AI actually succeeds), the economy-wide uplift falls from ~+1.8pp to ~+1.0pp annual productivity growth.

What this means for teams (and where to point AI)

  1. Target long, complex tasks first. The dataset skews to substantial work; expect outsized gains on research, analysis, drafting, coding assistance and synthesis.

  2. Instrument reliability, not just speed. Mirror Anthropic’s macro adjustment: track success rate and rework alongside time saved. Your true gains = time saved × success rate.

  3. Build an “AI operating model”. Gains are higher when tasks are well-scoped, grounded in company knowledge and reviewed with lightweight guardrails. (Anthropic’s wider Index work also shows adoption patterns across occupations.)

A 60-day rollout (a Generation Digital playbook)

  • Weeks 1–2: Baseline. Select 3 task types (e.g., customer research synthesis, policy drafting, QA test writing). Capture baseline time, quality, and error rates.

  • Weeks 3–4: Ground & prompts. Ground Claude in your knowledge base; define prompt templates and acceptance criteria per task.

  • Weeks 5–6: Pilot & measure. Ship to a pilot team; track P50/P95 time, success rate, rework minutes, and cost per task.

  • Weeks 7–8: Scale safely. Add reviewer flows, red-team prompts for failure modes, and expand to adjacent tasks.
    (We implement with Notion for SOPs/decisions, Glean for permission-aware grounding, Miro for collaboration workflows, and Asana for intake/metrics.)

FAQs

What exactly did Anthropic measure?
They used Claude to evaluate anonymised user conversations, infer the underlying task, and estimate time/cost baselines via O*NET/BLS, then compared against AI-assisted effort.

Is the “80% faster” number credible for my org?
Treat it as an upper-bound from field data. Anthropic’s macro model halves economy-wide gains when reliability is included—so measure your success rate and rework.

What’s the big-picture impact?
Earlier Index work suggested AI could add ~+1.8pp/yr to US labour productivity growth; with reliability adjustments it’s ~+1.0pp/yr—still a step-change historically.

Where should we apply AI first?
Long, cognitively heavy tasks with clear acceptance criteria—research synthesis, drafting, data analysis, and code assistance—using grounding + human review.

Next Steps

Work with us: Generation Digital will scope your first 3 task types, ground your assistant in enterprise knowledge, and prove value in 60 days.

Anthropic analysed ~100k Claude conversations and estimates that AI reduces task time by ~80% on work that otherwise takes ~1.4 hours (~$55 of labour). Adjusting for task reliability trims economy-wide gains to ~+1.0pp annual labour-productivity growth over a decade—still material, but realistic.

Why it matters: Rather than lab prompts, this is field evidence from organic usage. The headline finding: AI cut task time by ~80% on typical work that would otherwise take ~1.4 hours and cost ~$55 in labour.

How to read it: Anthropic also published macro updates—incorporating task reliability halves headline economy-wide gains from +1.8pp to ~+1.0pp annual labour-productivity growth over the next decade. Treat the task-level 80% as an upper bound; your realised gains depend on reliability and fit.

👉 Download & read the research: Estimating AI productivity gains from Claude conversations (Anthropic).
https://www.anthropic.com/research/estimating-productivity-gains

Key findings

  • Big task acceleration: Across a large, mixed workload, Claude reduced completion time by ~80%. Typical tasks were complex and non-trivial (median ~1.4h baseline).

  • Meaningful cost proxies: Mapping tasks to occupations/wages implies ~$55 in human labour saved per task at baseline—directional, not a billing rate.

  • Macro realism matters: When Anthropic adjust for reliability (how often AI actually succeeds), the economy-wide uplift falls from ~+1.8pp to ~+1.0pp annual productivity growth.

What this means for teams (and where to point AI)

  1. Target long, complex tasks first. The dataset skews to substantial work; expect outsized gains on research, analysis, drafting, coding assistance and synthesis.

  2. Instrument reliability, not just speed. Mirror Anthropic’s macro adjustment: track success rate and rework alongside time saved. Your true gains = time saved × success rate.

  3. Build an “AI operating model”. Gains are higher when tasks are well-scoped, grounded in company knowledge and reviewed with lightweight guardrails. (Anthropic’s wider Index work also shows adoption patterns across occupations.)

A 60-day rollout (a Generation Digital playbook)

  • Weeks 1–2: Baseline. Select 3 task types (e.g., customer research synthesis, policy drafting, QA test writing). Capture baseline time, quality, and error rates.

  • Weeks 3–4: Ground & prompts. Ground Claude in your knowledge base; define prompt templates and acceptance criteria per task.

  • Weeks 5–6: Pilot & measure. Ship to a pilot team; track P50/P95 time, success rate, rework minutes, and cost per task.

  • Weeks 7–8: Scale safely. Add reviewer flows, red-team prompts for failure modes, and expand to adjacent tasks.
    (We implement with Notion for SOPs/decisions, Glean for permission-aware grounding, Miro for collaboration workflows, and Asana for intake/metrics.)

FAQs

What exactly did Anthropic measure?
They used Claude to evaluate anonymised user conversations, infer the underlying task, and estimate time/cost baselines via O*NET/BLS, then compared against AI-assisted effort.

Is the “80% faster” number credible for my org?
Treat it as an upper-bound from field data. Anthropic’s macro model halves economy-wide gains when reliability is included—so measure your success rate and rework.

What’s the big-picture impact?
Earlier Index work suggested AI could add ~+1.8pp/yr to US labour productivity growth; with reliability adjustments it’s ~+1.0pp/yr—still a step-change historically.

Where should we apply AI first?
Long, cognitively heavy tasks with clear acceptance criteria—research synthesis, drafting, data analysis, and code assistance—using grounding + human review.

Next Steps

Work with us: Generation Digital will scope your first 3 task types, ground your assistant in enterprise knowledge, and prove value in 60 days.

Recibe consejos prácticos directamente en tu bandeja de entrada

Al suscribirte, das tu consentimiento para que Generation Digital almacene y procese tus datos de acuerdo con nuestra política de privacidad. Puedes leer la política completa en gend.co/privacy.

¿Listo para obtener el apoyo que su organización necesita para usar la IA con éxito?

Miro Solutions Partner
Asana Platinum Solutions Partner
Notion Platinum Solutions Partner
Glean Certified Partner

¿Listo para obtener el apoyo que su organización necesita para usar la IA con éxito?

Miro Solutions Partner
Asana Platinum Solutions Partner
Notion Platinum Solutions Partner
Glean Certified Partner

Generación
Digital

Oficina en el Reino Unido
33 Queen St,
Londres
EC4R 1AP
Reino Unido

Oficina en Canadá
1 University Ave,
Toronto,
ON M5J 1T1,
Canadá

Oficina NAMER
77 Sands St,
Brooklyn,
NY 11201,
Estados Unidos

Oficina EMEA
Calle Charlemont, Saint Kevin's, Dublín,
D02 VN88,
Irlanda

Oficina en Medio Oriente
6994 Alsharq 3890,
An Narjis,
Riyadh 13343,
Arabia Saudita

UK Fast Growth Index UBS Logo
Financial Times FT 1000 Logo
Febe Growth 100 Logo (Background Removed)

Número de la empresa: 256 9431 77 | Derechos de autor 2026 | Términos y Condiciones | Política de Privacidad

Generación
Digital

Oficina en el Reino Unido
33 Queen St,
Londres
EC4R 1AP
Reino Unido

Oficina en Canadá
1 University Ave,
Toronto,
ON M5J 1T1,
Canadá

Oficina NAMER
77 Sands St,
Brooklyn,
NY 11201,
Estados Unidos

Oficina EMEA
Calle Charlemont, Saint Kevin's, Dublín,
D02 VN88,
Irlanda

Oficina en Medio Oriente
6994 Alsharq 3890,
An Narjis,
Riyadh 13343,
Arabia Saudita

UK Fast Growth Index UBS Logo
Financial Times FT 1000 Logo
Febe Growth 100 Logo (Background Removed)


Número de Empresa: 256 9431 77
Términos y Condiciones
Política de Privacidad
Derechos de Autor 2026