How do I enable dynamic filtering in the Claude API?

Use the web_search_20260209 (and/or web_fetch_20260209) tool version and include the required beta header. Dynamic filtering requires the code execution tool to be enabled.

Does code execution add extra cost when used for dynamic filtering?

Anthropic states that code execution is free when used with web search or web fetch tools, with no additional tool-call charges beyond standard token costs.

Dynamic Filtering in Claude Web Search: Faster, Cheaper

Q: How does dynamic filtering improve search accuracy?

Dynamic filtering lets Claude write and execute code to post-process web search and fetch results before they enter the context window, removing irrelevant content so the model can focus on what answers the query.

Q: What does “24% fewer input tokens” mean in practice?

It means less raw web content is pulled into the model’s context because filtering happens first, which typically reduces token overhead for search-heavy workflows.

Q: Is dynamic filtering available to everyone?

Dynamic filtering is available via the newer Claude API web search and web fetch tool versions, and works with supported models such as Opus 4.6 and Sonnet 4.6 when enabled via the documented tool versions and headers.

Claude

Anthropic

17 févr. 2026

A futuristic digital visualization of dynamic filtering in Claude web search, featuring vibrant data streams and network connections, illustrating the concept of faster, cheaper search technology.

Pas sûr de quoi faire ensuite avec l'IA?
Évaluez la préparation, les risques et les priorités en moins d'une heure.

➔ Téléchargez notre kit de préparation à l'IA gratuit

Dynamic filtering in Claude’s web search lets the model write and execute code to trim search results before they enter the context window. That means Claude reads less irrelevant content, which can improve search accuracy (about 11% on benchmarks) while using fewer input tokens (about 24%). It’s available via the Claude API web search and web fetch tools.

Web search is one of the quickest ways to make an AI assistant more useful—and one of the easiest ways to make it waste tokens. Traditional “browse then summarise” flows pull a lot of raw HTML into the context window, and much of it isn’t relevant to the question you’re trying to answer.

Anthropic’s latest update tackles that problem directly. Claude’s web search and web fetch tools can now automatically write and execute code to post-process results before they enter the context window, keeping only what matters and discarding the rest. Anthropic reports this improves performance on agentic search benchmarks by ~11% on average, while using ~24% fewer input tokens.

Why this matters now

If you’re building anything beyond a toy demo—research agents, citation checkers, “answer from docs” assistants, or competitive intelligence bots—you’ll recognise the trade-offs:

Too much context can dilute relevance and increase hallucination risk.
Too many tokens can make an otherwise-good workflow too expensive to run at scale.
Messy web pages (navigation, cookie banners, unrelated sections) often swamp the useful parts.

Dynamic filtering is a pragmatic solution: treat web results like raw data, run a quick processing step, then pass only the cleaned output into Claude’s reasoning step.

What’s actually new: dynamic filtering inside web search and web fetch

In the updated flow, Claude can:

Run a web search (or fetch a page).
Generate a small script to extract the relevant parts (for example: headings, pricing tables, a specific paragraph, or citation candidates).
Execute that script in a sandboxed code execution environment.
Only send the filtered output into the context window for final reasoning and writing.

Anthropic describes this as filtering “before loading results into context”, rather than asking the model to reason over full HTML.

The headline results

Anthropic evaluated web search on Sonnet 4.6 and Opus 4.6 with and without dynamic filtering (and no other tools enabled). Across BrowseComp and DeepsearchQA, dynamic filtering improved performance by an average of 11%, while using 24% fewer input tokens.

They also share DeepsearchQA F1 improvements (for example, Sonnet 4.6 moving from 52.6% to 59.4%, and Opus 4.6 from 69.8% to 77.3%).

Practical note: token costs can still vary based on how much code Claude writes to do the filtering, so it’s worth testing against your own real queries.

How to enable dynamic filtering via the Claude API

Dynamic filtering is supported in the latest web search tool version: web_search_20260209. The Claude API docs explicitly note that this version supports dynamic filtering for Opus 4.6 and Sonnet 4.6, and that Claude can write and execute code to filter results before they reach the context window.

What you’ll need:

Tool version: web_search_20260209 (and/or web_fetch_20260209)
Beta header: anthropic-beta: code-execution-web-tools-2026-02-09
Code execution tool enabled, because dynamic filtering depends on it.

Cost and governance considerations you should know

One helpful detail: Anthropic states that code execution is free when used with web search or web fetch—i.e., there are no additional charges for code execution tool calls beyond standard input/output token costs when those web tools are included.

There are also important policy nuances:

The web search tool page notes dynamic filtering and includes ZDR eligibility (where applicable).
The code execution tool page notes code execution capabilities and that some arrangements may differ for retention depending on feature use—so it’s worth aligning with your organisation’s security posture early.

When dynamic filtering delivers the most value

You’ll get the biggest uplift when the “raw web” is noisy or large, for example:

Technical documentation searches where you only need a specific section or parameter definition.
Literature review and citation verification, where you want to extract and compare multiple sources quickly.
Multi-step research (the kind that usually explodes token usage because every step drags more pages into context).

If your current approach involves “search → fetch 5 pages → dump HTML into context”, dynamic filtering is a straightforward upgrade.

A practical implementation pattern (what to build)

Here’s a clean pattern we see working well in production:

Write prompts that declare the extraction goal
Example: “Find the pricing tiers and extract only the pricing table + any footnotes about limits.”
Let the tools do the messy work
Use web search + web fetch with dynamic filtering so Claude processes results outside the context window first.
Return structured output
Ask Claude to return a table (tiers, limits, sources) or a concise bullet list with citations.
Add an evaluation loop
Measure accuracy on your own queries, because your domain will differ from benchmarks. Anthropic recommends evaluating against representative production queries.

How this fits into the bigger “efficient agents” direction

Dynamic filtering is part of a wider shift: keep large, messy intermediate artefacts out of the model’s context window, and only pass through what’s needed to answer the user.

Anthropic’s engineering write-ups on code execution with MCP and advanced tool use describe the same core principle: reduce context bloat, improve orchestration, and let code handle loops, filtering, and transformation where it makes sense.

Summary and next steps

Dynamic filtering makes Claude’s web search more practical at scale: cleaner context, fewer tokens, and better accuracy on search-heavy workflows.

Next steps for teams:

Audit one or two of your most token-hungry search workflows.
Enable web_search_20260209 + the beta header.
Compare accuracy, cost, and latency before/after.
If you want help designing an evaluation plan or rolling this into a governed internal tool, Generation Digital can support implementation and adoption.

FAQ

Q1: How does dynamic filtering improve search accuracy?
By letting Claude post-process search results with code execution before the content enters the context window. That reduces irrelevant text and helps the model focus on what actually answers the question.

Q2: What does “24% fewer input tokens” mean in practice?
It means the model is typically given less raw web content to read because filtering happens first—reducing context bloat and, often, overall cost for search-heavy tasks.

Q3: Is dynamic filtering available to everyone?
It’s available via the Claude API’s newer web search/web fetch tool versions, and Anthropic states it’s enabled by default when using those newer tools with Opus 4.6 and Sonnet 4.6 (subject to tool version and headers).

Q4: How do I enable it?
Use web_search_20260209 (and/or web_fetch_20260209) and include the beta header code-execution-web-tools-2026-02-09. Dynamic filtering requires the code execution tool.

Q5: Does code execution add extra cost?
Anthropic notes that code execution is free when used with web search or web fetch tools (no extra tool-call charges beyond standard token costs).

Dynamic filtering in Claude’s web search lets the model write and execute code to trim search results before they enter the context window. That means Claude reads less irrelevant content, which can improve search accuracy (about 11% on benchmarks) while using fewer input tokens (about 24%). It’s available via the Claude API web search and web fetch tools.

Why this matters now

If you’re building anything beyond a toy demo—research agents, citation checkers, “answer from docs” assistants, or competitive intelligence bots—you’ll recognise the trade-offs:

Too much context can dilute relevance and increase hallucination risk.
Too many tokens can make an otherwise-good workflow too expensive to run at scale.
Messy web pages (navigation, cookie banners, unrelated sections) often swamp the useful parts.

Dynamic filtering is a pragmatic solution: treat web results like raw data, run a quick processing step, then pass only the cleaned output into Claude’s reasoning step.

What’s actually new: dynamic filtering inside web search and web fetch

In the updated flow, Claude can:

Run a web search (or fetch a page).
Generate a small script to extract the relevant parts (for example: headings, pricing tables, a specific paragraph, or citation candidates).
Execute that script in a sandboxed code execution environment.
Only send the filtered output into the context window for final reasoning and writing.

Anthropic describes this as filtering “before loading results into context”, rather than asking the model to reason over full HTML.

The headline results

They also share DeepsearchQA F1 improvements (for example, Sonnet 4.6 moving from 52.6% to 59.4%, and Opus 4.6 from 69.8% to 77.3%).

Practical note: token costs can still vary based on how much code Claude writes to do the filtering, so it’s worth testing against your own real queries.

How to enable dynamic filtering via the Claude API

What you’ll need:

Tool version: web_search_20260209 (and/or web_fetch_20260209)
Beta header: anthropic-beta: code-execution-web-tools-2026-02-09
Code execution tool enabled, because dynamic filtering depends on it.

Cost and governance considerations you should know

There are also important policy nuances:

The web search tool page notes dynamic filtering and includes ZDR eligibility (where applicable).
The code execution tool page notes code execution capabilities and that some arrangements may differ for retention depending on feature use—so it’s worth aligning with your organisation’s security posture early.

When dynamic filtering delivers the most value

You’ll get the biggest uplift when the “raw web” is noisy or large, for example:

Technical documentation searches where you only need a specific section or parameter definition.
Literature review and citation verification, where you want to extract and compare multiple sources quickly.
Multi-step research (the kind that usually explodes token usage because every step drags more pages into context).

If your current approach involves “search → fetch 5 pages → dump HTML into context”, dynamic filtering is a straightforward upgrade.

A practical implementation pattern (what to build)

Here’s a clean pattern we see working well in production:

Write prompts that declare the extraction goal
Example: “Find the pricing tiers and extract only the pricing table + any footnotes about limits.”
Let the tools do the messy work
Use web search + web fetch with dynamic filtering so Claude processes results outside the context window first.
Return structured output
Ask Claude to return a table (tiers, limits, sources) or a concise bullet list with citations.
Add an evaluation loop
Measure accuracy on your own queries, because your domain will differ from benchmarks. Anthropic recommends evaluating against representative production queries.

How this fits into the bigger “efficient agents” direction

Dynamic filtering is part of a wider shift: keep large, messy intermediate artefacts out of the model’s context window, and only pass through what’s needed to answer the user.

Summary and next steps

Dynamic filtering makes Claude’s web search more practical at scale: cleaner context, fewer tokens, and better accuracy on search-heavy workflows.

Next steps for teams:

Audit one or two of your most token-hungry search workflows.
Enable web_search_20260209 + the beta header.
Compare accuracy, cost, and latency before/after.
If you want help designing an evaluation plan or rolling this into a governed internal tool, Generation Digital can support implementation and adoption.

FAQ

Q5: Does code execution add extra cost?
Anthropic notes that code execution is free when used with web search or web fetch tools (no extra tool-call charges beyond standard token costs).

‹ The Knowledge Paradox: Fix Knowledge Management to Scale AI in Europe

Pentagon Reviews Anthropic Deal: AI Safeguards Clash ›

Recevez chaque semaine des nouvelles et des conseils sur l'IA directement dans votre boîte de réception

En vous abonnant, vous consentez à ce que Génération Numérique stocke et traite vos informations conformément à notre politique de confidentialité. Vous pouvez lire la politique complète sur gend.co/privacy.

In a modern office with large windows overlooking a European city street, three business professionals engage in a discussion around a table with a laptop displaying a business analytics dashboard, capturing the essence of "Mistral Buys Koyeb: Europe’s AI Cloud Ambitions Accelerate."

Mistral Buys Koyeb: Europe’s AI Cloud Ambitions Accelerate

A group of professionals engaged in a discussion around a laptop screen displaying a report, set in a modern office with large windows and plants, illustrating collaboration in enterprise AI and digital strategy transformation.

Perplexity’s Subscription Pivot: What It Means for Enterprise AI

A group of four professionals sit around a wooden conference table with laptops and documents, discussing a digital diagram on a large screen and a whiteboard reading "AI Agents + Guardrails," with a cityscape visible through the window, highlighting innovation and collaboration in AI technology.

OpenAI + OpenClaw: Why Agents Are Replacing Chatbots

Mistral Buys Koyeb: Europe’s AI Cloud Ambitions Accelerate

Perplexity’s Subscription Pivot: What It Means for Enterprise AI

OpenAI + OpenClaw: Why Agents Are Replacing Chatbots

Ateliers et webinaires à venir

A diverse group of professionals collaborating around a table in a bright, modern office setting.

Clarté opérationnelle à grande échelle - Asana

Webinaire Virtuel
Mercredi 25 février 2026
En ligne

Collaborez avec des coéquipiers IA - Asana

Atelier en personne
Jeudi 26 février 2026
London, UK

De l'idée au prototype - L'IA dans Miro

Webinaire virtuel
Mercredi 18 février 2026
En ligne

Génération
Numérique

Miro
Asana
Notion
Glean

Quel outil d'IA? Quiz

Le chemin vers le succès avec l'IA

À propos de Generation Digital

Contact

Bureau du Royaume-Uni

Génération Numérique Ltée
33 rue Queen,
Londres
EC4R 1AP
Royaume-Uni

Bureau au Canada

Génération Numérique Amériques Inc
181 rue Bay, Suite 1800
Toronto, ON, M5J 2T9
Canada

Bureau aux États-Unis

Generation Digital Americas Inc
77 Sands St,
Brooklyn, NY 11201,
États-Unis

Bureau de l'UE

Génération de logiciels numériques
Bâtiment Elgee
Dundalk
A91 X2R3
Irlande

Bureau du Moyen-Orient

6994 Alsharq 3890,
An Narjis,
Riyad 13343,
Arabie Saoudite

Numéro d'entreprise : 256 9431 77 | Droits d'auteur 2026 | Conditions générales | Politique de confidentialité