Build with Claude — Overview ↗

anthropic-platform concept beginner models prompts

Summary: Central hub for understanding how to build applications with the Claude API.

Editorial Notes

This is the map for the entire “Build with Claude” section. Skim it to understand what capabilities exist (messages, streaming, tool use, vision, caching, etc.) before diving into individual features. The mental model here is that everything flows through the Messages API — other features like tool use, streaming, and vision are extensions of that core primitive.

Original Documentation

Explore Claude’s advanced features and capabilities.

Claude’s API surface is organized into five areas:

Model capabilities: Control how Claude reasons and formats responses.
Tools: Let Claude take actions on the web or in your environment.
Tool infrastructure: Handles discovery and orchestration at scale.
Context management: Keeps long-running sessions efficient.
Files and assets: Manage the documents and data you provide to Claude.

If you’re new, start with model capabilities and tools. Return to the other sections when you’re ready to optimize cost, latency, or scale.

Model capabilities#

Ways to steer Claude and Claude’s direct outputs, including response format, reasoning depth, and input modalities.

Feature	Description	Availability
1M token context window	An extended context window that allows you to process much larger documents, maintain longer conversations, and work with more extensive codebases.
Adaptive thinking	Let Claude dynamically decide when and how much to think. The recommended thinking mode for Opus 4.6. Use the effort parameter to control thinking depth.
Batch processing	Process large volumes of requests asynchronously for cost savings. Send batches with a large number of queries per batch. Batch API calls cost 50% less than standard API calls.
Citations	Ground Claude’s responses in source documents. With Citations, Claude can provide detailed references to the exact sentences and passages it uses to generate responses, leading to more verifiable, trustworthy outputs.
Data residency	Control where model inference runs using geographic controls. Specify `"global"` or `"us"` routing per request via the `inference_geo` parameter.
Effort	Control how many tokens Claude uses when responding with the effort parameter, trading off between response thoroughness and token efficiency. Supported on Opus 4.6 and Opus 4.5.
Extended thinking	Enhanced reasoning capabilities for complex tasks, providing transparency into Claude’s step-by-step thought process before delivering its final answer.
PDF support	Process and analyze text and visual content from PDF documents.
Search results	Enable natural citations for RAG applications by providing search results with proper source attribution. Achieve web search-quality citations for custom knowledge bases and tools.
Structured outputs	Guarantee schema conformance with two approaches: JSON outputs for structured data responses, and strict tool use for validated tool inputs.

Tools#

Built-in tools that Claude invokes via tool_use. Server-side tools are run by the platform; client-side tools are implemented and executed by you.

Server-side tools#

Feature	Description	Availability
Code execution	Run code in a sandboxed environment for advanced data analysis, calculations, and file processing. Free when used with web search or web fetch.
Memory	Enable Claude to store and retrieve information across conversations. Build knowledge bases over time, maintain project context, and learn from past interactions.
Web fetch	Retrieve full content from specified web pages and PDF documents for in-depth analysis.
Web search	Augment Claude’s comprehensive knowledge with current, real-world data from across the web.

Client-side tools#

Feature	Description	Availability
Bash	Execute bash commands and scripts to interact with the system shell and perform command-line operations.
Computer use	Control computer interfaces by taking screenshots and issuing mouse and keyboard commands.
Text editor	Create and edit text files with a built-in text editor interface for file manipulation tasks.

Tool infrastructure#

Infrastructure that supports discovering, orchestrating, and scaling tool use.

Feature	Description	Availability
Agent Skills	Extend Claude’s capabilities with Skills. Use pre-built Skills (PowerPoint, Excel, Word, PDF) or create custom Skills with instructions and scripts. Skills use progressive disclosure to efficiently manage context.
Fine-grained tool streaming	Stream tool use parameters without buffering/JSON validation, reducing latency for receiving large parameters.
MCP connector	Connect to remote MCP servers directly from the Messages API without a separate MCP client.
Programmatic tool calling	Enable Claude to call your tools programmatically from within code execution containers, reducing latency and token consumption for multi-tool workflows.
Tool search	Scale to thousands of tools by dynamically discovering and loading tools on-demand using regex-based search, optimizing context usage and improving tool selection accuracy.

Context management#

Infrastructure for controlling and optimizing Claude’s context window.

Feature	Description	Availability
Compaction	Server-side context summarization for long-running conversations. When context approaches the window limit, the API automatically summarizes earlier parts of the conversation. Supported on Opus 4.6 and Haiku 4.5.
Context editing	Automatically manage conversation context with configurable strategies. Supports clearing tool results when approaching token limits and managing thinking blocks in extended thinking conversations.
Automatic prompt caching	Simplify prompt caching to a single API parameter. The system automatically caches the last cacheable block in your request, moving the cache point forward as conversations grow.
Prompt caching (5m)	Provide Claude with more background knowledge and example outputs to reduce costs and latency.
Prompt caching (1hr)	Extended 1-hour cache duration for less frequently accessed but important context, complementing the standard 5-minute cache.
Token counting	Token counting enables you to determine the number of tokens in a message before sending it to Claude, helping you make informed decisions about your prompts and usage.

Files and assets#

Manage files and assets for use with Claude.

Feature	Description	Availability
Files API	Upload and manage files to use with Claude without re-uploading content with each request. Supports PDFs, images, and text files.

Link last verified June 7, 2026. View original ↗

Source: Anthropic Platform Docs

Appears in Learning Paths

Claude API Essentials

Link last verified: 2026-02-26