Patching ↗

Original Documentation

Patching adds structured output features to LLM client libraries. This page explains how it works. For most users, from_provider is simpler than manual patching.

Recommended Approach

Use from_provider instead of manual patching. It works the same way across all providers. See the Migration Guide if you’re using older patching patterns.

What is Patching?#

Patching adds new features to LLM client objects without changing their original code. When Instructor patches a client, it adds:

New parameters: response_model, max_retries, and context to completion methods
Validation: Checks responses against Pydantic models
Retry logic: Retries when validation fails
Compatibility: The patched client still works with all original methods

How Patching Works#

When Instructor patches a client, it:

Wraps the completion method: Intercepts calls to create() or chat.completions.create()
Converts schemas: Changes Pydantic models into provider-specific formats (JSON schema, tool definitions, etc.)
Validates responses: Checks LLM outputs against your Pydantic model
Handles retries: Retries with validation feedback if needed
Returns typed objects: Converts validated JSON into Pydantic model instances

Patching Modes#

Different providers support different modes for structured extraction. Instructor automatically selects the best mode for each provider, but you can override it:

Tool Calling (TOOLS)#

Uses the provider’s function/tool calling API. This is the default for OpenAI.

Supported by: OpenAI, Anthropic (ANTHROPIC_TOOLS), Google (GENAI_TOOLS), Ollama (for supported models)

JSON Mode#

Instructs the model to return JSON directly. Works with most providers.

Supported by: OpenAI, Anthropic, Google, Ollama, and most providers

Markdown JSON (MD_JSON)#

Asks for JSON wrapped in markdown. Only use for specific providers like Databricks.

Supported by: Databricks, some vision models

Default Modes by Provider#

Each provider uses a recommended default mode:

OpenAI: Mode.TOOLS (function calling)
Anthropic: Mode.TOOLS (tool use)
Google: Mode.TOOLS (function calling)
Ollama: Mode.TOOLS (if model supports it) or Mode.JSON
Others: Provider-specific defaults

When using from_provider, these defaults are applied automatically. You can override them with the mode parameter.

Manual Patching (Advanced)#

If you need to patch a client manually (not recommended for most users):

import openai
import instructor
from pydantic import BaseModel


class YourModel(BaseModel):
    message: str


# Create the base client
openai_client = openai.OpenAI()

# Patch it manually
client = instructor.patch(openai_client, mode=instructor.Mode.TOOLS)

# Now use it
response = client.chat.completions.create(
    response_model=YourModel,
    messages=[{"role": "user", "content": "Say hello"}],
)

However, using from_provider is simpler and recommended:

import instructor
from pydantic import BaseModel


# Simpler approach
class YourModel(BaseModel):
    message: str


client = instructor.from_provider("openai/gpt-4o-mini")
_response = client.create(
    response_model=YourModel,
    messages=[{"role": "user", "content": "Say hello"}],
)

What Gets Patched?#

Instructor adds these features to patched clients:

New Parameters#

response_model: A Pydantic model or type that defines the expected output structure
max_retries: Number of retry attempts if validation fails (default: 0)
context: Additional context for validation hooks

Enhanced Methods#

The patched client’s create() method:

Accepts response_model parameter
Validates responses automatically
Retries on validation failures
Returns typed Pydantic objects instead of raw responses

Provider-Specific Considerations#

OpenAI#

Default mode: TOOLS (function calling)
Supports streaming with structured outputs

Anthropic#

Default mode: ANTHROPIC_TOOLS (tool use)
Uses Claude’s native tool calling API

Google Gemini#

Default mode: GENAI_TOOLS (function calling)
Requires jsonref package for tool calling
Some limitations with strict validation and enums

Ollama (Local Models)#

Default mode: TOOLS (if model supports it) or JSON
Models like llama3.1, llama3.2, mistral-nemo support tools
Older models fall back to JSON mode

When to Use Manual Patching#

Manual patching is rarely needed. Use it only if:

You need fine-grained control over the patching process
You’re working with a custom client implementation
You’re debugging patching behavior

For 99% of use cases, from_provider is the better choice.

from_provider Guide - Recommended way to create patched clients
Migration Guide - Migrating from manual patching to from_provider
Modes Comparison - Detailed comparison of different modes
Integrations - Provider-specific documentation

Link last verified June 7, 2026. View original ↗

Source: Instructor Docs

Link last verified: 2026-03-04