Runs ↗

langchain concept beginner agents workflows

Summary: An overview of runs in Agent Server, including how to kick off background runs, stateless runs, and cancel runs.

Original Documentation

Documentation Index#
Fetch the complete documentation index at: https://docs.langchain.com/llms.txt Use this file to discover all available pages before exploring further.

An overview of runs in Agent Server, including how to kick off background runs, stateless runs, and cancel runs.

A run is an invocation of an assistant. When you execute a run, you specify which assistant to use—either by graph ID for the default assistant, or by assistant ID for a specific configuration.

flowchart TB
    subgraph deploy[Deployment]
        G[Graph Code<br/>━━━━━━━━━<br/>Deployed logic]
    end

    subgraph config[Configuration]
        A1[Assistant 1<br/>GPT-4, Formal]
        A2[Assistant 2<br/>Claude, Casual]
    end

    subgraph state[State]
        T1[Thread 1<br/>User A]
        T2[Thread 2<br/>User B]
    end

    subgraph runs[Runs]
        A1T1["Run: A1 + T1"]
        A1T2["Run: A1 + T2"]
        A2T1["Run: A2 + T1"]
    end

    A1 -.-> T1
    A1 -.-> T2
    A2 -.-> T1

    A1T1 --> G
    A1T2 --> G
    A2T1 --> G

    style G fill:#DBEAFE,stroke:#2563EB,stroke-width:2px,color:#1E3A8A
    style A1 fill:#B3E0F2,stroke:#4A90E2,stroke-width:2px,color:#1E3A5F
    style A2 fill:#B3E0F2,stroke:#4A90E2,stroke-width:2px,color:#1E3A5F
    style T1 fill:#FFE0B3,stroke:#F59E0B,stroke-width:2px,color:#78350F
    style T2 fill:#FFE0B3,stroke:#F59E0B,stroke-width:2px,color:#78350F
    style A1T1 fill:#B3F2C9,stroke:#10B981,stroke-width:2px,color:#14532D
    style A1T2 fill:#B3F2C9,stroke:#10B981,stroke-width:2px,color:#14532D
    style A2T1 fill:#B3F2C9,stroke:#10B981,stroke-width:2px,color:#14532D

This diagram shows how a run combines an assistant with a thread to execute the graph:

Graph (blue): The deployed code containing your agent’s logic
Assistants (light blue): Configuration options (model, prompts, tools)
Threads (orange): State containers for conversation history
Runs (green): Executions that pair an assistant + thread

Example combinations:

Run: A1 + T1: Assistant 1 configuration applied to User A’s conversation
Run: A1 + T2: Same assistant serving User B (different conversation)
Run: A2 + T1: Different assistant applied to User A’s conversation (configuration switch)

When executing a run:

Each run may have its own input, configuration overrides, and metadata.
Runs can be stateless (no thread) or stateful (executed on a thread for conversation persistence).
Multiple runs can use the same assistant configuration.
The assistant’s configuration affects how the underlying graph executes.

The Agent Server API provides several endpoints for creating and managing runs. For more details, refer to the API reference.

In this section#

Run your agent asynchronously and poll for results.

Use multiple assistants on a shared thread to combine agent capabilities.

Execute runs without persisting state when conversation history isn’t needed.

Cancel a single run or multiple runs via the API.

Edit this page on GitHub or file an issue.

Connect these docs to Claude, VSCode, and more via MCP for real-time answers.

Link last verified June 7, 2026. View original ↗

Source: LangChain Docs

Link last verified: 2026-04-05