W&B Inference ↗
noOriginal Documentation
Documentation Index#
Fetch the complete documentation index at: https://docs.wandb.ai/llms.txt Use this file to discover all available pages before exploring further.
Access open-source foundation models through W&B Weave and an OpenAI-compatible API
W&B Inference gives you access to leading open-source foundation models through W&B Weave and an OpenAI-compatible API. You can:
- Build AI applications and agents without signing up for a hosting provider or self-hosting a model
- Try supported models in the W&B Weave Playground
With Weave, you can trace, evaluate, monitor, and improve your W&B Inference-powered applications.
Quickstart#
Here’s a simple example using Python:
import openai
client = openai.OpenAI(
# The custom base URL points to W&B Inference
base_url='https://api.inference.wandb.ai/v1',
# Create an API key at https://wandb.ai/settings
api_key="<your-api-key>",
# Optional: Team and project for usage tracking
project="<your-team>/<your-project>",
)
response = client.chat.completions.create(
model="meta-llama/Llama-3.1-8B-Instruct",
messages=[
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Tell me a joke."}
],
)
print(response.choices[0].message.content)Next steps#
- Review the available models and usage information and limits
- Set up your account using the prerequisites
- Use the service through the API or UI
- Try the usage examples
Usage details#
For information about pricing, usage limits, and credits, see Usage Information and Limits.
Link last verified
June 7, 2026.
View original ↗
Source: Weights & Biases Docs
Link last verified: 2026-04-05