Available models

Summary: See the models you can train with Serverless RL

Documentation Index
Fetch the complete documentation index at https://docs.wandb.ai/llms.txt. Use this file to discover all available pages before exploring further.

Serverless RL currently supports the following foundation models for training.

To express interest in training a model that is not listed here, contact support.

Model catalog

| Model | Model ID (for API usage) | Type | Context Window (tokens) | Parameters | Description |
| --- | --- | --- | --- | --- | --- |
| OpenPipe Qwen3 14B Instruct | `OpenPipe/Qwen3-14B-Instruct` | Text | 32.8K | 14.8B (Total) | An efficient multilingual, dense, instruction-tuned model, optimized by OpenPipe for building agents with finetuning. |
| Qwen3 30B A3B | `Qwen/Qwen3-30B-A3B-Instruct-2507` | Text | 262K | 3.3B-30.5B (Active-Total) | Qwen3-30B-A3B-Instruct-2507 is a 30.5B MoE instruction-tuned model with enhanced reasoning, coding, and long-context understanding. |
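
Because the Model ID column is what you pass in API calls, it can help to check an ID against the catalog before starting a job. The following is a minimal sketch, not part of any W&B library: `ModelSpec`, `SUPPORTED_MODELS`, and `resolve_model` are hypothetical names, and the dictionary is a hand-copied mirror of the table above rather than data fetched from an API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """One row of the catalog table (values copied by hand from the table)."""
    name: str
    model_id: str
    context_window: str
    parameters: str


# Hypothetical local mirror of the catalog; update it when the table changes.
SUPPORTED_MODELS = {
    spec.model_id: spec
    for spec in (
        ModelSpec(
            "OpenPipe Qwen3 14B Instruct",
            "OpenPipe/Qwen3-14B-Instruct",
            "32.8K",
            "14.8B (Total)",
        ),
        ModelSpec(
            "Qwen3 30B A3B",
            "Qwen/Qwen3-30B-A3B-Instruct-2507",
            "262K",
            "3.3B-30.5B (Active-Total)",
        ),
    )
}


def resolve_model(model_id: str) -> ModelSpec:
    """Return the catalog entry for model_id, or raise if it is not listed."""
    try:
        return SUPPORTED_MODELS[model_id]
    except KeyError:
        listed = ", ".join(sorted(SUPPORTED_MODELS))
        raise ValueError(
            f"{model_id!r} is not in the catalog; listed IDs: {listed}"
        ) from None


if __name__ == "__main__":
    spec = resolve_model("OpenPipe/Qwen3-14B-Instruct")
    print(spec.name, spec.context_window)
```

The lookup is exact and case-sensitive, so a misspelled or differently cased ID fails early with a message that lists the valid IDs.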

Link last verified June 7, 2026.

Source: Weights & Biases Docs

Link last verified: 2026-03-04