Available models

no
Summary: See the models you can train with Serverless RL

Original Documentation

Documentation Index#

Fetch the complete documentation index at: https://docs.wandb.ai/llms.txt Use this file to discover all available pages before exploring further.

See the models you can train with Serverless RL

Serverless RL currently supports the following foundation models for training.

To express interest in a particular model, contact support.

Model catalog#

ModelModel ID (for API usage)TypeContext WindowParametersDescription
OpenPipe Qwen3 14B InstructOpenPipe/Qwen3-14B-InstructText32.8K14.8B (Total)An efficient multilingual, dense, instruction-tuned model, optimized by OpenPipe for building agents with finetuning.
Qwen3 30B A3BQwen/Qwen3-30B-A3B-Instruct-2507Text262K3.3B-30.5B (Active-Total)Qwen3-30B-A3B-Instruct-2507 is a 30.5B MoE instruction-tuned model with enhanced reasoning, coding, and long-context understanding.
Link last verified June 7, 2026. View original ↗
Source: Weights & Biases Docs
Link last verified: 2026-03-04