Available models
See the models you can train with Serverless RL
Serverless RL currently supports the following foundation models for training.
To express interest in a particular model, contact support.
Model catalog
| Model | Model ID (for API usage) | Type | Context Window | Parameters | Description |
|---|---|---|---|---|---|
| OpenPipe Qwen3 14B Instruct | OpenPipe/Qwen3-14B-Instruct | Text | 32.8K | 14.8B (Total) | An efficient, multilingual, dense, instruction-tuned model, optimized by OpenPipe for building agents with fine-tuning. |
| Qwen3 30B A3B | Qwen/Qwen3-30B-A3B-Instruct-2507 | Text | 262K | 3.3B-30.5B (Active-Total) | Qwen3-30B-A3B-Instruct-2507 is a 30.5B MoE instruction-tuned model with enhanced reasoning, coding, and long-context understanding. |
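
Because the Model ID column is the value intended for API usage, it can help to check an ID against the catalog before starting a training job. The following is a minimal sketch under stated assumptions: `CatalogEntry`, `CATALOG`, and `resolve_model_id` are hypothetical names introduced for illustration only, the data is copied by hand from the table above, and nothing here calls a Serverless RL or W&B API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CatalogEntry:
    """One row of the model catalog table (values copied from the table)."""
    display_name: str
    model_id: str
    context_window: str
    parameters: str


# Hypothetical local mirror of the catalog; it is not fetched from any service
# and must be updated by hand if the documented catalog changes.
CATALOG = (
    CatalogEntry(
        "OpenPipe Qwen3 14B Instruct",
        "OpenPipe/Qwen3-14B-Instruct",
        "32.8K",
        "14.8B (Total)",
    ),
    CatalogEntry(
        "Qwen3 30B A3B",
        "Qwen/Qwen3-30B-A3B-Instruct-2507",
        "262K",
        "3.3B-30.5B (Active-Total)",
    ),
)


def resolve_model_id(name_or_id: str) -> str:
    """Return the catalog model ID for a display name or a model ID.

    Matching ignores case and surrounding whitespace.
    Raises ValueError if the value matches no catalog row.
    """
    wanted = name_or_id.strip().lower()
    for entry in CATALOG:
        if wanted in (entry.display_name.lower(), entry.model_id.lower()):
            return entry.model_id
    supported = ", ".join(entry.model_id for entry in CATALOG)
    raise ValueError(
        f"{name_or_id!r} is not in the catalog; supported IDs: {supported}"
    )


if __name__ == "__main__":
    print(resolve_model_id("Qwen3 30B A3B"))
```

Passing either the display name or the model ID returns the exact ID string from the table, and any other value raises an error that lists the supported IDs.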
Link last verified: June 7, 2026.
Source: Weights & Biases Docs
Link last verified: 2026-03-04