Training Guide: UI ↗

fireworks guide intermediate fine-tuning ide

Summary: Launch RFT jobs using the Fireworks dashboard

Original Documentation

Documentation Index#
Fetch the complete documentation index at: https://docs.fireworks.ai/llms.txt Use this file to discover all available pages before exploring further.

Launch RFT jobs using the Fireworks dashboard

Reinforcement Fine-Tuning (RFT) is free for models under 16B parameters. When creating an RFT job in the UI, filter for free tuning models in the model selection area on the fine-tuning creation page. If kicking off jobs from the terminal, you can find the model ID from the Model Library. Note: SFT and DPO jobs are billed per training token for all model sizes—see the pricing page for details.

The Fireworks RFT UI provides a visual interface for creating RFT jobs, with guided parameter selection. Results for all jobs can also be found in the UI.

When to use Web UI#

Start with the UI to learn the options, then switch to CLI for faster iteration and automation. Remember, your results will always live in the UI.

Feature	CLI (eval-protocol)	Web UI
Best for	Experienced users, automation	First-time users, exploration
Parameter discovery	Need to know flag names	Guided with tooltips
Speed	Fast - single command	Slower - multiple steps
Automation	Easy to script and reproduce	Manual process
Batch operations	Easy to launch multiple jobs	One at a time
Reproducibility	Excellent - save commands	Manual tracking needed

Launch training via Web UI#

Go to Fireworks Dashboard
Click Fine-Tuning in the left sidebar
Click Fine-tune a Model

Fine-tuning dashboard showing list of jobs

Choose Reinforcement as the tuning method
Select your base model from the dropdown

The UI shows only models that support fine-tuning. Popular choices appear at the top.

Not sure which model to choose? Start with llama-v3p1-8b-instruct for a good balance of quality and speed.

Upload new dataset or select existing from your account
Preview dataset entries to verify format
The UI validates your JSONL format automatically

Each dataset row should have messages array:

    {
      "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is 25 * 4?"}
      ]
    }
    ```
  <span class="step-end"></span>

  <span class="step-marker" data-step-title="Select Evaluator"></span>
1. Choose from your uploaded evaluators
2. Preview evaluator code and test results
3. View recent evaluation metrics

If you haven't uploaded an evaluator yet, you'll need to do that first via CLI:

```bash
    pytest my-evaluator-name.py -vs
    ```

<span class="callout-start" data-callout-type="note"></span>
  For remote evaluators, you'll enter your server URL in the environment configuration section.
<span class="callout-end"></span>
  <span class="step-end"></span>

  <span class="step-marker" data-step-title="Set Training Parameters"></span>
Configure how the model learns:

**Core parameters**:

* **Output model name**: Custom name for your fine-tuned model
* **Epochs**: Number of passes through the dataset (start with 1)
* **Learning rate**: How fast the model updates (use default 1e-4)
* **LoRA rank**: Model capacity (8-16 for most tasks)
* **Batch size**: Training throughput (use default 32k tokens)

The UI shows helpful tooltips for each parameter. See [Parameter Tuning](/fine-tuning/parameter-tuning) for detailed guidance.
  <span class="step-end"></span>

  <span class="step-marker" data-step-title="Configure Rollout Parameters"></span>
Control how the model generates responses during training:

* **Temperature**: Sampling randomness (0.7 for balanced exploration)
* **Top-p**: Probability mass cutoff (0.9-1.0)
* **Top-k**: Token candidate limit (40 is standard)
* **Number of rollouts (n)**: Responses per prompt (4-8 recommended)
* **Max tokens**: Maximum response length (2048 default)

Higher temperature and more rollouts increase exploration but also cost.
  <span class="step-end"></span>

  <span class="step-marker" data-step-title="Review and Launch"></span>
1. Review all settings in the summary panel
2. See estimated training time and cost
3. Click **Start Fine-Tuning** to launch

The dashboard will redirect you to the job monitoring page where you can track progress in real-time.
  <span class="step-end"></span>
<span class="steps-end"></span>

## Next steps

<span class="card-group-start" data-cols="3"></span>
  <span class="card-start" data-card-title="Prerequisites & Validation" data-card-icon="list-check" data-card-href="/fine-tuning/training-prerequisites"></span>
Review requirements, validation, and common errors
  <span class="card-end"></span>

  <span class="card-start" data-card-title="Monitor training" data-card-icon="chart-line" data-card-href="/fine-tuning/monitor-training"></span>
Track job progress, inspect rollouts, and debug issues
  <span class="card-end"></span>

  <span class="card-start" data-card-title="Parameter tuning" data-card-icon="sliders" data-card-href="/fine-tuning/parameter-tuning"></span>
Learn how to adjust parameters for better results
  <span class="card-end"></span>
<span class="card-group-end"></span>

Link last verified June 7, 2026. View original ↗

Source: Fireworks AI Docs

Link last verified: 2026-06-07