Build with Fireworks AI

no
Summary: Fast inference and fine-tuning for open source models

Original Documentation

Documentation Index#

Fetch the complete documentation index at: https://docs.fireworks.ai/llms.txt Use this file to discover all available pages before exploring further.

Fast inference and fine-tuning for open source models

Fireworks AI is the fastest platform for building with open source AI models. Get production-ready inference and fine-tuning with best-in-class speed, cost and quality.

Get started in minutes#

Use popular models instantly with pay-per-token pricing. Perfect for quality vibe testing and prototyping.

Deploy with high performance on dedicated GPUs with fast autoscaling and minimal cold starts. Optimize deployments for speed and throughput.

Boost model quality with supervised and reinforcement fine-tuning of models up to 1T+ parameters. Start training in minutes, deploy immediately.

Not sure where to start? First, pick the right model for your use case with our model selection guide. Then choose Serverless to prototype quickly, move to Deployments to optimize and run production workloads, or use Fine-tuning to improve quality.

New to AI or Fireworks? Look up any term in the Glossary.

Need help optimizing deployments, fine-tuning models, or setting up production infrastructure? Talk to our team - we’ll help you get the best performance and reliability.

What you can build#

Text, vision, audio, image, and embeddings

Drop-in replacement for inference and fine-tuning — same API, same SFT data format

Connect models to tools and APIs

Reliable JSON responses for agentic workflows

Analyze images and documents

Use embeddings & reranking in search & context retrieval

Run async inference jobs at scale, faster and cheaper

Resources & help#

Find the best model for your use case

Code examples and tutorials

Complete API documentation

Ask questions and get help from developers

SOC 2, HIPAA, and audit reports

Check service uptime

Talk to our team

Link last verified June 7, 2026. View original ↗
Source: Fireworks AI Docs
Link last verified: 2026-06-07