Build with Fireworks AI ↗
noOriginal Documentation
Documentation Index#
Fetch the complete documentation index at: https://docs.fireworks.ai/llms.txt Use this file to discover all available pages before exploring further.
Fast inference and fine-tuning for open source models
Fireworks AI is the fastest platform for building with open source AI models. Get production-ready inference and fine-tuning with best-in-class speed, cost and quality.
Get started in minutes#
Use popular models instantly with pay-per-token pricing. Perfect for quality vibe testing and prototyping.
Deploy with high performance on dedicated GPUs with fast autoscaling and minimal cold starts. Optimize deployments for speed and throughput.
Boost model quality with supervised and reinforcement fine-tuning of models up to 1T+ parameters. Start training in minutes, deploy immediately.
Not sure where to start? First, pick the right model for your use case with our model selection guide. Then choose Serverless to prototype quickly, move to Deployments to optimize and run production workloads, or use Fine-tuning to improve quality.
New to AI or Fireworks? Look up any term in the Glossary.
Need help optimizing deployments, fine-tuning models, or setting up production infrastructure? Talk to our team - we’ll help you get the best performance and reliability.
What you can build#
Text, vision, audio, image, and embeddings
Drop-in replacement for inference and fine-tuning — same API, same SFT data format
Connect models to tools and APIs
Reliable JSON responses for agentic workflows
Analyze images and documents
Use embeddings & reranking in search & context retrieval
Run async inference jobs at scale, faster and cheaper
Resources & help#
Find the best model for your use case
Code examples and tutorials
Complete API documentation
Ask questions and get help from developers
SOC 2, HIPAA, and audit reports
Check service uptime
Talk to our team