Building LLM Applications for Production

A comprehensive guide to the challenges of running LLMs in production, covering prompt engineering, evaluation, cost analysis, latency, the tradeoffs between fine-tuning and prompting, and testing strategies.

Evaluating and Debugging Generative AI Models

Covers evaluation metrics, debugging techniques, and systematic testing for generative AI applications using Weights & Biases. This course is the practical companion to the Evaluation & Testing learning path: the course provides hands-on practice with evaluation tools, while the path covers the full evaluation landscape across providers.

Evaluation on AI Knowledge Base
