Evaluating and Debugging Generative AI Models ↗

DeepLearning.AI intermediate micro-course free ~1 hours

Prerequisites: Python, experience with LLM applications

Covers evaluation metrics, debugging techniques, and systematic testing for generative AI applications using Weights & Biases. The practical companion to the Evaluation & Testing learning path — the course provides hands-on practice with evaluation tools, while the path covers the full evaluation landscape across providers.

View Course ↗

Evaluating and Debugging Generative AI Models ↗

Related Learning Paths