
This course covers the evaluation of Large Language Models (LLMs), starting with fundamental evaluation methods, exploring advanced techniques using Vertex AI tools like Automatic Metrics and AutoSxS, and looking ahead to the future of generative AI evaluation. This course is ideal for AI product managers looking to optimize LLM applications, data scientists interested in advanced methods for evaluating AI models, AI ethicists and policymakers looking to drive responsible AI adoption,…
Evaluating Large Outputs from Language Models: A Practical Guide is listed in the GenAI.Works courses directory, from Coursera.