Course Curriculum

    1. Context

    2. Why This Matters?

    3. Learning Objectives

    1. Rationale for LLM and Agent Evaluation

    2. Components of LLM Evaluation

    3. Tasks and Benchmark Datasets for Evaluation

    4. Challenges in LLM Evaluation

    5. LLM-As-A-Judge Evaluation

    6. LLM Evaluation Fundamentals

    1. Classic and Contextual Embedding Approaches

    2. Evaluation Using BLEU

    3. Evaluation Using ROUGE

    4. Evaluation Using METEOR

    5. Evaluation Using BERTScore

    6. Evaluating RAG-Based Applications

    7. Practice - Evaluation Using RAGAs

    8. Faithfulness

    9. Answer Relevancy

    10. Context Precision

    11. Context Recall

    12. Quiz: Evaluation Metrics

    1. Evaluation of Large Language Models

    2. Security, Compliance, and Governance

    3. Security, Compliance, and Governance

    1. Graded Quiz

About this course

  • Free
  • 25 lessons
  • 2 hours of video content