Adding online agent evaluation sample notebooks for all evaluators #286
Agent Online Evaluation Notebooks
This directory contains comprehensive sample notebooks for evaluating AI agents using the Azure AI Projects SDK with online evaluation capabilities. These notebooks demonstrate how to assess various aspects of agent performance and response quality using modern batch evaluation patterns.
Overview
The Agent Online Evaluation notebooks provide complete coverage of Azure AI evaluation capabilities specifically designed for agent scenarios. Each notebook uses the modern Azure AI Projects SDK with the standardized SourceFileContentContent(item={}) format for consistent and maintainable evaluation workflows.
Evaluators Included (14 Total)
Agent-Specific Evaluators
Quality Evaluators
Key Features
azure-ai-projects with AIProjectClient and OpenAI evals API
SourceFileContentContent(item={}) format across all notebooks
run_evaluator helper
Each notebook includes detailed documentation, prerequisite setup, a scoring system explanation, comprehensive samples demonstrating all input type variants, and batch evaluation examples.
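As a rough illustration of the item-wrapped format named under Key Features, the sketch below builds an inline data source using plain dictionaries only. It assumes the payload follows the OpenAI evals JSONL data source shape (a `file_content` source whose `content` entries each carry an `item`); the helper name `to_file_content_source` and the `query`/`response` fields are hypothetical and not taken from the notebooks.

```python
import json
from typing import Any


def to_file_content_source(rows: list[dict[str, Any]]) -> dict[str, Any]:
    """Wrap each row as {"item": row}, mirroring SourceFileContentContent(item={...}).

    Hypothetical helper for illustration; the notebooks may build this differently.
    """
    return {
        "type": "file_content",
        "content": [{"item": dict(row)} for row in rows],
    }


# Hypothetical agent samples; the field names are illustrative assumptions.
rows = [
    {"query": "What is the weather in Seattle?", "response": "It is 15 C and cloudy."},
    {"query": "Book a table for two.", "response": "Booked for 7 pm."},
]

# Assumed overall shape of an inline JSONL data source for an eval run.
data_source = {"type": "jsonl", "source": to_file_content_source(rows)}
print(json.dumps(data_source, indent=2))
```

Keeping every sample under a single `item` key means an evaluator's data mapping can address fields uniformly regardless of which evaluator is being run.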
Checklist