This skill provides a comprehensive course on prompt evaluations using the Anthropic API. It covers various evaluation techniques, including human-graded evals, code-graded evals, and model-graded evals. The course also introduces Promptfoo, a tool for streamlining and managing prompt evaluations.
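As a rough illustration of what a code-graded eval can look like, here is a minimal sketch in which a plain function, not a human or a model, decides whether each output passes. The names `exact_match`, `run_eval`, and `fake_model` are hypothetical and not taken from the course. The stub stands in for a real Anthropic API call so the example runs offline.

```python
from typing import Callable, List, Tuple


def exact_match(output: str, expected: str) -> bool:
    """Code grader: pass when the answers match, ignoring case and outer whitespace."""
    return output.strip().lower() == expected.strip().lower()


def run_eval(model_fn: Callable[[str], str], cases: List[Tuple[str, str]]) -> float:
    """Run every (prompt, expected) pair through model_fn and return the pass rate."""
    if not cases:
        raise ValueError("cases must not be empty")
    passed = sum(exact_match(model_fn(prompt), expected) for prompt, expected in cases)
    return passed / len(cases)


def fake_model(prompt: str) -> str:
    """Hypothetical stand-in for a real model call; one answer is deliberately wrong."""
    canned = {"What is 2 + 2?": "4", "Capital of France?": "Lyon"}
    return canned.get(prompt, "")


cases = [("What is 2 + 2?", "4"), ("Capital of France?", "Paris")]
accuracy = run_eval(fake_model, cases)
print(f"accuracy: {accuracy:.2f}")
```

In practice `fake_model` would be replaced by a function that sends the prompt to the model and returns its text response. The grading loop itself would not need to change.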
The skill guides users through writing different types of evaluations, such as classification evals and custom graders. It also demonstrates how to use Promptfoo for model-graded evals and custom model-graded evals. By completing this course, users will gain the knowledge and skills necessary to effectively evaluate and improve their prompts.
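For the classification evals and custom graders mentioned above, a custom grader might be sketched as follows. This assumes Promptfoo's Python assertion hook, which looks for a `get_assert(output, context)` function in the referenced file. The label set, the `expected` variable name, and the scoring choices are hypothetical and not taken from the course.

```python
from typing import Any, Dict

# Hypothetical label set for a support-ticket classifier.
ALLOWED_LABELS = {"billing", "technical", "account", "other"}


def get_assert(output: str, context: Dict[str, Any]) -> Dict[str, Any]:
    """Grade one classification output against the expected label.

    Assumes the test case defines a variable named `expected`,
    read here from context["vars"].
    """
    predicted = output.strip().lower()
    expected = str(context["vars"]["expected"]).strip().lower()

    if predicted not in ALLOWED_LABELS:
        # The model returned something other than a single known label.
        return {"pass": False, "score": 0.0, "reason": f"unknown label: {predicted!r}"}

    correct = predicted == expected
    return {
        "pass": correct,
        "score": 1.0 if correct else 0.0,
        "reason": "label matches" if correct else f"expected {expected!r}, got {predicted!r}",
    }


# Local check outside Promptfoo, using a hand-built context.
result = get_assert("Billing\n", {"vars": {"expected": "billing"}})
print(result)
```

Returning a dictionary with a reason string, rather than a bare boolean, makes failed cases easier to read when reviewing eval results.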
The lessons are designed to build upon each other, starting with an introduction to evaluations and progressing to more advanced topics. Each lesson includes practical examples and exercises to reinforce learning.
In short, the skill provides a course on prompt evaluations and teaches users how to implement various evaluation techniques with the Anthropic API and Promptfoo.
Use it when you need to evaluate and improve the performance of prompts used with the Anthropic API, so that they produce accurate, reliable, and intended outcomes.
Copy SKILL.md to your skills directory