Automated PR Reviewer
This skill provides automated pull request reviews, identifying potential security vulnerabilities, logic errors, and st...
This skill provides strategies for caching LLM prompts and responses to reduce costs and improve performance. It covers prompt prefix caching (like Anthropic's native caching), full response caching, and Cache Augmented Generation (CAG). It's designed for situations where prompts have repeated prefixes, responses are likely to be similar, and semantic similarity matters more than exact match.
LLM caching differs from traditional caching. Prompts have prefixes that can be effectively cached, responses can vary based on parameters like temperature, and semantic similarity often plays a crucial role in determining cache hits. Effective caching requires careful consideration of the appropriate caching level (prefix, response, or both) and robust cache invalidation strategies.
This skill helps you avoid common pitfalls like caching with high temperature settings, neglecting cache invalidation, and indiscriminately caching everything, ensuring optimal cache utilization and accuracy.
Implements caching strategies for LLM prompts and responses, including prefix caching, full response caching, and Cache Augmented Generation (CAG).
When you need to reduce LLM costs, improve response times for similar queries, or leverage pre-cached documents instead of real-time RAG retrieval.
Copy SKILL.md to your skills directory
Discover more AI agent skills in the same category to enhance your workflow automation.
This skill provides automated pull request reviews, identifying potential security vulnerabilities, logic errors, and st...
This skill leverages deep context analysis to generate comprehensive test suites automatically. It identifies edge cases...
This skill helps developers optimize their pull requests for efficient code reviews. It generates comprehensive PR descr...
This skill provides expert-level assistance for Flutter development, covering everything from architecture and UI implem...
Provides production-ready FastAPI project structures with async patterns, dependency injection, middleware, and best pra...
The App Builder skill serves as a central orchestrator for creating full-stack applications based on natural language re...
Join the community and help AI agents learn new capabilities. Submit your skill and reach thousands of developers.