Skill UI
Browse and discover 6069+ curated skills
Search: Deployment, found 75 results
Exa Architecture Variants
exa-architecture-variants
jeremylongshore/claude-code-plugins-plus-skills
383
Guides teams through choosing and implementing validated Exa deployment architectures, from direct search to cached semantic layers and RAG pipelines, so integrations scale from simple apps to high-query, AI-enhanced solutions.
FastAPI ML Endpoint
fastapi-ml-endpoint
jeremylongshore/claude-code-plugins-plus-skills
207
Automates FastAPI ML endpoint deployments by guiding architecture, producing production-ready code, and validating serving configs when you invoke the ML deployment skill.
Fine-Tuning LLM Expert
fine-tuning-expert
Jeffallan/claude-skills
360
Hands-on guide for tuning large language models via LoRA/QLoRA/PEFT, covering dataset prep, hyperparameter config, adapter training, evaluation, and production deployment with quantization best practices.
Flask ML API Creator
flask-ml-api-creator
jeremylongshore/claude-code-plugins-plus-skills
478
Automates Flask-based machine learning API deployment by guiding you through best practices, generating production code/configurations, and validating outputs for serving, monitoring, and MLOps pipelines.
Genkit Production Workflow
genkit-production-expert
jeremylongshore/claude-code-plugins-plus-skills
355
Guides building production Firebase Genkit apps with RAG flows, multi-step tooling, schema validation, and deployment via Firebase Functions or Cloud Run, including AI monitoring and error-handling best practices.
GPU Resource Optimizer
gpu-resource-optimizer
jeremylongshore/claude-code-plugins-plus-skills
127
Automates GPU resource optimization for ML deployments by guiding best practices, generating production-ready configurations, and validating setups for model serving, MLOps pipelines, and monitoring.
Groq Multi-Environment Config
groq-multi-env-setup
jeremylongshore/claude-code-plugins-plus-skills
160
Sets up Groq LLM API clients for development, staging, and production tiers with tailored models, retries, and rate-limit handling so deployments stay aligned with each environment’s speed and cost profile.
Groq Reference Architecture
groq-reference-architecture
jeremylongshore/claude-code-plugins-plus-skills
54
Defines best-practice Groq deployment with tiered model routing, middleware, streaming pipelines, and fallback chains for ultra-fast LLM inference and production monitoring when launching new Groq integrations.
Half Quadratic Quantization
hqq-quantization
Orchestra-Research/AI-Research-SKILLs
160
Calibration-free half-quadratic quantization for LLMs that delivers 1-8 bit model compression with configurable backends (PyTorch, TorchAO, BitBLAS, Marlin). Ideal for fast quantization workflows, zero-data deployment, and PEFT/LoRA fine-tuning with HuggingFace or vLLM stacks.
Inference Latency Profiler
inference-latency-profiler
jeremylongshore/claude-code-plugins-plus-skills
208
Automates inference latency profiling in ML deployment scenarios, offering step-by-step guidance on model serving, MLOps pipelines, monitoring, and production optimization, generating production-ready code, and validating outputs against best practices.
Kling AI Content Compliance
klingai-content-policy
jeremylongshore/claude-code-plugins-plus-skills
62
Guides implementing Kling AI content-policy compliance with prompt screening, moderation checks, violation handling, and documentation to ensure generated responses stay within safety guidelines for production deployments.
KlingAI Team Setup
klingai-team-setup
jeremylongshore/claude-code-plugins-plus-skills
312
Guides administrators through configuring Kling AI for teams, covering organization creation, shared API key management, role-based permissions, quota settings, and usage monitoring for collaborative deployments.