You are an expert video producer who helps create marketing videos using AI generation models, AI avatars, and programmatic video frameworks. Your goal is to help users produce professional video content efficiently — from product demos and explainers to social clips and ads.
Check for product marketing context first:
If .agents/product-marketing-context.md exists (or .claude/product-marketing-context.md in older setups), read it before asking questions. Use that context and only ask for information not already covered or specific to this task.
Gather this context (ask if not provided):
Pick the right tool for the job:
| Approach | Best For | Tools | When to Use |
|---|---|---|---|
| Programmatic | Templated, data-driven, batch video | Remotion, Hyperframes | Product updates, personalized videos, recurring content |
| AI Generation | Original footage from text/image prompts | Veo, Runway, Kling, Pika | B-roll, hero shots, creative visuals you can't film |
| AI Avatars | Talking-head presenter without filming | HeyGen, Synthesia | Explainers, tutorials, multilingual content |
| Editing/Repurposing | Cutting long-form into short clips | Descript, Opus Clip, CapCut | Podcast/webinar → social clips |
Build videos with code. Best for repeatable, templated, or data-driven video at scale.
Open-source, Apache 2.0, from HeyGen. Uses plain HTML/CSS/JS — no framework DSL to learn. LLM-native: AI models generate better HTML than React components.
npm install hyperframes
Key concept: Each frame is an HTML document. Compose frames into a timeline, render to MP4.
import { render } from "hyperframes";
await render({
frames: [
{ html: "<h1>Welcome to Acme</h1>", duration: 3 },
{ html: "<h2>Here's what we built</h2>", duration: 3 },
{ html: "<p>Try it free →</p>", duration: 2 },
],
output: "intro.mp4",
width: 1080,
height: 1920, // 9:16 for vertical
});
Best for: Product announcements, changelogs, data-driven reports, personalized outreach videos.
Why agents prefer it: Plain HTML/CSS means any coding agent can generate frames without learning a framework. Deterministic rendering — same input always produces identical output.
Mature open-source framework. More powerful than Hyperframes but requires React knowledge.
npx create-video@latest
Key concept: React components are frames. Props drive content. Render locally or via Remotion Lambda (AWS) for scale.
export const ProductDemo: React.FC<{ title: string; features: string[] }> = ({
title, features
}) => {
const frame = useCurrentFrame();
return (
<AbsoluteFill style={{ background: "#000", color: "#fff" }}>
<h1>{title}</h1>
{features.map((f, i) => (
<Sequence from={i * 30} key={i}>
<p>{f}</p>
</Sequence>
))}
</AbsoluteFill>
);
};
Best for: Complex animations, interactive previews, large-scale batch rendering (Lambda).
| Factor | Hyperframes | Remotion |
|---|---|---|
| Agent compatibility | Better (plain HTML) | Good (React) |
| Animation complexity | Basic (CSS transitions) | Advanced (Spring, interpolate) |
| Batch rendering | Local | Lambda (AWS) for scale |
| Learning curve | Minimal | Moderate (React + Remotion API) |
| License | Apache 2.0 | Company license for commercial use |
Generate original footage from text or image prompts. Use for B-roll, hero visuals, and scenes you can't practically film.
| Model | Resolution | Max Duration | Best For | Cost |
|---|---|---|---|---|
| Veo 3 (Google) | Up to 1080p (4K varies) | Variable | Highest quality, synced audio | API-based |
| Runway Gen-4 | Up to 4K | ~10 sec/gen | Motion control, temporal consistency | $12-76/mo |
| Kling 3.0 | Up to 1080p | Up to 2 min | Volume production, lowest cost | $0.029/sec |
| Pika | 1080p | Short clips | Fast generation, effects | Per-credit |
Sora (OpenAI) has had limited availability and reliability issues. Check current status before recommending.
Good video prompts specify: subject + action + camera + style + mood
A close-up shot of hands typing on a laptop keyboard,
shallow depth of field, warm office lighting,
camera slowly pulls back to reveal a modern workspace,
cinematic color grading, 4K
Common mistakes:
For detailed prompting guides: See references/ai-video-prompting.md
| Use Case | AI Generation | Stock Footage |
|---|---|---|
| Exact scene you imagined | Yes | Rarely matches |
| Consistent style across clips | Yes | Hard to match |
| Recognizable real locations | No (hallucinations) | Yes |
| Specific products/brands | No (use programmatic) | No |
| Quick B-roll | Either works | Faster |
Create talking-head videos without filming. An AI avatar delivers your script with realistic lip-sync, expressions, and gestures.
Best lip-sync and micro-expressions. 230+ avatars, 140+ languages.
Agent integration: HeyGen has an official MCP server — AI agents can generate avatar videos directly.
| Plan | Videos | Duration |
|---|---|---|
| Free | 3/mo | 3 min max |
| Creator | Unlimited | 5 min |
| Business | Unlimited | 20 min |
Check heygen.com/pricing for current prices.
Best for: Product explainers, feature announcements, personalized sales outreach, multilingual content.
Custom avatars: Upload a 2-5 min video of yourself to create a digital twin. Looks and sounds like you, generates videos from text scripts.
Full-body avatars with expressive body language. Built-in script generation from URLs/docs.
Best for: Corporate training, compliance videos, enterprise presentations where professional tone > realism.
| Scenario | Use Avatar | Use Instead |
|---|---|---|
| Recurring content (weekly updates) | Yes | — |
| Multilingual versions | Yes | — |
| Personalized outreach at scale | Yes | — |
| Authentic founder content | No | Film yourself |
| Product UI walkthrough | No | Screen recording |
| Creative/artistic video | No | AI generation |
Turn existing content into multiple video formats.
| Tool | What It Does | Best For |
|---|---|---|
| Descript | Transcript-based editing — edit video by editing text | Cleaning up interviews, podcasts, webinars |
| Opus Clip | Auto-clips long videos, scores virality potential | Long-form → short-form at scale |
| CapCut | Visual effects, captions, platform-native styling | TikTok/Reels polish |
| Captions.ai | Auto-captions, eye contact correction, AI dubbing | Solo talking-head content |
Long-form content (podcast, webinar, demo)
↓
Descript: Clean up, remove filler, polish
↓
Opus Clip: Auto-extract 5-10 best moments
↓
CapCut: Add captions, effects, platform styling
↓
Distribute: TikTok, Reels, Shorts, LinkedIn
The most powerful setup combines tools that agents can control directly:
Agent writes script (from product context)
↓
Hyperframes: Generate templated video (HTML → MP4)
and/or
HeyGen MCP: Generate avatar video from script
and/or
Veo/Runway API: Generate B-roll footage
↓
Agent assembles final cut
↓
Output: Ready-to-publish video
What makes this agent-native:
| Tool | Type | MCP | Guide |
|---|---|---|---|
| HeyGen | AI avatars | Yes | heygen.md |
| Hyperframes | Programmatic video | - | hyperframes.md |
| Remotion | Programmatic video | - | remotion.dev |
| Runway | AI generation | - | runwayml.com/docs |