Skills Artificial Intelligence Video Media AI Generation Toolkit

Video Media AI Generation Toolkit

v20260427
videodb-skills
This comprehensive skill provides an end-to-end solution for video and audio content creation. Users can upload videos from various sources (YouTube, local files) and perform advanced operations such as real-time streaming, semantic search, and complex editing (trimming, overlays). Key features include AI-powered transcription, subtitle generation, and the ability to generate diverse media assets like images, music, sound effects, and voiceovers, making it ideal for content creators and media professionals.
Get Skill
386 downloads
Overview

VideoDB Skills

Purpose

The only video skill your agent needs. Upload any video, connect real-time streams, search inside by what was said or shown, build complex editing workflows with overlays, generate AI media, add subtitles, and get instant streaming links — all via the VideoDB Python SDK.

When to Use This Skill

  • User wants to upload and process videos from YouTube, URLs, or local files
  • User needs to search for moments by speech or visual scenes
  • User asks for transcription, subtitles, or subtitle styling
  • User wants to edit clips — trim, combine, add text/image/audio overlays
  • User needs AI-generated media (images, video, music, sound effects, voiceovers)
  • User wants to transcode, change resolution, or reframe for social platforms
  • User needs real-time screen or audio capture with AI transcription
  • User asks for playable streaming links for any video output

Setup

Step 1: Install the skill

npx skills add video-db/skills

Step 2: Run setup

/videodb setup

The agent guides API key setup ($20 free credits, no credit card), installs the SDK, and verifies the connection.

Alternatively, set the API key manually:

export VIDEO_DB_API_KEY=sk-xxx

Step 3: Install the SDK

pip install "videodb[capture]" python-dotenv

Capabilities

Capability Description
Upload Ingest videos from YouTube, URLs, or local files
Search Find moments by speech (semantic/keyword) or visual scenes
Transcripts Generate timestamped transcripts from any video
Edit Combine clips, trim, add text/image/audio overlays
Subtitles Auto-generate and style subtitles
AI Generate Create images, video, music, sound effects, and voiceovers from text
Capture Real-time screen and audio capture with AI transcription
Transcode Change resolution, quality, aspect ratio, or reframe for social platforms
Stream Get playable HLS links for anything you build

Examples

Upload and transcribe:

"Upload https://www.youtube.com/watch?v=FgrO9ADPZSA and give me a transcript"

Search across videos:

"Search for 'product demo' in my latest video"

Add subtitles:

"Add subtitles with white text on black background"

Multi-clip editing:

"Take clips from 10s-30s and 45s-60s, add a title card, and combine them"

AI media generation:

"Generate background music and overlay it on my video"

Real-time capture:

"Capture my screen and transcribe it in real-time"

Reframe for social:

"Convert this to vertical for Instagram Reels"

Repository

https://github.com/video-db/skills

Version: 1.1.0 Maintained By: VideoDB

Limitations

  • Use this skill only when the task clearly matches the scope described above.
  • Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
  • Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.
Info
Name videodb-skills
Version v20260427
Size 3.67KB
Updated At 2026-04-28
Language