This skill generates images using Google Gemini's image generation model (gemini-3-pro-image-preview). It enables seamless image creation during any Claude Code session - whether you're building frontend UIs, creating documentation, or need visual representations of concepts.
Cross-Platform: Works on Windows, macOS, and Linux.
Automatically activate this skill when:
# Basic usage
python scripts/generate_image.py "A futuristic city skyline at sunset"
# With custom output path
python scripts/generate_image.py "A minimalist app icon for a music player" "./assets/icons/music-icon.png"
# With custom size
python scripts/generate_image.py --size 2K "High resolution landscape" "./wallpaper.png"
GEMINI_API_KEY environment variable must be setGenerated images are saved as PNG files. The script returns:
User: "I need a hero image for my landing page - something abstract and tech-focused"
-> Generates and saves image, provides path for use in HTML/CSS
User: "Create a diagram showing microservices architecture"
-> Generates visual representation, ready for README or docs
User: "Generate a placeholder avatar image for the user profile component"
-> Creates image in appropriate size for component use