技能 编程开发 AI图像生成技能

AI图像生成技能

v20260423
imagen
该技能利用Google Gemini强大的图像生成模型,能够创建高质量、上下文相关的视觉内容。它在开发中用途广泛,特别适用于前端开发中的占位符图片、技术文档的流程图和插图,以及任何需要可视化概念、架构图或UI资产的场景。
获取技能
311 次下载
概览

Imagen - AI Image Generation Skill

Overview

This skill generates images using Google Gemini's image generation model (gemini-3-pro-image-preview). It enables seamless image creation during any Claude Code session - whether you're building frontend UIs, creating documentation, or need visual representations of concepts.

Cross-Platform: Works on Windows, macOS, and Linux.

When to Use This Skill

Automatically activate this skill when:

  • User requests image generation (e.g., "generate an image of...", "create a picture...")
  • Frontend development requires placeholder or actual images
  • Documentation needs illustrations or diagrams
  • Visualizing concepts, architectures, or ideas
  • Creating icons, logos, or UI assets
  • Any task where an AI-generated image would be helpful

How It Works

  1. Takes a text prompt describing the desired image
  2. Calls Google Gemini API with image generation configuration
  3. Saves the generated image to a specified location (defaults to current directory)
  4. Returns the file path for use in your project

Usage

Python (Cross-Platform - Recommended)

# Basic usage
python scripts/generate_image.py "A futuristic city skyline at sunset"

# With custom output path
python scripts/generate_image.py "A minimalist app icon for a music player" "./assets/icons/music-icon.png"

# With custom size
python scripts/generate_image.py --size 2K "High resolution landscape" "./wallpaper.png"

Requirements

  • GEMINI_API_KEY environment variable must be set
  • Python 3.6+ (uses standard library only, no pip install needed)

Output

Generated images are saved as PNG files. The script returns:

  • Success: Path to the generated image
  • Failure: Error message with details

Examples

Frontend Development

User: "I need a hero image for my landing page - something abstract and tech-focused"
-> Generates and saves image, provides path for use in HTML/CSS

Documentation

User: "Create a diagram showing microservices architecture"
-> Generates visual representation, ready for README or docs

UI Assets

User: "Generate a placeholder avatar image for the user profile component"
-> Creates image in appropriate size for component use

Limitations

  • Use this skill only when the task clearly matches the scope described above.
  • Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
  • Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.
信息
Category 编程开发
Name imagen
版本 v20260423
大小 2.79KB
更新时间 2026-04-24
语言