Login
Download
Skill UI
Browse and discover
9183+
curated skills
All
Development
Artificial Intelligence
Design & Creative
Product & Business
Data Science
Marketing
Soft Skills
Productivity
Engineering
Languages
Search
Image Captioning
, found
2
results
Default
Newest
Most Downloaded
Blip2 Vision Language
blip-2-vision-language
Orchestra-Research/AI-Research-SKILLs
350
Bridges frozen vision encoders and LLMs for high-quality image captioning, VQA, retrieval, and multimodal chat, enabling zero-shot reasoning without fine-tuning and offering efficient Q-Former training.
View Details
LLaVA Vision Assistant
llava
Orchestra-Research/AI-Research-SKILLs
396
LLaVA is an open-source vision-language assistant pairing CLIP vision encoders with Vicuna/LLaMA models to power multimodal chatbots, multi-turn image Q&A, captioning, and instruction-following tasks.
View Details
1
Language
简体中文
English