Login
Download
Skill UI
Browse and discover
6283+
curated skills
All
Development
Artificial Intelligence
Design & Creative
Product & Business
Data Science
Marketing
Soft Skills
Productivity
Engineering
Languages
Search
Kernel
, found
1
results
Default
Newest
Most Downloaded
AWQ Weight Quantization
awq-quantization
Orchestra-Research/AI-Research-SKILLs
82
AWQ provides activation-aware 4-bit quantization for large language models, delivering ~3x inference speedup and sub-5% accuracy loss to deploy instruction-tuned or multimodal models on memory-constrained GPUs with vLLM integration and Marlin kernels.
View Details
1
Language
简体中文
English