homesafe-bench
SharpAI/DeepCamera
This benchmark evaluates Vision-Language Models (VLMs) on detecting potential safety hazards in indoor environments. It uses static camera frames to simulate monitoring by a fixed security camera. The test covers 40 scenarios across 5 critical categories: fire/smoke, electrical risks, trips/falls, child safety, and falling objects. It is intended to measure how well a VLM performs practical safety inspection.
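As a rough illustration of how results over these categories might be summarized, the sketch below computes per-category accuracy from a list of scored scenarios. It is not the benchmark's own scoring code: the category identifiers, the `score_by_category` helper, the binary hazard/no-hazard labels, and the toy records are all assumptions made for this example.

```python
from collections import defaultdict

# The five categories come from the description above; these identifier
# spellings are hypothetical and chosen only for this sketch.
CATEGORIES = (
    "fire_smoke",
    "electrical",
    "trip_fall",
    "child_safety",
    "falling_objects",
)


def score_by_category(results):
    """Return accuracy per category.

    `results` is an iterable of (category, expected, predicted) tuples,
    where `expected` and `predicted` are booleans meaning "hazard present".
    This binary labeling is an assumption, not the benchmark's format.
    """
    totals = defaultdict(int)
    correct = defaultdict(int)
    for category, expected, predicted in results:
        if category not in CATEGORIES:
            raise ValueError(f"unknown category: {category}")
        totals[category] += 1
        if expected == predicted:
            correct[category] += 1
    # Only categories that actually appear in `results` are reported.
    return {c: correct[c] / totals[c] for c in totals}


# Toy records for illustration only; these are not benchmark data.
toy_results = [
    ("fire_smoke", True, True),
    ("fire_smoke", True, False),
    ("electrical", False, False),
]
scores = score_by_category(toy_results)
print(scores)  # {'fire_smoke': 0.5, 'electrical': 1.0}
```

Reporting accuracy per category rather than as a single number keeps a weakness in one hazard type (say, electrical risks) from being hidden by strong results elsewhere.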