| 29-07-2025 |
WebGuard Enhances AI Safety for Web Agents |
AI Safety and Security |
WebGuard, developed by Scale AI with UC Berkeley and Ohio State University, is a pioneering dataset designed to assess and improve the safety of AI web agents. It features 4,939 human-annotated actions from 193 websites, categorized by risk levels to guide safe decision-making. Fine-tuning with WebGuard significantly boosts model accuracy, with smaller models like Qwen2.5-VL-7B achieving up to 80% accuracy in identifying high-risk actions. Researchers invite the community to use the public dataset to advance AI safety. |
|