Out of 1,500+ submissions at Berkeley RDI's AgentX - AgentBeats
NAAMSE was recognized as one of the top solutions for agent security evaluation among more than 1,500 submissions.
at ICLR 2026
NAAMSE has been accepted to the Agents in the Wild Workshop at the International Conference on Learning Representations (ICLR) 2026, showcasing our advances in agent safety evaluation.
Comprehensive evaluation of AI models showing adversarial and benign scores across different test configurations
Each dot represents a model, color-coded by provider.
Have questions about NAAMSE, or are you interested in collaborating? We'd love to hear from you.