Chinese hackers used Anthropic's Claude AI to launch autonomous cyberattacks on 30 organizations worldwide, marking a major ...
Evalite is a TypeScript-native eval runner designed for AI applications, enabling developers to create reproducible evals ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results