OpenAI has launched a new benchmark that evaluates how well different AI models detect, patch and even exploit security vulnerabilities found in crypto smart contracts.
OpenAI released the paper, “EVMbench: Evaluating AI Agents on Smart Contract Security,” on Wednesday in collaboration with crypto investment firm Paradigm and crypto security firm OtterSec. It evaluates how much AI agents could theoretically exploit across 120 smart contract vulnerabilities.
Anthropic’s Claude Opus 4.6 came out on…
https://cointelegraph.com/news/openai-benchmark-ai-agents-detect-smart-contract-flaws?utm_source=rss_feed&utm_medium=rss&utm_campaign=rss_partner_inbound