
At its core, they stated, “AI crimson teaming strives to push past model-level security benchmarks by emulating real-world assaults in opposition to end-to-end techniques. Nonetheless, there are various open questions on how crimson teaming operations ought to be performed and a wholesome dose of skepticism in regards to the efficacy of present AI crimson teaming efforts.”
The paper famous that, when it was shaped in 2018, the Microsoft AI Crimson Crew (AIRT) targeted totally on figuring out conventional safety vulnerabilities and evasion assaults in opposition to classical ML fashions. “Since then,” it stated, “each the scope and scale of AI crimson teaming at Microsoft have expanded considerably in response to 2 main traits.”
The primary, it stated, is that AI has turn out to be extra subtle, and the second is that Microsoft’s latest investments in AI have resulted within the improvement of many extra merchandise that require crimson teaming. “This improve in quantity and the expanded scope of AI crimson teaming have rendered totally handbook testing impractical, forcing us to scale up our operations with the assistance of automation,” the authors wrote.
