As one of the defining technologies of this century, artificial intelligence (AI) seems to see daily advancements: new entrants to the field, technological breakthroughs, and creative and innovative applications. The AI security landscape moves at the same breakneck pace, with streams of newly proposed legislation, novel vulnerability discoveries, and emerging threat vectors.
While the speed of change is exciting, it creates practical barriers to enterprise AI adoption. As our Cisco 2024 AI Readiness Index points out, business leaders frequently cite concerns about AI security as a primary roadblock to embracing the full potential of AI in their organizations.
That’s why we’re excited to introduce our inaugural State of AI Security report. It provides a succinct, straightforward overview of some of the most important developments in AI security from the past year, along with trends and predictions for the year ahead. The report also shares clear recommendations for organizations looking to improve their own AI security strategies, and highlights some of the ways Cisco is investing in a safer future for AI.
Here’s an overview of what you’ll find in our first State of AI Security report:
Evolution of the AI Threat Landscape
The rapid proliferation of AI and AI-enabled technologies has introduced a massive new attack surface that security leaders are only beginning to contend with.
Risk exists at virtually every step of the AI development lifecycle; AI assets can be directly compromised by an adversary or discreetly compromised through a vulnerability in the AI supply chain. The State of AI Security report examines several AI-specific attack vectors, including prompt injection attacks, data poisoning, and data extraction attacks. It also reflects on how adversaries use AI to improve cyber operations such as social engineering, supported by research from Cisco Talos.
Looking at the year ahead, cutting-edge advancements in AI will undoubtedly introduce new risks for security leaders to be aware of. For example, the rise of agentic AI, which can act autonomously without constant human supervision, seems ripe for exploitation. At the same time, the scale of social engineering threatens to grow tremendously, exacerbated by powerful multimodal AI tools in the wrong hands.
Key Developments in AI Policy
The past year has seen significant advancements in AI policy, both domestically and internationally.
In the United States, a fragmented state-by-state approach has emerged in the absence of federal regulations, with over 700 AI-related bills introduced in 2024 alone. Meanwhile, international efforts have led to key developments, such as the UK and Canada’s collaboration on AI safety and the European Union’s AI Act, which came into force in August 2024 and set a precedent for global AI governance.
Early actions in 2025 suggest a greater focus on balancing the need for AI security with accelerating the speed of innovation. Recent examples include President Trump’s executive order and growing support for a pro-innovation environment, which aligns well with themes from the AI Action Summit held in Paris in February and the UK’s recent AI Opportunities Action Plan.
Original AI Security Research
The Cisco AI security research team has led and contributed to several pieces of groundbreaking research, which are highlighted in the State of AI Security report.
Research into algorithmic jailbreaking of large language models (LLMs) demonstrates how adversaries can bypass model protections with zero human supervision. This technique can be used to exfiltrate sensitive data and disrupt AI services. More recently, the team explored automated jailbreaking of advanced reasoning models like DeepSeek R1, demonstrating that even reasoning models can still fall victim to traditional jailbreaking techniques.
The team also explores the safety and security risks of fine-tuning models. While fine-tuning is a popular method for improving the contextual relevance of AI, many are unaware of its inadvertent consequences, such as model misalignment.
Finally, the report reviews two pieces of original research into poisoning public datasets and extracting training data from LLMs. These studies shed light on how easily, and how cost-effectively, a bad actor can tamper with or exfiltrate data from enterprise AI applications.
Recommendations for AI Security
Securing AI systems requires a proactive and comprehensive approach.
The State of AI Security report outlines several actionable recommendations, including managing security risks throughout the AI lifecycle, implementing strong access controls, and adopting AI security standards such as the NIST AI Risk Management Framework and the MITRE ATLAS matrix. We also look at how Cisco AI Defense can help businesses adhere to these best practices and mitigate AI risk from development to deployment.
Read the State of AI Security 2025
Ready to read the full report? You can find it here.
