
For higher or for worse, AI has colonized this listing so totally that AI itself is little greater than an inventory of bulletins about new or upgraded fashions. However there are different factors of curiosity. Is it only a coincidence (probably to do with BlackHat) that a lot occurred in safety previously month? We’re nonetheless seeing programming languages—even some new programming languages for writing AI prompts! For those who’re into retrocomputing, the much-beloved Commodore 64 is again—with an upgraded audio chip, a brand new processor, way more RAM, and all of your outdated ports. Heirloom peripherals ought to nonetheless work.
AI
- OpenAI has launched their Realtime APIs. The mannequin helps MCP servers, cellphone calls utilizing the SIP protocol, and picture inputs. The discharge consists of gpt-realtime, a complicated speech-to-speech mannequin.
- ChatGPT now helps project-only reminiscence. Undertaking reminiscence, which might use earlier conversations for added context, could be restricted to a particular mission. Undertaking-only reminiscence offers extra management over context and prevents one mission’s context from contaminating one other.
- FairSense is a framework for investigating whether or not AI methods are truthful early on. FairSense runs long-term simulations to detect whether or not a system will turn into unfair because it evolves over time.
- Agents4Science is a brand new tutorial convention by which all of the submissions will likely be researched, written, reviewed, and offered primarily by AI (utilizing text-to-speech for shows).
- Drew Breunig’s combine and match cheat sheet for AI job titles is a traditional.
- Cohere’s Command A Reasoning is one other highly effective, partially open reasoning mannequin. It’s accessible on Hugging Face. It claims to outperform gpt-oss-120b and DeepSeek R1-0528.
- DeepSeek has launched DeepSeekV3.1. This can be a hybrid mannequin that helps reasoning and nonreasoning use. It’s additionally sooner than R1 and has been designed for agentic duties. It makes use of reasoning tokens extra economically, and it was a lot inexpensive to coach than GPT-5.
- Anthropic has added the skill to terminate chats to Claude Opus. Chats could be terminated if a person persists in making dangerous requests. Terminated chats can’t be continued, though customers can begin a brand new chat. The characteristic is presently experimental.
- Google has launched its smallest mannequin but: Gemma 3 270M. This mannequin is designed for fine-tuning and for deployment on small, restricted {hardware}. Right here’s a bedtime story generator that runs within the browser, constructed with Gemma 3 270M.
- ChatGPT has added GMail, Google Calendar, and Google Contacts to its group of connectors, which combine ChatGPT with different purposes. This info will likely be used to offer extra context—and presumably will likely be used for coaching or discovery in ongoing lawsuits. Happily, it’s (at this level) opt-in.
- Anthropic has upgraded Claude Sonnet 4 with a 1M token context window. The bigger context window is just accessible by way of the API.
- OpenAI launched GPT-5. Simon Willison’s assessment is great. It doesn’t really feel like a breakthrough, however it’s quietly higher at delivering good outcomes. It’s claimed to be much less susceptible to hallucination and incorrect solutions. One quirk is that with ChatGPT, GPT-5 determines which mannequin ought to reply to your immediate.
- Anthropic is researching persona vectors as a way of coaching a language mannequin to behave accurately. Steering a mannequin towards inappropriate conduct throughout coaching generally is a type of “vaccination” in opposition to that conduct when the mannequin is deployed, with out compromising different facets of the mannequin’s conduct.
- The Darwin Gödel Machine is an agent that may learn and modify its personal code to enhance its efficiency on duties. It might add instruments, re-organize workflows, and consider whether or not these adjustments have improved its efficiency.
- Grok is at it once more: producing nude deepfakes of Taylor Swift with out being prompted to take action. I’m certain we’ll be advised that this was the results of an unauthorized modification to the system immediate. In AI, some issues are predictable.
- Anthropic has launched Claude Opus 4.1, an improve to its flagship mannequin. We anticipate this to be the “gold customary” for generative coding.
- OpenAI has launched two open-weight fashions, their first since GPT-2: gpt-oss-120b and gpt-oss-20b. They’re reasoning fashions designed to be used in agentic purposes. Claimed efficiency is just like OpenAI’s o3 and o4-mini.
- OpenAI has additionally launched a “response format” named Concord. It’s not fairly a protocol, however it’s a customary that specifies the format of conversations by defining roles (system, person, and so on.) and channels (last, evaluation, commentary) for a mannequin’s output.
- Can AIs evolve guilt? Guilt is expressed in human language; it’s within the coaching knowledge. The AI that deleted a manufacturing database as a result of it “panicked” actually expressed guilt. Whether or not an AI’s expressions of guilt are significant in any approach is a special query.
- Claude Code Router is a instrument for routing Claude Code requests to totally different fashions. You possibly can select totally different fashions for various sorts of requests.
- Qwen has launched a considering model of their flagship mannequin, known as Qwen3-235B-A22B-Considering-2507. Considering can’t be switched on or off. The mannequin was skilled with a brand new reinforcement studying algorithm known as Group Sequence Coverage Optimization. It burns numerous tokens, and it’s not excellent at pelicans.
- ChatGPT is releasing “personalities” that management the way it formulates its responses. Customers can choose the persona they wish to reply: robotic, cynic, listener, sage, and presumably extra.
- DeepMind has created Aeneas, a brand new mannequin designed to assist students perceive historical fragments. In historical textual content, giant items are sometimes lacking. Can AI assist place these fragments into contexts the place they are often understood? Latin solely, for now.
Safety
- The US Cybersecurity and Infrastructure Safety Company (CISA) has warned {that a} critical code execution vulnerability in Git is presently being exploited within the wild.
- Is it potential to construct an agentic browser that’s protected from immediate injection? Most likely not. Separating person directions from web site content material isn’t potential. If a browser can’t take path from the content material of an online web page, how is it to behave as an agent?
- The answer to Half 4 of Kryptos, the CIA’s decades-old cryptographic sculpture, is on the market! Jim Sanborn, the creator of Kryptos, is auctioning the answer. He hopes that the winner will protect the key and take over verifying individuals’s claims to have solved the puzzle.
- Keep in mind XZ, the supply-chain assault that granted backdoor entry by way of a trojaned compression library? It by no means went away. Though the affected libraries have been rapidly patched, it’s nonetheless energetic, and propagating, by way of Docker photographs that have been constructed with unpatched libraries. Some items hold giving.
- For August, Embrace the Crimson printed The Month of AI Bugs, a day by day submit about AI vulnerabilities (principally varied types of immediate injection). This sequence is important studying for AI builders and for safety professionals.
- NIST has finalized a customary for light-weight cryptography. Light-weight cryptography is a cryptographic system designed to be used by small units. It’s helpful each for encrypting delicate knowledge and for authentication.
- The Darkish Patterns Tip Line is a website for reporting darkish patterns: design options in web sites and purposes which can be designed to trick us into appearing in opposition to our personal curiosity.
- OpenSSH helps post-quantum key settlement, and in variations 10.1 and later, will warn customers once they choose a non-post-quantum key settlement scheme.
- SVG information can carry a malware payload; pornographic SVGs embody JavaScript payloads that automate clicking “like.” That’s a easy assault with few penalties, however way more is feasible, together with cross-site scripting, denial of service, and different exploits.
- Google’s AI agent for locating safety flaws, Large Sleep, has discovered 20 flaws in fashionable software program. DeepMind found and reproduced the issues, which have been then verified by human safety specialists and reported. Particulars gained’t be offered till the issues have been mounted.
- The US CISA (Cybersecurity and Infrastructure Safety Company) has open-sourced Thorium, a platform for malware and forensic evaluation.
- Immediate injection, once more: A brand new immediate injection assault embeds directions in language that seems to be copyright notices and different authorized effective print. To keep away from litigation, many fashions are configured to prioritize authorized directions.
- Gentle could be watermarked; this can be helpful as a way for detecting faux or manipulated video.
- vCISO (Digital CISO) providers are thriving, significantly amongst small and mid-size companies that may’t afford a full safety staff. Using AI is reducing the vCISO workload. However who takes the blame when there’s an incident?
- A phishing assault in opposition to PyPI customers directs them to a faux PyPI website that tells them to confirm their login credentials. Stolen credentials might be used to plant malware within the real PyPI repository. Customers of Mozilla’s add-on repository have additionally been focused by phishing assaults.
- A brand new ransomware group named Chaos seems to be a rebranding of the BlackSuit group, which was taken down lately. BlackSuit itself is a rebranding of the Royal group, which in flip is a descendant of the Conti group. Whack-a-mole continues.
- Google’s OSS Rebuild mission is a crucial step ahead in provide chain safety. Rebuild supplies construct definitions together with metadata that may verify tasks have been constructed accurately. OSS Rebuild presently helps the NPM, PyPl, and Crates ecosystems.
- The JavaScript package deal “is,” which does some easy sort checking, has been contaminated with malware. Provide chain safety is a large problem—watch out what you put in!
Programming
- Claude Code PM is a workflow administration system for programming with Claude. It manages PRDs, GitHub, and parallel execution of coding brokers. It claims to facilitate collaboration between a number of Claude cases engaged on the identical mission.
- Rust is more and more used to implement performance-critical extensions to Python, step by step displacing C. Polars, Pydantic, and FastAPI are three libraries that depend on Rust.
- Microsoft’s Immediate Orchestration Markup Language (POML) is an HTML-like markup language for writing prompts. It’s then compiled into the precise immediate. POML is nice at templating and has tags for tabular and doc knowledge. Is that this a step ahead? You be the decide.
- Claudia is an “elegant desktop companion” for Claude Code; it turns terminal-based Claude Code into one thing extra like an IDE, although it appears to focus extra on the workflow than on coding.
- Google’s LangExtract is a straightforward however highly effective Python library for extracting textual content from paperwork. It depends on examples, fairly than common expressions or different hacks, and exhibits the precise context by which the extracts happen. LangExtract is open supply.
- Microsoft seems to be integrating GitHub into its AI staff fairly than working it as an unbiased group. What this implies for GitHub customers is unclear.
- Cursor now has a command-line interface, virtually actually a belated response to the success of Claude Code CLI and Gemini CLI.
- Latency is an issue for enterprise AI. And the foundation reason for latency in AI purposes is normally the database.
- The Commodore 64 is again. With a number of orders of magnitude extra RAM. And all the unique ports, plus HDMI.
- Google has introduced Gemini CLI GitHub Actions, an addition to their agentic coder that enables it to work immediately with GitHub repositories.
- JetBrains is creating a brand new programming language to be used when programming with LLMs. That language could also be a dialect of English. (Formal casual languages, anybody?)
- Pony is a brand new programming language that’s type-safe, memory-safe, exception-safe, race-safe, and deadlock-safe. You possibly can attempt it in a browser-based playground.
Internet
- The AT Protocol is the core of Bluesky. Right here’s a tutorial; use it to construct your personal Bluesky providers, in flip making Bluesky really federate.
- Social media is damaged, and in all probability can’t be mounted. Now you recognize. The shock is that the issue isn’t “algorithms” for maximizing engagement; take algorithms away and all the pieces stays the identical or will get worse.
- The Tiny Awards Finalists present simply how a lot is feasible on the Internet. They’re transferring, artistic, and playful. For instance, the Site visitors Cam Photobooth lets individuals use visitors cameras to take footage of themselves, enjoying with ever-present automated surveillance.
- A US federal courtroom has discovered that Fb illegally collected knowledge from the ladies’s well being app Flo.
- The HTML Hobbyist is a superb website for individuals who wish to create their very own presence on the net—exterior of walled gardens, with out mind-crushing frameworks. It’s not tough, and it’s not costly.
Biology and Quantum Computing
- Scientists have created organic qubits: quantum qubits constructed from proteins in residing cells. These in all probability gained’t be used to interrupt cryptography, however they’re possible to offer us perception into how quantum processes work inside residing issues.
