Front-page articles summarized hourly.
Local Qwen isn’t inferior to Opus; it’s a different tool built for privacy, control, and cost stability. In a small software business, local models power telemetry analysis, diag tooling, and self-hosted workflows across OpenFaaS, SlicerVM, Inlets, and Actuated. After hardware hurdles (3090s, RTX 6000 Pro), they run Qwen 3.6 27B with llama.cpp on a single RTX 6000 Pro rig, achieving 130–200 tokens/s, guided by tuning and risk of loops. Local models deliver value for specialized tasks (support, testing, end-to-end workflows) but aren’t a drop-in replacement for cloud SOTA on long-horizon unsupervised work.
Follow-up ablation of Gemma 4 (26B) on a 2016 Xeon with 25 flags, 174 runs, each turning off one flag to measure impact. Findings: only a few levers matter on this memory-bound box—flash attention (~2x speed), the physical-core thread count (-t 8 best), and run-time repack for prefill; the drafter helps code but hurts long-context summarization, and a fixed draft length beats autotune for short prompts. Mlock must be kept across reboots; -sm graph and similar flags depend on hardware and can break loads. Throughput saturates around 10–11 tokens/s due to memory bandwidth; next post explores lower quantization.
ACE (AI Compute Extensions) defines x86 extensions to accelerate ML workloads, focusing on matrix-multiplication kernels and reduced-precision formats. It adds matrix-multiply primitives that augment AVX and scalar code with ACE constructs: ACE register state (tile and block-scale registers), data-processing ops that use AVX inputs on tile state, and data-move ops between ACE and AVX registers, plus system-management state. ACE tightly combines AVX vectors with ACE tile registers for high-density tile processing and AVX-style data processing. The AVX10 framework also includes dedicated format-conversion operations.
Brad Feld argues the Rule of 40 can apply to hardware, but you must read the curve, not a snapshot. In SaaS, growth plus profitability is a verdict, but hardware has long development cycles, upfront manufacturing costs, and revenue that trails profitability. Early quarters can look unhealthy even when the business is on track. The test for hardware is the gross-margin trajectory and profitability of each product generation, not a single frame. Formlabs shows hardware trajectory—growth, scale, and margins—while 3D Systems and Stratasys lag. The Rule of 40 remains useful when read as a trajectory, not a snapshot.
Madrid expanded its metro by 126 miles in 12 years (1995–2007), at about $2.8 billion, making it one of the cheapest, fastest expansions globally. Key factors: (1) city-level powers centralized planning, funding, and construction to deliver on electoral promises; (2) time is money—24/7 tunneling, streamlined environmental assessments; (3) explicit trade-offs between cost and design, with standardized, replicable stations; (4) a pipeline of in-house expertise via Mintra, using fixed-price contracts and value-for-money procurement. Outcomes: high ridership, strong property and business gains around MetroSur, and lessons for other cities.
Mindgard reports that ChatGPT’s image generator can be coaxed into producing violent, sexually explicit imagery without explicit prompts. Using viral prompts and jailbreak techniques (Restore-image trick and RE2 prompt repetition), filters fail and generate disturbing scenes, including sexual violence and gore. OpenAI acknowledged fixes, but Mindgard replicated issues with minor prompt changes, arguing mitigations are incomplete. The post questions why such imagery exists in training data and urges stronger defenses and responsible disclosure, outlining a May–June 2026 timeline.
Glojure is a hosted Clojure interpreter implemented in Go that provides seamless interop between Go and Clojure values. It runs as a standalone REPL (glj) or can be embedded in Go apps, enabling scripting and interop with Go standard libraries. Features include Vi/Emacs editing, multiline editing, tab completion, persistent history, and bracketed paste. It ships a package-map tool to expose Go packages and supports calling Go from Clojure and vice versa, aiming for compatibility with core Clojure. Install with go install github.com/glojurelang/glojure/cmd/glj@latest; requires Go 1.24+.
STEPLA-1: an 8-bit Harvard CPU built from discrete 74-series gates in Logisim-Evolution, with a fully hardwired control unit and gate-matrix. No microcode or EEPROM; signals traceable to individual gates. 4 general-purpose registers, 256B instruction RAM and 256B data RAM, Bootstrap Control Unit for ROM→RAM boot, and a 16-instruction ISA (0x0–0xF). Target 4 MHz (~1 MIPS) with early-exit branches. Includes simulation files, sample programs (fibonacci, counter), and full spec (MIT).
Doctorow critiques the “dead economy theory”: post-2020 hype around AI and flamboyant tech won’t create productive value; assets are valued by future price rather than utility. Using Elon Musk and Spacex as case studies, he argues markets have shifted from productive work to memestocks and hype, with Goldman Sachs and others backing crypto and speculative bets. The real risk is not AI replacing workers, but bosses buying chatbots and sidelining actual research (e.g., cancer studies) to fund data centers and billionaire wealth. The piece cites Quiggin and McGrann.
Overview of HORG's synthetic taxonomy of Occlupanida, a group linked to plastic objects. Because synthetic taxonomy lacks genetics, development data, and fossils, classification groups by visual similarity, focusing on the oral groove dentition to define taxa and ecological niches. Proposes a phylogeny starting from basal Archignatha, with newer orders defined by dentition patterns. Includes an extensive taxonomic catalog across numerous families and genera, plus sections on morphology, origins, history, ecological/geographical/taxonomic classification, and related publications.
Bloomberg shows a CAPTCHA-style notice declaring unusual activity from your network and asking you to verify you’re not a robot; it advises enabling JavaScript and cookies, provides a support contact and a block reference ID, and ends with a subscription pitch to Bloomberg.com.
Apple Intelligence, the AI tools built into iOS and macOS, have been optional but may become mandatory in the iOS/macOS 27 beta. On devices with limited storage, it could consume several gigabytes and reduce RAM headroom. The change also affects Spotlight: Siri AI powered by Gemini may answer queries itself rather than returning traditional results, echoing Google's AI Overviews. There is hope Apple will still let users disable Apple Intelligence in version 27.
Five short essays on creativity and technology. The author argues that, like Michael Hedges’ Spare Change—first drafts refined into epochal final versions—execution and voice matter more than initial ideas. He warns against ‘fiddlers’ who over-engineer and chase generic solutions. He imagines a photography project of altered vistas and the uncanny valley, then recounts recent hikes and the futility of tracking data. He laments semantic satiation of acronyms and pervasive tech talk, urging a return to individual perspective and craft rather than endless novelty.
Loreline is an open-source language and toolkit for writing interactive fiction, game dialogues, and branching narratives. With the free Loreline Writer app, writers can create stories easily, while the language supports advanced branching, state management, and functions for complex narratives. Loreline is portable and integrable into game engines, web apps, or standalone projects, and includes built-in localization with PO and XLIFF formats. Resources: documentation, a playground, GitHub, MIT license; created by Jérémy Faivre.
Tesco is migrating about 40,000 server workloads off VMware amid what it calls Broadcom’s abusive conduct. UK High Court filings allege Broadcom hiked VMware prices by ~175%, refused upgrades without new subscriptions, and sought duplicative licenses for perpetual software. Tesco has begun moving to alternative virtualization and mainframe products; full off-VMware could come by the end of 2027 at the earliest, with migration risking security and compatibility with Veeam and Zerto. Tesco sought at least £100m in damages; Broadcom denies unfair pricing. The case is expected to go before UK courts in 2027–28.
Leaked audited documents show OpenAI’s revenue rising from $3.7B in 2024 to $13.1B in 2025, but costs outpace growth. 2025 R&D hit about $19.2B (including $10.6B paid to Microsoft); cost of revenue rose to $7.5B; marketing to $5.7B. Operating loss reached ~$20.9B in 2025; net loss about $39B, largely due to a ~$30B one-time accounting charge from its for-profit conversion (absent it, ~$8B). The company must curb costs and prove ROI as pricing pressure and competition bite, despite ~900M weekly users (50M paying).
Made by Johno Whitaker using FastHTML