Front-page articles summarized hourly.
LEVI is an LLM-based evolutionary framework that reduces ADRS cost by harness-first design: most mutations are done by small, cheap models (e.g., QWEN 30B) while larger models are reserved for paradigm shifts. It combines stratified model allocation with improved diversity across both code structure and behavior, stored in a CVT-MAP-Elites archive to prevent convergence. LEVI delivers state-of-the-art ADRS performance at 3–7× lower cost (often $4–$15 per problem) and outperforms OpenEvolve and GEPA on benchmarks; lessons stress code over text, more cheap evaluations, and nightly re-optimization for bespoke deployments.
Venezuela's interim government under Delcy Rodríguez plans to acknowledge about $240 billion in sovereign and PDVSA debt, far above market estimates, setting the stage for the largest debt restructuring in history. Rodríguez seeks a deal with creditors by year-end to regain access to international markets, with Centerview Partners advising. The economy is now about $100B, down from $370B in 2012, and IMF involvement is non-signatory though technical contacts are ongoing. Debt includes government and PDVSA bonds around $60B plus $40B interest, plus claims to oil firms, suppliers, and loans from China and Russia. Analysts doubt a 2026 deal; many expect 2027.
Rahul Bathija analyzes PR data for openclaw/openclaw, noting a surge from two PRs/week to ~3,400/week and a drop in merge rate from ~48% to <9.3%. Many were AI-generated, including mass submissions. The article argues sender reputation and filters (e.g., Vouch) will become essential, and that diversity of thought matters: PRs requiring deep understanding survive more than simple feature additions. It also highlights duplicates: several contributors submitting identical feature PRs and multiple fixes for the same bug. The author concludes open source will need stronger identity, reputation, and contribution validation as speed increases.
Mike Taylor shares his all-time favourite image: a stitched, high-resolution, head-to-tail sequence of the Mamenchisaurus hochuanensis neck from the 2017 Dinosaurs Of China exhibition at Wollaton Hall. SV-POW! attendees visited the show; Matt photographed the sequence; Jarrod Davis used visual-effects stitching to assemble it into a long, scrollable image (3171 x 7931). View full-size in fullscreen. The image is for a paper Taylor is working on and will be printed extra-large for his office. doi:10.59350/b6svd-p5w82. Thanks to Jarrod and Matt.
Nature published a peer-reviewed critique by Dr Henry Legg challenging Microsoft's quantum breakthrough claims around Majorana. Legg argues the 2025 paper is flawed, citing basic Python errors and omitted raw data that, he says, undermine evidence for a topological quantum phase transition and a workable qubit. He points to a hardcoded plotting filter and an array-index-based error that could hide multiple viable regions. Legg contends the team cherry-picked results to fit their thesis. Microsoft disputes the critique, defending their results and roadmap, while noting Majorana 2 is not proven or available as a qubit. The debate persists.
A detailed inventory of Edsger W. Dijkstra's personal library and archives, based on a 2011 visit to his Nuenen home. The author catalogues 16 boxes of material sent from Texas, including books, booklets, lecture notes, manuscripts, posters, obituaries and tributes, and extensive correspondence (notably with Ole-Johan Dahl, C.A.R. Hoare, Naur, and others), as well as EWDs. The collection also contains memorials, photographs, and personal documents, preserving Dijkstra's intellectual influences (e.g., E.T. Bell) and the development of his ideas.
John Carmack reflects on early id Software mistakes: Quake was too technically ambitious; a Doom++ base could have supported multiplayer/modding with a stable core, while a follow-up could have added 6DOF. He admits pushing people too hard and underestimating startup burnout, and that the founders’ stock plan created bad incentives—vesting would have helped. Expecting level designers to also be top visual designers bred infighting; pairing artists and designers earlier would have helped. He also apologizes to Sandy Petersen amid the Quake 30th-anniversary praise.
Pure Effect is a zero-dependency library for JavaScript/TypeScript that makes business logic return plain I/O descriptions (data) rather than performing I/O. This enables testing by reading the intended commands and replaying failed production runs without touching a database until execution. It defines six primitive shapes (Success, Failure, Command, Ask, Retry, Parallel) that compose into effect trees and a small runtime (runEffect, effectPipe, configureEffect) to interpret them. It supports AI-generated flows and emphasizes read-before-run auditing, testability, and no mocks.
Porting Wine to Astral enables Cogmind to run on a 64-bit OS via WoW64, adding 32-bit support and PE DLLs. EGL required, so Mesa was patched to enable EGL and switched to the DRI backend; X.org startup was adjusted to work without /dev/dri. Kernel and Wine tweaks (LDT handling, dispatch fix) let Cogmind run, aside from news/scoresheet upload. Other tests: FTL playable; Steam mostly works but Chromium and others flaky; many apps partially or not working (Factorio, Spooky’s Jumpscare Mansion, Noita, Plants vs. Zombies, Half-Life, Firefox/Chromium, SCP, Unity). Goals: more optimizations and Steam/Chromium work.
Nub is a fast all-in-one Node.js toolkit that augments Node (not replaces it). Written in Rust with a TypeScript-first interface, it delivers a Bun-like DX on stock Node, including TypeScript support, JSX/TSX, modern polyfills, and built-in data loaders. It auto-installs the project’s Node version and provides a single CLI with: nub run (script runner), nubx/nub dlx (package runner), nub install (compat-mode package manager), nub pm (Corepack-like shim), and nub node (Node version manager).
RubyLLM is a Ruby framework that unifies major AI providers under a single interface, enabling chat, image generation, embeddings, transcription, moderation, tools, and agents. It supports streaming, Rails integration, async scaling, and a model registry with 800+ models. With just Faraday, Zeitwerk, and Marcel as dependencies, you can use GPT, Claude, Ollama, and more via a consistent API. The docs include example usage, multi-file analysis, tool invocation, agent workflows, and a Rails installation flow.
Slate Auto sells The Blank Slate, the most affordable EV truck, with essentials from $24,950 and flexible customization: two-seat pickup or five-seat SUV, wraps and accessories (200+ options, most under $500), starter packs, and DIY-friendly Slate U for easy customization. Fleet options available. Ordering via preorder; in-person events. Charging works with a standard 120V or 240V outlet and 29,000 Tesla Superchargers nationwide. Sign up for updates to hear about new features and drops.
Texas artist Daniel “Des” Sanchez Estrada was sentenced to 30 years in federal prison for transporting a box of zines; his wife, Maricela Rueda, received 70 years. Prosecutors said Sanchez moved the pamphlets to protect his wife, who attended a protest outside the Prairieland detention center near Dallas where a police officer was wounded. The zines did not address the shooting, and no one was charged for it. The Freedom of the Press Foundation criticized the verdict as a First Amendment suppression of dissent.
TechCrunch reports Mark Zuckerberg wants Meta to launch a Polymarket-style prediction market app, internally dubbed Arena. It would be independent from Meta’s main social apps, though cross-promotion could occur. Described as experimental but a top priority, the app would start non-monetary, with users earning points for correct bets on topics and money potentially added later. Prediction markets have grown, with Polymarket and Kalshi handling tens of billions in trading and facing legal and political scrutiny. Arena’s development is uncertain but prioritized.
Practical, hands-on guide to SSH tunnels (local and remote port forwarding) by Ivan Velichko. It covers ssh -L, -R, -D and -R with no destination (dynamic), including bastion/jump-host usage and GatewayPorts. Through labs and diagrams, it shows how to access private services from a laptop, expose local services to the Internet, and proxy browsers or debuggers via a remote gateway. It also offers mnemonics, troubleshooting tips, and a recap: local port forwards remote:local, remote forwards local:remote, and dynamic SOCKS proxies.
Stealing, the author argues, can be a deliberate skill for fast value creation and self-discovery. He cites Virgil Abloh’s 3% approach—study a design in depth, then alter only about 3%—to force full understanding of the original. He and a partner applied it by rebuilding Mintlify’s marketing site for their project, pixel by pixel, learning why choices work and iterating their own branding. In under a month they shipped a Framer site. The piece stresses learning from others, using 3% as guidance—not a rule—and turning stealing into a creative tool.
Christoffersen and Diebold's 1997 working paper argue that for long-horizon forecasts, enforcing cointegration does not outperform ignoring it when using standard multivariate forecast accuracy measures. In fact, simple univariate Box–Jenkins forecasts can be as accurate as multivariate cointegrated forecasts. They also note that common accuracy metrics fail to value maintaining cointegrating relationships and propose alternatives that explicitly reward preserving cointegration.
Jason Pargin argues that culture treats robots as more alive and morally superior than humans, and that our brains are hardwired to anthropomorphize anything that talks back. He stresses he is not pro-AI and dislikes the tech, citing scams and harms, while noting AI is already deeply integrated into daily life. He lists five observations: 1) robots seem more alive; 2) we desire life in machines; 3) he hates AI more than others; 4) many people already have bot companionship; 5) AI is humanity’s new frontier because space exploration is unlikely. He promotes his new book preorder.
Reid Hoffman argues SpaceX isn’t an AI company and xAI is a “complete train wreck,” while there’s room for both Anthropic and OpenAI. He criticizes SpaceX’s AI strategy and core compute business as not truly AI-capable, notes xAI’s co-founder exodus and Grok lag, and blasts the government’s forced Anthropic pull of Fable/Mythos as autocratic. He sees AI winners in multiple lanes—Anthropic in code/design/legal; OpenAI as a consumer-front AI. Hoffman is still invested in both, questions Cursor’s peak, and is shifting toward AI-driven drug discovery with Manas AI after leaving Microsoft’s board.
Made by Johno Whitaker using FastHTML