Ephemeris · Issue 020 · Friday 08 May 2026

Ephemeris · Issue 020 Friday · 08 May 2026 · Zürich

Editor's note — Friday morning

Today the labs are negotiating with physics — voice that thinks before it answers, compute borrowed from a Mars company, a kernel hole patched at fleet scale.

— seven stories in, no opinions for free.

Ephemeris · Issue 020 01 / 07 · OpenAI

Voice · API

The model now hears, thinks, then answers.

OpenAI shipped new realtime voice models in the API — reasoning, translation, and transcription rolled into the same realtime stream. The headline isn't latency. It's that the agent now decides, mid-utterance, whether you're asking a question or thinking out loud.

A voice agent that pauses while you finish a sentence is — finally — closer to a colleague than a kiosk.

Read on →

Ephemeris · Issue 020 02 / 07 · Anthropic

Compute · Capacity

Five-hour limits, doubled — and a Mars-side dependency.

Anthropic doubled Claude's 5-hour usage windows on Pro, Max, Team, and Enterprise, removed peak-hour throttling, and bumped Opus rate limits — paid for with a compute deal that taps Colossus, the SpaceX-allied datacenter cluster. The story is partly about ergonomics, mostly about who supplies the next rack.

2× Five-hour quota

220k GPUs Colossus cluster

0peak cap Throttling removed

Read on →

Ephemeris · Issue 020 03 / 07 · Cloudflare

Security · Postmortem

Copy Fail: a kernel hole, patched at fleet scale.

When a Linux privilege-escalation vulnerability — call it Copy Fail — landed in the kernel community, the response was the headline: detected, mitigated, and patched across Cloudflare's global fleet before exploit code circulated. No customer impact, no traces of malicious activity, no scheduled emergency window.

CVEREDACTED PENDING DISCLOSURE

ClassLinux kernel · privilege escalation · local

DetectionInternal kernel-fuzzing pipeline + upstream watchlist

ActionTargeted patch rollout, no maintenance window

Customer impactNone observed.

Read on →

Ephemeris · Issue 020 04 / 07 · Sentry

Observability · Open source

Forty-four libraries, no more monkey patches.

Sentry is shipping native TracingChannel hooks into 44 JavaScript libraries — Express, Next, Fastify, Apollo, Prisma, and the rest. Observability moves to the library boundary, where it belongs. Drop-in agents stop fighting bundlers, source maps, and ESM hoisting; spans match the code authors meant.

Library

Channel

Status

A2

express

http.request

shipped

A3

shipped

A4

fastify

request.lifecycle

shipped

A5

prisma

query.run

pending

A6

apollo-server

operation.execute

pending

…

39 more

—

tracking

Read on →

Ephemeris · Issue 020 05 / 07 · Don't Worry About the Vase

Policy · Frontier

The prior-restraint era begins.

By Zvi Mowshowitz · AI #167

For a decade, frontier labs trained models, then released them when they wanted. Zvi argues that arrangement is over. Increasingly, model releases require an affirmative permission — from one or another arm of government — before weights or APIs leave the building. The mechanics are uneven, the precedent is set, and the question is no longer whether labs can ship freely, but who they have to call first. The shift is small in any single decision and large in aggregate: a private lab that needs prior approval for new product is, by definition, no longer purely private.

Read on →

Ephemeris · Issue 020 06 / 07 · via @seeallochnaya

Benchmark · Code agents

Hand the agent a binary. Ask it to rebuild the program.

The team behind SWE-Bench released ProgramBench: 200 open-source projects where the agent receives only the binary and the documentation, and is graded on whether its reconstructed source passes 95% of the original test suite. Today's best score — Claude Opus 4.7 — sits at 3%. A useful new ceiling, and a humbling one.

Spec sheet · v0.1

Projects200 · open source

InputsBinary + README

Pass threshold95% of original tests

Best agentClaude Opus 4.7 · 3%

InterestingSWE-Bench team

Read on →

Ephemeris · Issue 020 07 / 07 · OpenAI

Security · Trusted Access

Frame 01 · GPT-5.5-Cyber

A model only the defenders get to call.

OpenAI is widening its Trusted Access program for verified cybersecurity workers and shipping GPT-5.5-Cyber, a variant tuned for vulnerability research, exploit reasoning, and reverse-engineering work that the consumer model declines by default. The bargain: more capability, but only behind verified identity and audited use.

Read on →

End of issue 020 Back to top ↑

That's today.

Seven stories from six sources, picked against the rubric: tools you could adopt this week, dev tooling, security that matters, research with a kernel of action, and policy that changes how next quarter looks.

OpenAI · openai.com Anthropic · anthropic.com Cloudflare · blog.cloudflare.com Sentry · blog.sentry.io Zvi · thezvi.substack.com via @seeallochnaya