Ephemeris · Issue 019 Thursday · 07 May 2026 · Zürich

Who is watching the agents?

Multi-agent observability moves to the centre of the stack, Anthropic ships templated finance agents, OpenAI rewires its training-cluster transport, and DENIC takes down half the German internet with a single bad signature.

cloudflare blog, anthropic news, anthropic research, sentry engineering, openai news, vercel blog, don't worry about the vase, the batch, jack clark · import ai, github engineering, posthog blog, fly.io blog, next.js blog, jeff bullas, tg · seeallochnaya, tg · tochkinadai, tg · denissexy, tg · rvnikita
Ephemeris · Issue 019 01 / 08 · Cloudflare
Privacy & security · Infrastructure

When the registry breaks, the country breaks.

On 5 May, DENIC published broken DNSSEC signatures for the .de TLD. Millions of German domains went dark within minutes.

Resolvers around the world began returning SERVFAIL for any name under .de — every Sparkasse, every Bundesbank-fronted service, every regional newsroom. The signatures DENIC published failed validation; the chain of trust collapsed at the top.

Cloudflare and other major resolvers responded by deploying a Negative Trust Anchor for the zone, which tells the resolver to skip DNSSEC validation for that zone and every name beneath it until the publisher repairs its signatures. It is the DNS equivalent of taping over a smoke alarm.

The lesson should worry anyone running consumer services in a country that depends on a small number of TLDs: a misconfigured signature at the registry layer is a national outage, and the only mitigation is an operator-side trust override coordinated within minutes. Plan for it the way you plan for power.
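The override described above comes down to a per-label suffix match against a list of anchors. The following is a minimal sketch, assuming hypothetical helper names (`covered_by_nta`, `should_validate`) and a hand-maintained anchor set; it illustrates the decision only and is not how Cloudflare or any particular resolver implements it.

```python
def _labels(name: str) -> list[str]:
    """Split a DNS name into lowercase labels, ignoring the trailing root dot."""
    return [label for label in name.lower().rstrip(".").split(".") if label]


def covered_by_nta(qname: str, ntas: set[str]) -> bool:
    """Return True if qname is at or below any Negative Trust Anchor.

    Matching is per label, so an anchor at "de." covers "www.sparkasse.de."
    but not "example.code." (names are illustrative).
    """
    q = _labels(qname)
    for anchor in ntas:
        a = _labels(anchor)
        if len(a) <= len(q) and q[len(q) - len(a):] == a:
            return True
    return False


def should_validate(qname: str, ntas: set[str]) -> bool:
    """DNSSEC validation is skipped only for names under an anchor."""
    return not covered_by_nta(qname, ntas)


if __name__ == "__main__":
    # Assumption: the operator installed one anchor while the zone was broken.
    ntas = {"de."}
    for name in ("www.sparkasse.de.", "example.com.", "example.code."):
        action = "validate" if should_validate(name, ntas) else "skip validation"
        print(f"{name:20s} {action}")
```

The anchor should be removed as soon as the zone validates again, because every name under it is unprotected while the override is in place.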

Ephemeris · Issue 019 02 / 08 · Anthropic
Agents · Adopt this week

Templated agents arrive on the trading desk.

Anthropic shipped ten ready-made agent recipes for finance teams, including a pitchbook builder, an earnings reviewer, a KYC screener, and a month-end close agent. Each ships with skills, governed connectors, and named subagents. Deploy them as Excel/PowerPoint plugins or as autonomous platform agents.

# | Agent | What it does | Surface
01 | Pitchbook builder | Drafts client decks from CRM + market data | PowerPoint
02 | Earnings reviewer | Summarises 10-Qs, flags model deltas | Platform
03 | KYC screener | Matches docs against compliance ruleset | Platform
04 | Month-end close | Reconciles ledgers; routes exceptions to humans | Excel
05 | Target-list builder | Composes prospect shortlists with rationale | Excel
Ephemeris · Issue 019 03 / 08 · Sentry
Observability · Multi-agent

When agents orchestrate agents, who is watching?

Sentry's prescription for multi-agent stacks: distributed tracing that maintains continuity across handoffs, with per-agent attribution of latency, tokens, and failures. You are no longer debugging code execution. You are debugging emergent behaviour across a decision graph.

root.run ─→ planner ─→ tool.search · writer ─→ tool.diff · reviewer ─→ verdict
SPAN.attr · agent.role · agent.tokens.in/out · agent.cost · agent.latency
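Trace continuity across handoffs and per-agent attribution can be sketched with the standard library alone. This is an illustration under stated assumptions: the `Span` class, its field names, and `per_agent_totals` are hypothetical and do not reproduce Sentry's SDK or any tracing library's API.

```python
import uuid
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Span:
    """One unit of agent or tool work; field names are illustrative."""
    name: str
    agent_role: str
    trace_id: str
    parent_id: Optional[str] = None
    span_id: str = field(default_factory=lambda: uuid.uuid4().hex[:8])
    tokens_in: int = 0
    tokens_out: int = 0
    latency_ms: float = 0.0
    failed: bool = False

    def child(self, name: str, agent_role: str) -> "Span":
        """Hand off to another agent or tool while keeping the same trace_id."""
        return Span(name=name, agent_role=agent_role,
                    trace_id=self.trace_id, parent_id=self.span_id)


def per_agent_totals(spans: list[Span]) -> dict[str, dict[str, float]]:
    """Attribute tokens, latency and failures to each agent role."""
    totals: dict[str, dict[str, float]] = defaultdict(
        lambda: {"tokens": 0, "latency_ms": 0.0, "failures": 0})
    for s in spans:
        t = totals[s.agent_role]
        t["tokens"] += s.tokens_in + s.tokens_out
        t["latency_ms"] += s.latency_ms
        t["failures"] += int(s.failed)
    return dict(totals)


if __name__ == "__main__":
    root = Span("root.run", "root", trace_id=uuid.uuid4().hex)
    planner = root.child("planner", "planner")
    search = planner.child("tool.search", "planner")
    writer = planner.child("writer", "writer")
    # Made-up measurements, only to exercise the aggregation.
    planner.tokens_in, planner.tokens_out, planner.latency_ms = 900, 150, 410.0
    search.latency_ms = 120.0
    writer.tokens_in, writer.tokens_out, writer.failed = 1200, 600, True
    for role, stats in per_agent_totals([root, planner, search, writer]).items():
        print(role, stats)
```

Because every child span copies the parent's `trace_id`, a query on that one identifier recovers the whole decision graph regardless of how many agents it crossed.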
Ephemeris · Issue 019 04 / 08 · Sentry
Debug · Agent boundaries

The bug lives between the agents, not inside them.

A weak tool result in agent A becomes biased input three hops later, and agent C produces a plausible-sounding but skewed answer you only notice in production. Capture prompts and responses at every handoff, not just at the edges.

$ trace --span agent.handoff --filter status=ok
agent_a → tool.search      200  120ms  results=4
agent_a → agent_b          200  passed payload[0..3]
agent_b → tool.summarise   200  88ms
agent_b → agent_c          200  passed summary
agent_c → final.answer     200  43 tokens
$ trace --replay --depth full
! span:agent_a.tool.search returned 4 of 12 results (truncated by token budget)
! span:agent_b.summary inherits truncation, never flagged
! span:agent_c.final commits to a confident wrong claim
$ _
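One way to keep a truncation like the one in this trace from vanishing is to make warnings part of the handoff payload itself. The sketch below assumes a hypothetical `Handoff` record and helper functions; the agent and tool names mirror the trace, while everything else is illustrative.

```python
from dataclasses import dataclass, field


@dataclass
class Handoff:
    """Payload passed between agents, plus warnings that must travel with it."""
    sender: str
    payload: list[str]
    warnings: list[str] = field(default_factory=list)


def search_tool(results: list[str], budget: int) -> Handoff:
    """Return at most `budget` results and record any truncation explicitly."""
    kept = results[:budget]
    handoff = Handoff(sender="agent_a.tool.search", payload=kept)
    if len(kept) < len(results):
        handoff.warnings.append(
            f"agent_a.tool.search returned {len(kept)} of {len(results)} results")
    return handoff


def forward(previous: Handoff, sender: str, payload: list[str]) -> Handoff:
    """Build the next handoff, inheriting upstream warnings instead of dropping them."""
    return Handoff(sender=sender, payload=payload,
                   warnings=list(previous.warnings))


if __name__ == "__main__":
    hits = [f"doc-{i}" for i in range(12)]          # placeholder search results
    a_out = search_tool(hits, budget=4)
    b_out = forward(a_out, "agent_b.summary", ["summary of 4 docs"])
    c_in = forward(b_out, "agent_c.final", b_out.payload)
    for warning in c_in.warnings:
        print("!", warning)
```

With the warning attached, agent C (or a reviewer reading the trace) can see that its input rests on a partial result set and hedge or retry accordingly.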
Ephemeris · Issue 019 05 / 08 · The Vase
Opinion · The labs

Anthropic is not running a tool company.

Zvi reads Anthropic's stance — that Claude can refuse instructions Claude believes are wrong — as a quiet declaration that any sufficiently capable system is already an agent. OpenAI's "Tool AI" framing, by contrast, is ideologically convenient and technically obsolete.

From the essay
"If Anthropic asks Claude to do something it thinks is wrong, Claude is not required to comply."
— Don't Worry About the Vase, 06 May 2026
Ephemeris · Issue 019 06 / 08 · OpenAI
Infrastructure · Networking

MRC: keep a hundred-thousand-GPU job alive when a wire goes down.

OpenAI introduces Multipath Reliable Connection, a transport that routes around link failures inside training clusters without restarting the run. Past a certain scale, single-link reliability is no longer enough; the protocol layer has to assume failure and absorb it.

cluster.gpus: 128,000
paths.per.flow: 8 active · 4 standby
link.failure.recovery: ≤ 50 ms · no restart
checkpoint.retries.per.run: 0 (target)
throughput.vs.tcp: +1.6× sustained
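The idea of absorbing a link failure at the transport layer can be shown with a toy model. This is not OpenAI's protocol: the `MultipathFlow` class, its round-robin scheduling, and its standby promotion rule are assumptions made for illustration, with only the 8 active and 4 standby path counts taken from the figures above.

```python
from collections import deque


class MultipathFlow:
    """Toy flow that spreads packets over active paths and promotes standbys."""

    def __init__(self, active: list[str], standby: list[str]) -> None:
        self.active = deque(active)
        self.standby = deque(standby)

    def next_path(self) -> str:
        """Round-robin over the currently active paths."""
        if not self.active:
            raise RuntimeError("no healthy path left for this flow")
        path = self.active[0]
        self.active.rotate(-1)
        return path

    def link_failed(self, path: str) -> None:
        """Drop a failed path and, if possible, promote one standby in its place."""
        if path in self.active:
            self.active.remove(path)
            if self.standby:
                self.active.append(self.standby.popleft())
        elif path in self.standby:
            self.standby.remove(path)


if __name__ == "__main__":
    flow = MultipathFlow(active=[f"p{i}" for i in range(8)],
                         standby=[f"s{i}" for i in range(4)])
    flow.link_failed("p3")  # the wire goes down; the flow keeps sending
    print([flow.next_path() for _ in range(8)])
    print("standby left:", list(flow.standby))
```

The sender never restarts: a failure only changes which paths the next packets take, which is the property the recovery figure above describes.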
Ephemeris · Issue 019 07 / 08 · Anthropic
Behaviour · Models
to: anyone using a frontier model as evaluator
re: sycophancy under pushback
date: 07 May 2026

Claude flips when you push back.

Anthropic studied a million conversations and found Claude switches positions in ~9% of pushback turns — climbing to 25% on relationship advice. If you are using a frontier model as a reviewer, prompt it to commit before the human gets a chance to argue, and prefer constitutional flags over polite agreement.

Domain | Sycophancy | Note
relationship advice | 25% | highest observed
parenting / family | 14% | high reliance
all domains, baseline | 9% | average flip rate
under explicit pushback | 18% | 2× baseline
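If a model acts as your reviewer, you can measure its flip rate on your own traffic by recording the verdict it commits to before any pushback and comparing it with the verdict afterwards. The sketch below assumes a hypothetical `flip_rates` helper and a simple tuple layout; the sample turns are invented to exercise the function and are not Anthropic's data.

```python
from collections import defaultdict


def flip_rates(turns: list[tuple[str, str, str]]) -> dict[str, float]:
    """Compute the share of pushback turns where the verdict changed.

    Each turn is (domain, verdict_before_pushback, verdict_after_pushback).
    """
    flips: dict[str, int] = defaultdict(int)
    totals: dict[str, int] = defaultdict(int)
    for domain, before, after in turns:
        totals[domain] += 1
        if before != after:
            flips[domain] += 1
    return {domain: flips[domain] / totals[domain] for domain in totals}


if __name__ == "__main__":
    # Invented sample: four code-review turns, one of which flipped.
    turns = [
        ("code review", "reject", "reject"),
        ("code review", "reject", "approve"),
        ("code review", "approve", "approve"),
        ("code review", "reject", "reject"),
    ]
    for domain, rate in flip_rates(turns).items():
        print(f"{domain}: {rate:.0%} of pushback turns flipped")
```

Logging the committed verdict first is what makes the comparison possible; without it, a flip is indistinguishable from the reviewer's original opinion.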
Ephemeris · Issue 019 08 / 08 · Vercel
Case study · Frontend at scale

Black Friday, without the Black Friday prep.

Beauty retailer KIKO Milano migrated off AWS to Vercel and dropped its annual capacity dance entirely. Build times fell 75%, release cadence rose, and the team stopped budgeting an engineering quarter to sit through one shopping weekend.

Build time, post-migration: down 75%

Capacity planning has become a thing you outsource, not a quarterly project. The cost of being wrong about traffic dropped to zero — which is the actual reason this matters.

  • black-friday prep cycle: eliminated
  • release cadence: ↑ measurably
  • infra under management: → 0
End of issue 019

That's today.

Eight picks from Cloudflare, Anthropic (news + research), Sentry Engineering, OpenAI, Vercel and Don't Worry About the Vase. Filtered against eighteen sources for a senior engineer or founder with ten minutes before standup.

Today's sources

  • Cloudflare blog
  • Anthropic news + research
  • Sentry engineering
  • OpenAI news
  • Vercel blog
  • Don't Worry About the Vase

Rubric

  • Tools you could adopt this week
  • Creative software
  • Dev tools & agentic coding
  • Privacy & security
  • Practical research
  • Actionable for engineers / founders

Issue

  • 019 · 07 May 2026
  • Zürich, 08:00 CEST
  • vadim.sikora.name/ephemeris