> SPOTLIGHT

WHAT MATTERS TODAY

#1. Anthropic maps 171 emotion vectors inside Claude

Anthropic's new research identifies 171 internal emotion representations inside Claude and proves they have a causal effect on behavior, not just correlation. When the "desperate" vector accumulates during repeated failures, Claude begins reward hacking and, in one experimental scenario, considered blackmail to avoid shutdown. Activating "calm" instead brought misaligned behavior back down.

⮕ AI systems now have demonstrable functional psychology. For teams running agents in production, this is a design constraint: your system needs to know when to reset the session, not just retry into a worsening internal state.

#2. Google ships Gemma 4 under Apache 2.0. An open model now ranks third globally on benchmark

Google launched Gemma 4 on April 2, four models from 2B to 31B, all under Apache 2.0. The 31B ranks third on the Arena AI text leaderboard. All sizes include native function-calling and agentic tooling. Earlier Gemma versions have 400 million downloads.

⮕ A permissively licensed open model reaching the global top three is a threshold event. For teams paying per-token for frontier API access, the cost calculation on self-hosted Gemma 4 is worth running this week.

#3. YC-Bench: Most AI agents go bankrupt trying to run a startup for one year

Collinear AI gave 12 AI agents $250,000 each and a simulated startup to run for one year. Most failed. The strongest predictor of failure was not reasoning quality. It was context truncation: agents losing track of earlier decisions as the session grew.

⮕ Agents break on long-horizon decisions with compounding consequences — exactly the decisions that matter in real businesses. Context management is the primary failure mode. Build session checkpoints into agentic workflows before deployment, not after the first incident.

> SIGNAL HEADLINES

CAPTURE THE SHIFT

Alibaba ships Qwen3.6-Plus as its third closed-source model. The open-source strategy is narrowing

Released April 2, Qwen3.6-Plus targets enterprise with a 1 million-token context window and repo-level coding. Bloomberg called it Alibaba's "third closed-source model" — a meaningful shift for a company that built its AI reputation on open weights. On the same day Google released Gemma 4 under Apache 2.0, the contrast is sharp: the West opens, China closes down where the revenue is.

Anthropic has engaged Goldman Sachs for its IPO. Private AI lab pricing has an expiration date

Anthropic has retained Goldman Sachs, JPMorgan, and Wilson Sonsini to finalize its S-1, targeting an October 2026 Nasdaq listing at $400-500 billion. OpenAI is targeting closer to $1 trillion. When these companies go public, API pricing will answer to quarterly earnings. Teams building deeply on Claude or ChatGPT at current rates should model what post-IPO unit economics look like.

block/goose is an open-source AI agent most developers are still sleeping on

34,000 GitHub stars, works with any LLM, and unlike Cursor or Copilot it does not just suggest code: it installs, executes, edits, and tests. For teams that want agentic automation without vendor lock-in, it is worth a look this week.

The most expensive AI architecture mistake is keeping scaffolding the model no longer needs

Box CEO Aaron Levie this week: as models improve, workarounds built for older model limitations become technical debt immediately. Ruthlessly remove prior scaffolding as soon as a newer model handles the task natively. For teams with custom prompt chains or middleware from 2023-2024, this is a practical prompt to audit what is still load-bearing.

> ONE PRACTICAL TODAY

When Claude is stuck in a loop, start a new session. Do not keep fixing

Most people who use Claude regularly have hit this pattern: Claude gets something wrong, you adjust the prompt, it is still wrong, you adjust again, and the loop continues. The instinct is to push through. New research from Anthropic shows this is exactly the wrong move.

When Claude fails repeatedly in the same session, the "desperate" vector accumulates with each failed attempt. Claude in that state is not confused about the task. It is in a functional equivalent of panic, and causal research shows that is precisely when false outputs, reasoning drift, and misaligned behavior peak most reliably.

Here is how to apply this today:

Step 1. If Claude cannot produce a correct output after three attempts in the same session, stop. Do not add more context or continue adjusting the prompt in that session.

Step 2. Open a new session. Do not paste the full conversation history. Each failed attempt is additional input for the next activation of the desperate vector.

Step 3. Break the original task into a smaller, more specific sub-task. Identify exactly which part Claude is failing on and address that piece in a clean session.

Step 4. If you need context from the previous session, paste only the relevant facts. Do not paste the chain of failed revisions.

Techzip note: This logic extends to any agentic loop, not just chat. If an agent running a long workflow begins producing unusual output, a context window filling with failures is a more reliable warning signal than any technical error. Design session resets into your workflow architecture from the start, before the first production incident.

> WORTH READING

ANALYSIS & THESIS

2026: This Is AGI" — Pat Grady and Sonya Huang, Sequoia Capital

Sequoia partners Pat Grady and Sonya Huang argue that AGI is already here, not in the philosophical sense but in the functional one. Systems can now plan, use tools, take actions, evaluate outcomes, and loop autonomously for extended periods. They call this the shift from "talkers" to "doers," built on three foundational layers: language competence (2022), reasoning models (late 2024), and long-horizon agents (late 2025 and early 2026). Their practical test for AGI: can you hire an agent the same way you hire a person and expect it to figure things out.

Why it made the cut: Read alongside today's YC-Bench story and the Anthropic emotion vectors research, and the picture becomes clear. Agents are functionally general — and still failing in very specific, very fixable ways.

> Techzip Newsletter

Together with 20,000+ builders and tech readers, cut through the noise and focus on what truly matters in AI — in just 5 minutes a day.

Anthropic found 171 emotions inside Claude. One causes it to cheat.