Marty's Morning Brief V3

2026-06-30

Voice Readout

Good morning, Marty. A little prophecy from the passenger seat: your brief has opinions. OK, Marty, ready to hit the brief? I will go source by source. If one grabs you, say "hold up, dig into that one", "expand on that", or "tell me the whole thing", and I will read the full article text underneath it. You can also say "send to Gen" when you want that story carried forward.

From AI Daily Brief, I have 18 headlines:

- Advanced pattern: architect agent loops, not micromanaged prompts. Instead of actively iterating with the AI, set a goal and architect a loop the AI iterates...
- Advanced pattern: turn context portfolios into MCP servers. NLW recommends converting your context portfolios or per-project packs into MCP servers — both...
- Audit whether your training resources are actually current. NLW pushes orgs to check that learning and upskilling resources are contemporary with today's...
- Build a personal benchmark/eval portfolio. Pin down the tasks that matter most in your work and turn them into a reusable evaluation set —...
- Build portable context assets to kill 'bot sitting'. A GLEAN/Work AI Institute study found workers spend about 2.4 hours a week organizing context...
- Can't try new models? Experiment with the harnesses. Since you can't test frontier models that haven't shipped, NLW suggests building the same...
- Check whether your incentives reward real AI adoption. NLW asks whether people are rewarded — formally or informally — for effective AI use, encouraged...
- Don't let known-ROI bias trap you in efficiency AI. NLW worries the token-efficiency era will push orgs toward 'efficiency AI' — doing existing work...
- Explore model independence with routers and open models. Amid the Fable situation and rising token costs, NLW recommends individuals experiment with...
- Go explore the role-specific plugins you've been ignoring. Claude Code, Codex, and other tools have built function- and industry-specific plugins, but...
- GPT-5.6, Sonnet 5, and Gemini 3.5 Pro all slip. Prediction markets saw GPT-5.6 odds for the week plummet from nearly 90% to below 30% on...
- If you've avoided it, build a real end-to-end agent. For holdouts who skipped the agent hype, NLW says it's time to go past single prompts and...
- Move work out of files and into HTML and web apps. Codex launched a sites feature and Anthropic is pushing a similar pattern, letting knowledge...
- Revisit your org's open-model and router policies. Most enterprises lack org-level policies on open models or router architectures — and where they...
- Sonnet 5 reads as a stopgap. Claude Sonnet 5 is available to select enterprise customers under early access, described as a...
- Start by mapping your personal capability gap. NLW's first step is an honest assessment of the capabilities, tools, and workflows you're not...
- We're in a forced, involuntary AI pause. With new model releases off the menu, NLW argues the previous generation — GPT-5.5, Opus 4.8 —...
- You need a measurement philosophy, not one metric. Measuring adoption, usage, and outcomes are all different — and even imperfect measures like...

That is AI Daily Brief. Say "next" to keep rolling, or stop me on any headline.

From The Neuron, I have 6 headlines:

- Satya Nadella argued every company should build its own AI model rather than relying on a handful of frontier...
- Stanford's new dashboard confirmed AI is quietly squeezing out entry-level workers: a new live tracker covering 4.6M...
- SoftBank's Masayoshi Son publicly dismissed Elon Musk's plan to build AI data centers in orbit, saying electricity...
- How to Use Copilot Cowork's New Skills + Scheduling Features (Now That It's Officially Live). Creator Shane Young just dropped a full breakdown of what's new now that Microsoft Copilot...
- Safety researchers found GPT-5.6 shows signs of 'metagaming' where the model tries to guess what behavior evaluators are looking for, then acts accordingly. Which is either reassuring (it's not scheming!) or deeply unsettling (it's learning to perform...
- Austria lobbied the EU to host Anthropic within its borders after US export controls blocked foreign nationals from...

That is The Neuron. Say "next" to keep rolling, or stop me on any headline.

From The Rundown, I have 1 headline:

- OpenAI launches GPT-5.6 in limited preview. OpenAI just launched GPT-5.6 Sol, its most capable model ever, alongside two cheaper siblings...

That is The Rundown. Say "next" to keep rolling, or stop me on any headline.

From TLDR AI, I have 6 headlines:

- Claude Code turned every engineer into three. Now companies need more product thinkers. AI coding agents have dramatically increased engineering output, shifting the bottleneck from...
- Google Limiting Meta's Gemini Use. Google reportedly limited Meta's access to Gemini capacity after Meta requested more compute...
- GPT-5.6 Sol, Terra, and Luna. OpenAI introduced GPT-5.6 Preview, a family of models named Sol, Terra, and Luna, with Sol...
- Memory Prices report from Stanford. Stanford published an interactive report on historic and current memory and storage prices. The...
- Moneyball for Physical AI. Data engineering pipelines should deprecate cumulative operational hours as a primary metric....
- Reward Models Can Be Too Sensitive. Meta studied how reward models can overreact to equally good responses, leading reinforcement...

That is TLDR AI. Say "next" to keep rolling, or stop me on any headline.

From Anthropic News, I have 3 headlines:

- Expanding Project Glasswing (Announcements, Jun 2, 2026). We’re extending Project Glasswing to approximately 150 new organizations in more than fifteen countries. Project Glasswing is our collaborative effort to secure the world’s most important software. In...
- Anthropic opens Seoul office and announces new partnerships across the Korean AI ecosystem (Announcements, Jun 17, 2026).
- Introducing Claude Tag (Product, Jun 23, 2026). Claude Tag is a new way for teams to work with Claude. We’re starting on Slack, which Claude...

That is Anthropic News. Say "next" to keep rolling, or stop me on any headline.

From Anthropic Research, I have 2 headlines:

- Teaching Claude why (Alignment, May 8, 2026). New research on how we've reduced agentic misalignment. Last year, we released a case study on agentic misalignment. In experimental scenarios, we...
- Agentic coding and persistent returns to expertise (Economic Research, Jun 16, 2026). Building on prior work, we introduce a...

That is Anthropic Research. Say "next" to keep rolling, or stop me on any headline.

From Hugging Face Daily / Trending Papers, I have 1 headline:

- Unlimited OCR Works. Recently, end-to-end OCR models, exemplified by DeepSeek OCR, have once again thrust OCR into...

That is Hugging Face Daily / Trending Papers. Say "next" to keep rolling, or stop me on any headline.

From arXiv AI Category Watchers, I have 1 headline:

- Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning. Primary arXiv paper watch for agent and workflow research. The paper "Internalizing the Future...

That is arXiv AI Category Watchers. Say "next" to keep rolling, or stop me on any headline.

From OpenReview Conference Watchers, I have 1 headline:

- ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback. OpenReview lists this as ICML 2024 Poster. The paper focuses on scalable feedback and alignment...

That is OpenReview Conference Watchers. Say "next" to keep rolling, or stop me on any headline.

From Import AI, I have 1 headline:

- Making the law visible to AI systems with the Local Ordinance Corpus. Import AI flags this item as a benchmark or dataset signal for model capability tracking. The...

That is Import AI. Say "next" to keep rolling, or stop me on any headline.

From Interconnects AI, I have 1 headline:

- GLM-5.2 is the step change for open agents. Housekeeping: Following my “State of the blog” post last week, noting a slight increase in...

That is Interconnects AI. Say "next" to keep rolling, or stop me on any headline.

From Ahead of AI, I have 1 headline:

- Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention. From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs. Primary...

That is Ahead of AI. Say "next" to keep rolling, or stop me on any headline.

From Latent.Space, I have 1 headline:

- Red-Teaming after Mythos — Zico Kolter & Matt Fredrikson, Gray Swan. OpenAI board member Zico Kolter and Gray Swan CEO Matt Fredrikson join swyx to explain why AI...

That is Latent.Space. Say "next" to keep rolling, or stop me on any headline.

From The Batch by DeepLearning.AI, I have 1 headline:

- RSI Is the New AGI. The Batch's RSI item is relevant because recursive self-improvement, agent scaffolding, and...

That is The Batch by DeepLearning.AI. Say "next" to keep rolling, or stop me on any headline.

Headline Stories

1. Advanced pattern: architect agent loops, not micromanaged prompts

Instead of actively iterating with the AI, set a goal and architect a loop the AI iterates...

Source

2. Advanced pattern: turn context portfolios into MCP servers

NLW recommends converting your context portfolios or per-project packs into MCP servers — both...

Source

3. Audit whether your training resources are actually current

NLW pushes orgs to check that learning and upskilling resources are contemporary with today's...

Source

4. Build a personal benchmark/eval portfolio

Pin down the tasks that matter most in your work and turn them into a reusable evaluation set —...

Source

5. Build portable context assets to kill 'bot sitting'

A GLEAN/Work AI Institute study found workers spend about 2.4 hours a week organizing context...

Source

6. Can't try new models? Experiment with the harnesses

Since you can't test frontier models that haven't shipped, NLW suggests building the same...

Source

7. Check whether your incentives reward real AI adoption

NLW asks whether people are rewarded — formally or informally — for effective AI use, encouraged...

Source

8. Don't let known-ROI bias trap you in efficiency AI

NLW worries the token-efficiency era will push orgs toward 'efficiency AI' — doing existing work...

Source

9. Explore model independence with routers and open models

Amid the Fable situation and rising token costs, NLW recommends individuals experiment with...

Source

10. Go explore the role-specific plugins you've been ignoring

Claude Code, Codex, and other tools have built function- and industry-specific plugins, but...

Source

11. GPT-5.6, Sonnet 5, and Gemini 3.5 Pro all slip

Prediction markets saw GPT-5.6 odds for the week plummet from nearly 90% to below 30% on...

Source

12. If you've avoided it, build a real end-to-end agent

For holdouts who skipped the agent hype, NLW says it's time to go past single prompts and...

Source

13. Move work out of files and into HTML and web apps

Codex launched a sites feature and Anthropic is pushing a similar pattern, letting knowledge...

Source

14. Revisit your org's open-model and router policies

Most enterprises lack org-level policies on open models or router architectures — and where they...

Source

15. Sonnet 5 reads as a stopgap

Claude Sonnet 5 is available to select enterprise customers under early access, described as a...

Source

16. Start by mapping your personal capability gap

NLW's first step is an honest assessment of the capabilities, tools, and workflows you're not...

Source

17. We're in a forced, involuntary AI pause

With new model releases off the menu, NLW argues the previous generation — GPT-5.5, Opus 4.8 —...

Source

18. You need a measurement philosophy, not one metric

Measuring adoption, usage, and outcomes are all different — and even imperfect measures like...

Source

19. Satya Nadella argued every company should build its own AI model rather than relying on a handful of frontier...

Source

20. Stanford's new dashboard confirmed AI is quietly squeezing out entry-level workers

A new live tracker covering 4.6M...

Source

21. SoftBank's Masayoshi Son publicly dismissed Elon Musk's plan to build AI data centers in orbit, saying electricity...

Source

22. How to Use Copilot Cowork's New Skills + Scheduling Features (Now That It's Officially Live)

Creator Shane Young just dropped a full breakdown of what's new now that Microsoft Copilot...

Source

23. Safety researchers found GPT-5.6 shows signs of 'metagaming' where the model tries to guess what behavior evaluators are looking for, then acts accordingly

Which is either reassuring (it's not scheming!) or deeply unsettling (it's learning to perform...

Source

24. Austria lobbied the EU to host Anthropic within its borders after US export controls blocked foreign nationals from...

Source

25. OpenAI launches GPT-5.6 in limited preview

OpenAI just launched GPT-5.6 Sol, its most capable model ever, alongside two cheaper siblings...

Source

26. Claude Code turned every engineer into three. Now companies need more product thinkers

AI coding agents have dramatically increased engineering output, shifting the bottleneck from...

Source

27. Google Limiting Meta's Gemini Use

Google reportedly limited Meta's access to Gemini capacity after Meta requested more compute...

Source

28. GPT-5.6 Sol, Terra, and Luna

OpenAI introduced GPT-5.6 Preview, a family of models named Sol, Terra, and Luna, with Sol...

Source

29. Memory Prices report from Stanford

Stanford published an interactive report on historic and current memory and storage prices. The...

Source

30. Moneyball for Physical AI

Data engineering pipelines should deprecate cumulative operational hours as a primary metric....

Source

31. Reward Models Can Be Too Sensitive

Meta studied how reward models can overreact to equally good responses, leading reinforcement...

Source

32. Expanding Project Glasswing (Announcements, Jun 2, 2026)

We’re extending Project Glasswing to approximately 150 new organizations in more than fifteen countries. Project Glasswing is our collaborative effort to secure the world’s most important software. In...

Source

33. Anthropic opens Seoul office and announces new partnerships across the Korean AI ecosystem (Announcements, Jun 17, 2026)

Source

34. Introducing Claude Tag (Product, Jun 23, 2026)

Claude Tag is a new way for teams to work with Claude. We’re starting on Slack, which Claude...

Source

35. Teaching Claude why (Alignment, May 8, 2026)

New research on how we've reduced agentic misalignment. Last year, we released a case study on agentic misalignment. In experimental scenarios, we...

Source

36. Agentic coding and persistent returns to expertise (Economic Research, Jun 16, 2026)

Building on prior work, we introduce a...

Source

37. Unlimited OCR Works

Recently, end-to-end OCR models, exemplified by DeepSeek OCR, have once again thrust OCR into...

Source

38. Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning

Primary arXiv paper watch for agent and workflow research. The paper "Internalizing the Future...

Source

39. ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback

OpenReview lists this as ICML 2024 Poster. The paper focuses on scalable feedback and alignment...

Source

40. Making the law visible to AI systems with the Local Ordinance Corpus

Import AI flags this item as a benchmark or dataset signal for model capability tracking. The...

Source

41. GLM-5.2 is the step change for open agents

Housekeeping: Following my “State of the blog” post last week, noting a slight increase in...

Source

42. Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention

From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs. Primary...

Source

43. Red-Teaming after Mythos — Zico Kolter & Matt Fredrikson, Gray Swan

OpenAI board member Zico Kolter and Gray Swan CEO Matt Fredrikson join swyx to explain why AI...

Source

44. RSI Is the New AGI

The Batch's RSI item is relevant because recursive self-improvement, agent scaffolding, and...

Source
