Semi Doped – Listen here

Episodes

WEKA's Val Bercovici: KV Cache, DeepSeek V4, HBF, SLC vs QLC NAND, CXL, NVLink, Tokenomics
10 Jul · Semi Doped
Vik welcomes Val Bercovici from Weka to discuss the rapidly evolving landscape of AI memory and storage. Val explains how Weka's architecture leverages high-bandwidth networks to make storage faster than motherboard DRAM. They dive into KV cache optimizations, the future of NAND flash tiers, and the role of CXL in AI inference. The episode concludes with a look at predictive memory offloading and the AI flywheel.

Chapters:
0:00 Welcome Val Bercovici, Weka
1:59 Memory situation and model routing
3:50 KV cache offloading to CMX
6:10 Network faster than motherboard
13:10 Weka as AI memory infrastructure
14:45 Inference market is different
16:06 Memory hierarchy and KV cache
19:40 KV cache optimizations and demand
25:20 DeepSeek's cache read pricing
34:49 NAND flash tiers: SLC vs QLC
43:01 High Bandwidth Flash (HBF)
49:59 CXL versus other interconnects

Follow Chipstrat:
Newsletter: https://www.chipstrat.com
X: https://x.com/chipstrat

Follow Vik:
Newsletter: https://www.viksnewsletter.com/
X: https://x.com/vikramskr

Follow Semi Doped:
Get more of Austin and Vik daily, free!
Sign up: https://daily.semidoped.com/
Micron's Record Profits, Apple's CXMT Plea: AI is Eating All the Memory
3 Jul · Semi Doped
Austin and Vik break down the current memory crisis, where AI demand is driving unprecedented profits for memory makers like Micron, while simultaneously causing inflation and price hikes for consumer electronics. They discuss how even premium brands like Apple are struggling to secure supply, leading them to controversial suppliers. The hosts also explore the long-term implications of AI's insatiable memory appetite and its impact on both data center and consumer markets.

Chapters:
0:00 Memory crisis hits
1:20 AI impacting consumers
3:00 AI causing inflation
6:48 Consumer demand drop?
8:49 AI demand inelastic
10:56 Long-term memory outlook
11:02 GoPro's memory woes
12:24 Apple's pricing power
18:20 Apple seeks CXMT DRAM
21:38 Shrinkflation for phones
23:16 Korea's memory investment
26:10 Micron's killing profits
33:00 Why AI needs so much DRAM
40:27 Future of AI training
44:34 Cost-optimizing inference

Follow Chipstrat:
Newsletter: https://www.chipstrat.com
X: https://x.com/chipstrat

Follow Vik:
Newsletter: https://www.viksnewsletter.com/
X: https://x.com/vikramskr

Follow Semi Doped:
Get more of Austin and Vik daily, free!
Sign up: https://www.semidoped.com/
Qualcomm's HBC Memory, Alphawave, Modular, and more
29 Jun · Semi Doped
Qualcomm flew Austin to New York for its investor day, where the communications company laid out a plan to make data center, automotive, and IoT two-thirds of its business by FY29. Austin was in the room and asked Cristiano Amon a question on the record. Vik watched the whole thing on YouTube. Together they break down what actually changed.
The technical centerpiece is High Bandwidth Compute (HBC): stacking LPDDR on top of logic to expose the whole face of the chip for interconnects, claiming up to 100x more lanes and a path around the HBM bandwidth bottleneck. Austin and Vik dig into what's really under that memory, why "no advanced packaging needed" just moves the hard problem somewhere worse, and how it compares to d-Matrix. And as Austin put it: don't let anyone tell you HBM is dead. MOAR memory.
Then the roadmap: the AI200/250/300 accelerators, the C1000 server CPU (5 GHz, 250+ cores, Meta as a customer), the Alphawave and Modular acquisitions, and Chris Lattner's Mojo.
The most interesting takeaway might not be the data center at all. Qualcomm's edge play — AI-defined vehicles, cars as token generators, and a $1T robotics opportunity by 2040 — could be where High Bandwidth Compute matters most.
Chapters:
0:00 Communications? That's just the start
4:08 Inside Qualcomm's investor day
9:16 Can Qualcomm build a data center business?
13:09 Disaggregated inference opens the door
17:57 High Bandwidth Compute: memory on the XPU
30:29 "No advanced packaging" just moves the problem
36:20 The roadmap, Alphawave, and Modular
46:00 The C1000 CPU and the agentic shortage
50:40 Cars as token generators, the $1T robotics bet
57:32 The memory market: MOAR
Follow Semi Doped:
Get more of Austin and Vik daily, free!
Sign up: https://www.semidoped.com/
Connect with Vik and Austin:
Vik's Paid Substack: https://www.viksnewsletter.com
Austin's Paid Substack: https://www.chipstrat.com
Advanced Packaging, TSMC CoWoS, Intel EMIB
19 Jun · Semi Doped
New episode: Advanced packaging for AI chips, from wire bonds to TSMC CoWoS and Intel EMIB.
Packaging is no longer an afterthought. It is the chip, and Intel's EMIB challenges TSMC's CoWoS.
- Three CoWoS flavors: silicon, organic RDL, local bridges
- EMIB embeds tiny bridges into the substrate, no interposer
- EMIB-T and EMIB-M add through-silicon vias and power capacitors
- Google is booking 3M TPUs on EMIB via MediaTek by 2028
- Package sizes keep climbing: 5.5x reticle today, 40x ahead
This episode is brought to you by SambaNova. Try SambaNova's fast inference today at the SambaNova Dashboard!
Connect with Vik and Austin via a daily free newsletter:
https://www.semidoped.com

Vik's Paid Substack: https://www.viksnewsletter.com
Austin's Paid Substack: https://www.chipstrat.com
Chapters:
(0:00) "There Is No Chip Without the Packaging"
(0:28) Intro and SpaceX IPO Day
(5:15) What We're Covering: CoWoS, EMIB, Google
(7:40) Simple Packaging: Wire Bonds to Flip Chip
(17:07) What Makes Packaging "Advanced"
(33:44) CoWoS: Three Flavors Explained
(45:30) EMIB: Intel's Embedded Bridge Approach
(52:47) EMIB-T and EMIB-M
(57:31) CoWoS vs. EMIB Trade-offs
(1:02:18) Google's 3M TPU EMIB Order
Computex Mania 2026: Optics and Power
12 Jun · Semi Doped
Austin and Vik discuss their recent experience at Computex, where they met for the first time in person after six months of podcasting together.

They share insights about the massive show, the people they connected with, and the exciting developments in AI hardware and interconnect technology.

Connect with Vik and Austin via a daily free newsletter:
https://www.semidoped.com

Vik's Paid Substack: https://www.viksnewsletter.com
Austin's Paid Substack: https://www.chipstrat.com

Chapters

00:00 Meeting in Person for the First Time
03:05 Experiencing Computex: A Massive Show
05:17 Connecting with the Audience: Real-Life Encounters
06:46 Networking with Industry Leaders
10:42 Keynote Highlights: Marvell's Vision
15:11 The Future of Interconnects: CPO and Beyond
22:54 Exploring Optical Interconnects and Future Technologies
25:56 Micro LED Developments and Future Conferences
27:34 Power Innovations in Data Centers
30:54 Intel's Keynote and New CPU Technologies
36:31 Intel Foundry's Advancements and Industry Implications
Huawei's Tau Scaling Law: Is the "EUV Killer" Real?
29 May · Semi Doped
Huawei dropped a paper claiming 1.4nm-class performance without EUV, and the internet immediately declared ASML dead and US export controls useless. Austin and Vik recorded one day after Memorial Day to unpack what Huawei actually announced at ISCAS 2026 — and why the "EUV killer" headline gets the story backwards.

They walk through the tau scaling law (tau is delay, and the idea is to attack it at the system level instead of the transistor), logic folding via hybrid bonding, the Kirin 2026 that doubles transistor count without shrinking, and who can actually manufacture stacked logic. Then the other tau knobs: a unified memory bus and near-packaged optics. Along the way: BESI vs EV Group, die-to-wafer vs wafer-to-wafer bonding, and why hybrid bonding isn't export-controlled the way EUV is.

The takeaway is the opposite of the headline. Tau scaling is rational engineering under constraint, it's bullish for ASML (two DUV wafers per product, not fewer), and the moment EUV-enabled fabs stack their own advanced-node wafers, the gap widens instead of narrowing. Bullish advanced packaging, bullish EDA and multiphysics.

Chapters:
0:00 The "EUV killer" paper that broke the internet
2:28 What Huawei actually announced at ISCAS
4:00 Tau scaling: optimize delay, not transistors
8:58 The equation and the 10x AI claim
11:05 Logic folding: stacking logic on logic
17:24 Who builds it, and can hybrid bonding be banned?
24:16 Why this is bullish for ASML
29:49 The other tau knobs: memory and optics
35:18 Takeaways: packaging, EDA, multiphysics

Follow Semi Doped:
Get more of Austin and Vik daily, free!
Sign up: https://www.semidoped.com/

Follow Chipstrat:
Newsletter: https://www.chipstrat.com
X: https://x.com/chipstrat

Follow Vik:
Newsletter: https://www.viksnewsletter.com
X: https://x.com/vikramskr
Lithography Masterclass
22 May · Semi Doped
Spend one hour here and you've caught up on the entire arc of semiconductor lithography. Austin and Vik run a masterclass on the technology that decides who gets to make leading-edge chips, and why so few companies can afford to.
The thread is economics. An EUV machine runs about $400 million, a new fab needs roughly 15 of them, and the total bill clears $20-30 billion before a single wafer ships. Austin and Vik trace the whole story: Rock's Law and the cost of a fab, what it actually takes to build one, the evolution from 193nm DUV through multi-patterning to 13.5nm EUV, how ASML generates EUV light by exploding falling tin droplets, and the move to high NA and its mirrors. Along the way, the fun history — i-line, krypton fluoride, immersion lithography, and the engineer who started it all by flipping a microscope upside down.
Then the part that matters most: where lithography goes next. Two startups, xLight and Substrate, are attacking the cost problem from first principles. xLight wants to decouple the light source from the scanner with a free-electron laser and sell photons as a service. Substrate wants to skip EUV entirely and revive X-ray lithography. If either works, the economics of who can build a fab change completely.
Chapters:
0:00 The 13F panic, and today's topic
2:23 Why the real story is economics, not physics
6:18 Austin in the clean room: graphene and bunny suits
10:06 Rock's Law and the $20 billion fab
18:08 DUV, the Sharpie, and a history of light
24:58 Multi-patterning, explained with a football field
34:45 How EUV makes 13.5nm light from tin droplets
41:14 High NA, anamorphic optics, and the half-field tax
46:45 The startups rethinking lithography: xLight and Substrate
Relevant reading:
Chipstrat — The economics of lithography: https://www.chipstrat.com/p/lithography-economics
Chipstrat — xLight and photons as a service: https://www.chipstrat.com/p/photons-as-a-service
Chipstrat — Substrate and X-ray lithography: https://www.chipstrat.com/p/substrate
Vik's Newsletter — the viability of X-ray lithography: https://www.viksnewsletter.com/p/an-in-depth-look-at-the-viability
Fred Chen — LELE multipatterning and EUV stochastics (Substack): https://frederickchen.substack.com/p/can-lele-multipatterning-help-against
Chip War, Chris Miller
Focus, Marc Hijink (the ASML book): https://www.amazon.com/Focus-Inside-struggle-complex-machine-ebook/dp/B0CW1FLCD4
Follow Chipstrat:
Newsletter: https://www.chipstrat.com
X: https://x.com/chipstrat
Follow Vik:
Newsletter: https://www.viksnewsletter.com/
X: https://x.com/vikramskr
Follow Semi Doped:
Get more of Austin and Vik daily, free!
Sign up: https://www.semidoped.com/
Cerebras IPO
15 May · Semi Doped
Cerebras IPO is the only thing to talk about this week. 🔥
IPO prices at $185/share. Pops nearly 70% right after. The first wafer-scale chip company to make it public — after a 40-year curse killed every prior attempt.
A water-cooler-style convo on what Cerebras actually builds, why a 23 kW wafer is a power and cooling nightmare, why 44 GB of SRAM is both the magic and the wall for LLM inference, and the cursed Trilogy Systems saga that Gene Amdahl tried — and failed — to pull off in 1983.
Why does Cerebras leave the whole wafer intact instead of dicing it? How do they route around defects to harvest ~900K working cores out of ~1M? Why is power delivery vertical, and why does the wafer literally expand a tenth of a millimeter when it heats up? What does the OpenAI deal actually buy — wafers, or tokens? And why does that distinction matter?
Chapters:
0:00 Cold open: 23 kW per wafer
0:15 Cerebras IPO day at $185
2:39 What's a wafer-scale engine
10:30 Power, cooling, and thermal expansion
18:12 The 44 GB wall
26:35 The Trilogy Systems curse
32:11 Supercomputing → training → inference
39:36 The OpenAI deal and the Wild West
Relevant reading:
Vik's Substack post on the Cerebras IPO and OpenAI deal: https://www.viksnewsletter.com/
Follow Chipstrat:
Newsletter: https://www.chipstrat.com
X: https://x.com/austinsemis
Follow Vik:
Newsletter: https://www.viksnewsletter.com/
X: https://x.com/vikramskr
Follow Semi Doped:
Get more of Austin and Vik daily, free!
Sign up: https://www.semidoped.com/
Gimlet's Cross-Vendor Inference Cloud
12 May · Semi Doped
Gimlet Labs runs an inference cloud built on heterogeneous silicon. Their software traces a PyTorch workload, segments it into its component parts, and schedules each piece onto the best-suited hardware — connecting chips from different vendors on a single high-speed fabric.

In this interview, Gimlet co-founder Natalie Serrino and former Intel executive Beltir walk through the architecture (graph trace, optimal split points, lowering each segment to TensorRT on NVIDIA and equivalents elsewhere), the three customer segments they sell into (frontier labs, sovereign clouds, AI natives), and a concrete demo: on GPT-OSS 120B at 8K input / 1K output, running the speculative decoder on a d-Matrix Corsair card while NVIDIA B200s handle the verifier shifts the throughput-vs-interactivity Pareto frontier roughly 4× over GPU-only speculative decode.

The most surprising takeaway: most Neoclouds gave significant equity to a single silicon vendor in exchange for capacity. Hardware amortization is around 70% of their annual costs, and the equity terms prevent them from diversifying their silicon. So the only software innovation they can ship is disaggregation on top of one vendor's stack — never across vendors. Gimlet's two-track model (deploying orchestration software inside customer data centers, plus running their own Neocloud built on mixed silicon) is the answer to that constraint.

Read the full transcript on Chipstrat.

Chapters:
0:00 Intro and the chips no one's connected before
0:33 Inference cloud for agents
1:02 From Intel to Gimlet
2:14 The case for heterogeneous inference
4:03 Disaggregating inference by resource profile
6:24 Tracing PyTorch into a schedulable graph
8:08 Connecting chips never connected before
10:52 CPUs as the agentic workhorse
12:01 Tool calls in the same data center as the LLM
13:21 Latency vs throughput on a shared fabric
14:57 Three customer buckets
15:54 Sovereigns: make an API call, not a porting project
19:37 "Cracked software is the platform"
22:24 Why merchant silicon vendors need partners
25:18 Hyperscalers outsourcing CapEx, not just kernels
28:49 AI natives: latency budgets, not just price
32:06 The d-Matrix partnership
33:31 The Pareto frontier chart
35:56 Speculative decode on Corsair: 4× shift
37:27 4× faster, or 3× more customers?
41:22 Why most Neoclouds can't follow this model
42:34 Gimlet's two-track business model
44:30 CoreWeave vs Together vs Gimlet
45:15 Series A and hiring

Relevant reading:
The Information on Gimlet helping OpenAI optimize for Cerebras: https://www.theinformation.com/newsletters/ai-agenda/startup-helping-openai-optimize-ai-cerebras-chips
Sachin Katti and Zain Asgar coauthored research at Stanford: https://arxiv.org/abs/2507.19635

Follow Chipstrat:
Newsletter: https://www.chipstrat.com
X: https://x.com/chipstrat
Power as the Next Physics Wall for AI
8 May · Semi Doped
What's common to optics and power that ruins everything in the era of AI?
Resistance.
The same physics that drove interconnects to optics is now driving low-voltage power delivery up to 800V. Austin Lyons (Chipstrat) and Vik Sekar (Vik's Newsletter) unpack it using the Kyber rack as an example.
At 600kW and 48V, you're pushing 12,500 amps through a single rack. Power loss scales with I². The math doesn't work. The fix is 800V — and the parts come straight from the EV traction inverter ecosystem (SiC, GaN, IGBTs).
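The arithmetic behind those numbers can be sketched in a few lines of Python. This is a rough illustration, not the episode's own calculation: it assumes the same 600 kW load and, as a simplifying assumption, the same conductor resistance at both voltages, so conduction loss scales purely with I². The function names are made up for this sketch.

```python
# Rough sketch: current and relative I²R loss for one rack at two bus voltages.
# Assumption (not from the episode): identical conductor resistance in both cases.

RACK_POWER_W = 600_000  # 600 kW rack, as quoted in the show notes


def rack_current_amps(power_w: float, bus_v: float) -> float:
    """Current needed to deliver power_w at bus_v (I = P / V)."""
    return power_w / bus_v


def relative_conduction_loss(bus_v: float, ref_v: float) -> float:
    """Loss at bus_v relative to ref_v for the same power and resistance.

    Loss is I² * R and I is proportional to 1 / V, so the ratio is (ref_v / bus_v)².
    """
    return (ref_v / bus_v) ** 2


i48 = rack_current_amps(RACK_POWER_W, 48)
i800 = rack_current_amps(RACK_POWER_W, 800)
loss_ratio = relative_conduction_loss(800, ref_v=48)

print(f"48 V bus:  {i48:,.0f} A")
print(f"800 V bus: {i800:,.0f} A")
print(f"I²R loss at 800 V vs 48 V: {loss_ratio:.2%}")
```

In practice the conductors would be resized for the lower current, so the real savings are split between lower loss and less copper; the sketch only shows the direction and scale of the effect.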
We cover the full grid-to-GPU power conversion chain (substation, utility room, PSU, intermediate bus converter, VRM), why vertical power delivery is the CPO equivalent for power, and why the power industry is a much wider open problem than optics or HBM. Plus the new topology fight: 800V → 48V (reuse the existing 48V infrastructure) vs 800V → 6V (skip 48V entirely, like TI and Navitas are pushing).
We also touch on Coherent's six-inch indium phosphide ramp at Järfälla, Sweden, and why margins are the real read-through next quarter.
Relevant reading:
Vik's Substack post on power: https://www.viksnewsletter.com/p/power-delivery-as-the-next-physics-wall
Google TPU 8i / 8t blog (Boardfly deep dive): https://cloud.google.com/blog/products/compute/tpu-8t-and-tpu-8i-technical-deep-dive
Get more of Austin and Vik daily, free! Sign up here: https://www.semidoped.com/
Follow Chipstrat:
Newsletter: https://www.chipstrat.com
X: https://x.com/austinsemis
Follow Vik:
Newsletter: https://www.viksnewsletter.com/
X: https://x.com/vikramskr
Chapters
(00:00) Intro
(01:41) Memory tax: inflation, not innovation
(03:46) Boardfly: 16 hops to 7
(05:12) Coherent's six-inch indium phosphide ramp
(12:15) Power is the next physics wall
(15:08) Why 48V breaks at 600kW: 12,500 amps
(23:05) 800V and vertical power delivery: CPO for power
(30:34) Grid to GPU: every stage is a different supply chain
(39:20) 800V → 48V or skip straight to 6V?
CapEx is just Memory Tax Now, DeepSeek V4 NAND impact
4 May · Semi Doped
The hyperscaler memory tax quarter.
More CapEx? Pssh. We knew flops needed scaling.
But $25B at Microsoft alone just to pay higher component prices?
A memory tax. That's the news.
NAND? Sold out. HBM? Sold out.
What we cover:
- SanDisk revenue +97% sequential.
- 78% gross margin. Guidance above 80% next quarter.
- Samsung HBM4 first to ship. Demand outstripping supply.
- DeepSeek v4 goes SSD-centric. KV cache offloads to flash.
- Microsoft: $25B of 2026 CapEx is just memory pricing.
- Jassy: memory shortage pushes on-prem to AWS.
- Qualcomm: mystery custom ASIC. Ships December.
New Semi Doped with @vikramskr and @austinsemis.
Check out our Substacks
- https://www.viksnewsletter.com/
- https://www.chipstrat.com/
Chapters:
0:00 Intro and Vik goes full-time
5:15 Earnings week: the memory tax
7:26 Samsung HBM4 and the Gbps race
14:42 Is the memory tax worth it?
17:37 SanDisk and the SunDisk origin
23:22 78% gross margins and 5-year supply lock-ins
29:29 DeepSeek v4 and SSD-centric inference
38:49 Hyperscaler CapEx and the cloud pull
42:49 AI accelerators: TPU, Trainium, MTIA
Masterclass on Google's TPU v8 Networking
24 Apr · Semi Doped
Google's Cloud Next 2026 keynote? Fire. 🔥
The TPU is now two chips instead of one — 8t for training, 8i for inference — but more interestingly, it's two scale-up networking topologies too.
Austin Lyons (Chipstrat) and Vik Sekar (Vik's Newsletter) walk through what actually changed, one day after the announcement. OCS? Yes. AECs? Yep. Copper? Yep. Optics? Yep.
We cover Virgo (Google's 47 petabit/second scale-out fabric, built entirely on OCS), Boardfly (the new scale-up topology for MoE inference that cuts hop count from 16 to 7), and the 3D torus Google still uses for training.
Why is optical circuit switching the substrate of Google's data center? Why do active electrical cables still carry scale-up traffic inside racks? Why did Google split the CPU layer too, with custom ARM Axion head nodes to keep the TPUs fed?
Along the way we trace the Dragonfly topology lineage to a 2008 paper by John Kim, Bill Dally, Steve Scott, and Dennis Abts. Abts went on to build Groq's rack-scale interconnect before landing at Nvidia.
Chapters:
0:00 Intro
0:21 Two TPUs for two workloads
2:31 HBM, SRAM, and Axion CPUs
7:22 Why networking is the new bottleneck
17:14 Virgo: rebuilding scale-out on optics
25:24 3D torus Rubik's Cube scale-up for training
34:50 Boardfly: scale-up for MoE inference
42:07 Workload-specific everything
Follow Chipstrat:
Newsletter: https://www.chipstrat.com
X: https://x.com/austinsemis
Follow Vik:
Newsletter: https://www.viksnewsletter.com/
X: https://x.com/vikramskr
Meta VP Matt Steiner on Ads Infra, GPUs, MTIA, and LLM-Written Kernels
20 Apr · Semi Doped
Matt Steiner, VP of Monetization Infrastructure, Ranking & AI Foundations at Meta, walks through how Meta's ad system actually works, and why the infrastructure behind it differs from what you'd build for LLMs.

We cover Andromeda (retrieval on a custom NVIDIA Grace Hopper SKU Meta co-designed), Lattice (consolidating N ranking models into one), GEM (Meta's Generative Ads Recommendation foundation model), and the adaptive ranking model, a roughly one-trillion-parameter recommender served at sub-second latency.

We get into why recommender workloads aren't embarrassingly parallel like LLMs (the "personalization blob"), what that means for Meta's MTIA custom silicon roadmap, and how LLM-written kernels (KernelEvolve) flipped the economics of running a heterogeneous hardware fleet. Demand for software engineering has actually gone up as the price has come down. Meta now wants ~100x more optimized kernels per chip.

Read the full transcript at https://www.chipstrat.com/p/an-interview-with-meta-vp-matt-steiner

Chapters:
0:00 Intro and scale
0:39 How Meta's ad system works
2:00 Meta Andromeda and the custom NVIDIA SKU
3:30 Lattice: consolidating ranking models
5:00 GEM, Meta's ads foundation model
6:30 Adaptive ranking for power users
8:17 The scale: 3B DAUs at sub-second latency
9:40 Why longer interaction histories matter
10:45 The anniversary gift analogy
12:57 A decade of compute evolution
15:21 Meta's infra as a CP-SAT problem
16:07 Co-designing Grace Hopper with NVIDIA
17:47 Matching compute shape to workload
18:26 Influencing hardware and software roadmaps
20:23 MTIA: why ads aren't LLMs
22:07 The personalization blob and I/O ratios
26:38 One trillion parameters at sub-second latency
28:26 Heterogeneous hardware trade-offs
29:30 KernelEvolve: LLMs writing custom kernels
33:30 GenAI and recommender systems cross-pollination
35:21 The 2-year infrastructure outlook
37:00 Why demand for software engineering is rising
38:53 How Matt stays on top of it all

Relevant reading:
KernelEvolve (Meta Engineering): https://engineering.fb.com/2026/04/02/developer-tools/kernelevolve-how-metas-ranking-engineer-agent-optimizes-ai-infrastructure/

Follow Chipstrat:
Newsletter: https://www.chipstrat.com
X: https://x.com/chipstrat
Credo + Dust Photonics, XPO, Nuvacore
17 Apr · Semi Doped
Austin and Vik discuss Credo's acquisition of Dust Photonics, XPO as the new standard for scale-out (maybe instead of CPO?) and some thoughts about Nuvacore entering the CPU scene for agentic AI.

Gavin Baker's tweet: https://x.com/GavinSBaker/status/2044410644301046031?s=20

Vik's Substack: https://www.viksnewsletter.com
Austin's Substack: https://www.chipstrat.com

Chapters

00:00 Introduction to the Semiconductor Landscape
02:49 The Rise of Nuvacore and CPU Innovations
05:27 The Demand for CPUs in the AI Era
07:59 Photonics: The Next Frontier in Semiconductors
10:26 Credo's Acquisition of Dust Photonics
13:12 Vertical Integration in Semiconductor Companies
15:15 The Future of Copper and Optical Technologies
20:28 The Evolution of AI Training Models
25:28 Innovations in Optical Interconnects
31:10 The Future of Data Center Connectivity
36:56 Strategic Implications in the Optical Ecosystem
Is Intel Finally Back with a $300B market cap? OpenClaw can Dream?
10 Apr · Semi Doped
In this episode, Austin and Vik discuss whether Intel is finally back, with CPU partnerships with Google and heterogeneous inference with SambaNova, as its market cap soars above $300B. Vik tries to get his OpenClaw instance to dream every night.

Chapters

00:00 Anthropic's New Direction: Chip Development
02:30 Navigating Subscription Changes and Token Costs
05:25 Exploring Alternative AI Models
08:10 The Economics of AI: Rent vs. Buy
10:56 Intel's Resurgence and Market Dynamics
15:23 Intel's Strategic Partnerships and Market Positioning
19:37 The Role of IPUs in Modern Computing
25:08 Coexistence of x86 and ARM Architectures
29:55 Innovations in Chip Architecture and Future Prospects
Reiner Pope (MatX): Designing AI Chips From First Principles for LLMs
9 Apr · Semi Doped
Reiner Pope is the co-founder and CEO of MatX, the startup building chips designed from first principles for LLMs. Before MatX, Reiner was on the Google Brain team training LLMs, and his co-founder Mike Gunter was on the TPU team. They left Google one week before ChatGPT was released.
A counterintuitive throughput insight from the conversation:
“Low latency means small batch sizes. That is just Little’s law. Memory occupancy in HBM is proportional to batch size. So you can actually fit longer contexts than you could if the latency were larger. Low latency is not just a usability win, it improves throughput.”
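The chain of reasoning in that quote can be sketched numerically. This is a toy model with made-up numbers, not MatX's figures: the HBM budget, per-token KV cache size, and request rate below are all hypothetical, and the function names exist only for this illustration. Little's law says the number of requests in flight equals throughput times latency; if KV cache memory is proportional to that batch, a fixed HBM budget leaves more room per request when latency is lower.

```python
import math

# All three constants are hypothetical values chosen for illustration.
HBM_KV_BUDGET_BYTES = 64 * 2**30   # HBM set aside for KV cache
KV_BYTES_PER_TOKEN = 128 * 2**10   # KV cache bytes per token of context
THROUGHPUT_RPS = 100               # requests completed per second


def in_flight_requests(throughput_rps: float, latency_s: float) -> int:
    """Little's law: average requests in flight = throughput * latency."""
    return math.ceil(throughput_rps * latency_s)


def max_context_tokens(budget_bytes: int, batch: int, kv_bytes_per_token: int) -> int:
    """Longest context per request that fits when the budget is shared by the batch."""
    return budget_bytes // (batch * kv_bytes_per_token)


slow_batch = in_flight_requests(THROUGHPUT_RPS, latency_s=2.0)
fast_batch = in_flight_requests(THROUGHPUT_RPS, latency_s=0.5)

slow_ctx = max_context_tokens(HBM_KV_BUDGET_BYTES, slow_batch, KV_BYTES_PER_TOKEN)
fast_ctx = max_context_tokens(HBM_KV_BUDGET_BYTES, fast_batch, KV_BYTES_PER_TOKEN)

print(f"latency 2.0 s -> batch {slow_batch}, max context {slow_ctx} tokens")
print(f"latency 0.5 s -> batch {fast_batch}, max context {fast_ctx} tokens")
```

At the same throughput, cutting latency by 4x cuts the in-flight batch by 4x, which in this simplified model leaves roughly 4x the context length per request in the same memory.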
We get into:
• The hybrid SRAM + HBM bet, and why pipeline parallelism finally works
• Overcoming the CUDA moat
• Why frontier labs are willing to bet on an AI ASIC startup
• Memory-bandwidth-efficient attention, numerics, and what MatX publishes (and what it does not)
• Why 95% of model-side news is noise for chip design
• Why sparse MoE drives MatX to “the most interconnect of any announced product”
• How MatX uses AI for its own chip design
• The biggest challenges ahead
Chapters:
00:00 “We left Google one week before ChatGPT”
00:24 Intro: who is MatX
01:17 Origin story: leaving Google for LLM chips
02:21 GPT-3 and the “too expensive” problem
04:25 Why buy hardware that is not a GPU
05:52 Overcoming the CUDA moat
08:46 Early investors
09:35 The name MatX
09:59 The chip: matrix multiply + hybrid SRAM/HBM
12:11 Why pipeline parallelism finally works
14:22 Reading papers and Google going dark
15:20 Research agenda: attention and numerics
17:06 Five specs and meeting customers where they are
19:24 Why frontier labs are the natural first customer
20:32 Workloads: training, prefill, decode
22:18 Little’s law and the throughput case for low latency
24:29 Interconnect and MoE topology
26:35 Inside the team: 100 people, full stack
28:32 Agentic AI: 95% noise for hardware
30:35 KV cache sizing in an agentic world
32:11 How MatX uses AI for chip design (Verilog + BlueSpec)
34:23 Go to market: proving credibility under NDA
35:12 Porting effort for frontier labs
36:34 Biggest skepticism: manufacturing at gigawatt scale
37:32 Hiring plug
Austin Lyons @ Chipstrat: https://www.chipstrat.com
Vik Sekar @ Vik's Newsletter: https://www.viksnewsletter.com/
$300M for 70K Viewers | Intel x Elon, OpenAI x TBPN, Citrini's Strait of Hormuz Stunt
7 Apr · Semi Doped
Intel Foundry just partnered with Elon Musk’s Terafab. What is Terafab anyway, why vertically integrated fabs make sense but the economics don’t (yet!), and what Intel is doing here (hint: no idea).
Then: OpenAI acquires TBPN for an estimated $100-300M. Not sure why, but the more interesting thing is the value of niche audiences when five companies control a trillion dollars in AI capex.
And finally, Citrini Research sent an analyst to the Strait of Hormuz with a Pelican case full of spy gear, $15K cash, and Cuban cigars. The most unhinged research trip in Substack history.
Austin Lyons — Chipstrat (https://chipstrat.com) Vik Sekar — Vik's Newsletter (https://www.viksnewsletter.com)
Subscribe for weekly episodes on semiconductors, AI, infrastructure, and the business of chips.
NVIDIA's Marvell Strategy, Is Memory Different This Time?, Intel's Ireland Fab
3 Apr · Semi Doped
In this episode, Austin and Vik analyze NVIDIA's $2 billion investment in Marvell NVLink Fusion, exploring its implications for AI infrastructure, interconnect protocols, and the broader chip ecosystem. They also discuss the current memory market surge, DRAM pricing, and Intel's strategic fab buyback, providing deep insights into industry trends and future directions.

On Substack
Vik: https://www.viksnewsletter.com/
Austin: https://www.chipstrat.com/

Chapters

00:00 NVIDIA's $2 Billion Investment in Marvell
20:11 The Memory Market Crisis
20:16 The Future of Memory Pricing and Consumer Impact
22:55 The Cycle of Supply and Demand in Memory
27:23 AI's Impact on Memory Demand
31:46 Long-Term Agreements and Market Stability
35:07 Intel's Strategic Fab Buyback
40:44 Monopoly Analogy: Intel's Market Strategy
ARM AGI CPU has entered the chat, TurboQuant thrashes memory stocks
27 Mar · Semi Doped
In this episode, Austin and Vik analyze recent developments in GloFo patent lawsuits, the impact of TurboQuant on AI inference, and ARM's strategic move into silicon for agentic AI workloads.

Read Vik's substack: https://www.viksnewsletter.com
Read Austin's substack: https://www.chipstrat.com

Chapters

00:00 Patent Wars in Semiconductor Industry
07:14 Understanding TurboQuant and Its Implications
24:42 Innovations in Memory Management
28:00 The Rise of ARM AGI CPUs
32:56 Agentic AI and CPU Compatibility
39:54 Performance Metrics in Agentic AI
44:52 ARM's Market Timing and Challenges
MicroLEDs Ain’t Dead, Micron Snags Vera Rubin
20 Mar · Semi Doped
Austin and Vik break down a packed week in semiconductors, covering GTC, OFC, and Micron earnings. The conversation kicks off with Jensen Huang's bold claim that engineers should spend $250K/year on AI tokens, and whether companies will buy tokens or token generators (i.e., on-prem hardware like the Dell Pro Max with GB300). They dig into the CapEx vs OpEx tradeoffs, data security concerns, and how sharing GPU resources might end up looking a lot like the old EDA license model.

Next up: Micron crushed earnings and appears to be designed into Vera Rubin for HBM4 — despite months of rumors saying otherwise. Austin and Vik unpack the nuance around HBM pin speeds, memory node base dies, and what Micron's massive new fab investments in Taiwan, Singapore, Idaho, and New York mean for the memory cycle.

The back half of the episode dives into optical interconnects for AI scale-up. A new industry consortium (OCI-MSA) has formed with Meta, Broadcom, NVIDIA, and OpenAI to standardize optical components. Vik explains why traditional indium phosphide lasers might be overkill for short-reach scale-up, and makes the case for micro LEDs — a "slow but wide" approach that could fill the gap between copper and conventional optics. They also touch on Credo's expanding product portfolio (and the infamous purple-to-orange cable saga), plus Lumentum's new VCSEL work for scale-up.

Vik - https://www.viksnewsletter.com/
Austin - https://www.chipstrat.com/

CHAPTERS
0:00 Intro & GTC/OFC Conference Overload
2:09 Jensen's $250K Token Budget Per Engineer
5:08 On-Prem Inference vs. Cloud Token Spending (Dell Pro Max, CapEx vs OpEx)
6:44 Sharing GPU Resources Like EDA Licenses
8:16 Data Security & On-Prem Privacy Concerns
9:53 Matthew Berman's Fine-Tuned Open Claw Agent
10:35 Vik Sets Up Open Claw on a Home Server
11:53 Always Be Clauden (ABC) – Managing Agents from Your Phone
13:34 Micron Earnings & HBM4 in Vera Rubin
16:39 HBM Pin Speeds & the Micron Design-In Debate
20:17 Micron's New Fab Investments & Memory Cycle Fears
23:49 Why AI Drives a Step Change in Memory Demand
26:30 Optical Compute Interconnect MSA (OCI-MSA)
29:48 Scale-Up Optics: Do We Need New Technology?
30:58 Micro LEDs – The "Slow but Wide" Approach
35:45 Micro LEDs vs. Copper vs. Traditional Optics
36:55 Credo's Product Spectrum & the Purple Cable Story
39:31 VCSELs & Lumentum's 1060nm Scale-Up Play