Google having their own hardware for training and inference is newsworthy, but the link is pretty bad. Here's a much better source: https://blog.google/products/google-cloud/ironwood-tpu-age-o...
I'm an idiot and I know nothing
But I wonder if there could be room for an ARM-like spec, but for AI chips, that Google could try to own and license. ARM is to RISC CPUs as google-thing is to AI ASICs.
Probably a dumb idea; better to just sell the chips, or access to them?
I'm not sure the chip spec (or instruction set) is the level of abstraction here?
Something like DirectX (or OpenGL) might be the better level to target? In practice, CUDA is that level of abstraction, but it only really works for Nvidia cards.
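To make that concrete, here's a minimal sketch of what a hardware-agnostic layer looks like, using JAX purely as an illustration (the function and shapes are made up): the same code runs unchanged on CPU, GPU, or TPU, with the backend picked at runtime.

    # Minimal sketch of a hardware-agnostic layer, in the DirectX/OpenGL spirit.
    # Purely illustrative: the function and shapes are made up for this example.
    import jax
    import jax.numpy as jnp

    @jax.jit  # compiled through XLA for whichever backend is actually present
    def scaled_softmax(q, k):
        return jax.nn.softmax(q @ k.T / jnp.sqrt(q.shape[-1]))

    q = jnp.ones((128, 64))
    k = jnp.ones((128, 64))
    print(jax.devices())               # e.g. [CpuDevice(id=0)] or TPU devices
    print(scaled_softmax(q, k).shape)  # (128, 128) on any backend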
It's not that it only works on Nvidia cards; it's only allowed to work on Nvidia cards. A non-clean-room implementation of CUDA for other hardware has been done, but it violates the EULA (of the thing that was reverse engineered), the copyright on the driver binary interface, and often patents. Nvidia aggressively sends cease-and-desist letters and threatens lawsuits (it successfully killed ZLUDA and threatened others). It's an artificial moat (in the technical sense).
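To make the "driver binary interface" part concrete, here's a minimal sketch (Python with ctypes; it needs an actual Nvidia driver installed to run): CUDA programs ultimately resolve C symbols exported by libcuda, and a ZLUDA-style project ships a replacement library exporting those same symbols backed by other hardware.

    # Minimal sketch: the "driver binary interface" is just C symbols in the
    # libcuda shared library. A reimplementation exports the same symbols over
    # other hardware. Requires a real Nvidia driver to run as written.
    import ctypes

    libcuda = ctypes.CDLL("libcuda.so.1")  # the proprietary driver library
    assert libcuda.cuInit(0) == 0          # CUresult 0 == CUDA_SUCCESS

    count = ctypes.c_int()
    libcuda.cuDeviceGetCount(ctypes.byref(count))
    print(f"{count.value} CUDA device(s) visible through the driver ABI")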
Spectral just did a thread on that.
https://x.com/SpectralCom/status/1993289178130661838
> CUDA is that level of abstraction, but it only really works for Nvidia cards.
There are people actively working on that.
https://scale-lang.com/
Not much real data or news there.
I think we need an analysis of tokens/$ and tokens/second for Nvidia Blackwell vs. Ironwood.
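Something like this back-of-the-envelope sketch would be the starting point; every number below is a placeholder, not a measured Blackwell or Ironwood figure.

    # Back-of-the-envelope for the comparison above. All numbers are
    # placeholders, not real Blackwell or Ironwood figures; substitute
    # measured throughput and your actual hourly price.
    def tokens_per_dollar(tokens_per_second: float, dollars_per_hour: float) -> float:
        """Tokens generated per dollar of accelerator time."""
        return tokens_per_second * 3600 / dollars_per_hour

    # Hypothetical inputs, for illustration only.
    for name, tps, price in [("chip A", 12_000, 10.0), ("chip B", 9_000, 6.0)]:
        print(f"{name}: {tps:,} tok/s, {tokens_per_dollar(tps, price):,.0f} tokens/$")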
It depends on how they're utilized; especially at these scales, you have to squeeze every bit out of them.
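For intuition on what "squeezing every bit out" means, here's a rough sketch of the standard model-FLOPs-utilization (MFU) calculation; all inputs are made up for illustration, not specs for either chip.

    # Rough MFU (model FLOPs utilization) sketch: achieved FLOP/s over peak.
    # All inputs are made-up illustrations, not published chip specs.
    def mfu(tokens_per_second: float, flops_per_token: float, peak_flops: float) -> float:
        """Fraction of the accelerator's peak throughput actually achieved."""
        return tokens_per_second * flops_per_token / peak_flops

    # Hypothetical: ~1.4e11 FLOPs/token (about 2 * params for a 70B model at
    # inference), 10k tok/s on the slice, against a made-up 4.6e15 FLOP/s peak.
    print(f"MFU: {mfu(10_000, 1.4e11, 4.6e15):.1%}")  # ~30%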
> It’s designed for AI with AI
CUDA engineers, your job security has never felt more certain.
These are only available in Iowa on GCP, which raises a question for me: do they have them all over the world for their own purposes, or does this limited geography mean that users of Google AI features get different experiences depending on their location?
Running on v6 vs. v7 should just mean different performance, not different results.
So we will be getting wrong answers faster now.
I'll never understand this attitude. Recently I set up a full network with 5 computers, OPNsense, XCP-ng, and a few things like a Pi, a switch, an AP, etc.
I was migrating from pfSense to OPNsense, so I wasn't too familiar with some of the nitty-gritty. I was also migrating from XCP-ng 8.2 to 8.3, which has some major CLI differences. It was a pretty big migration that took me a full weekend.
ChatGPT got things wrong maybe 8 times in the whole project (mostly because it was using old documentation; OPNsense had just upgraded) and quickly corrected itself when I elaborated on the problem.
If I'd just had Google, this would've easily been a two-week project. I'd have had to trudge through extremely dry documentation that mostly doesn't apply to anything I'm doing, and read a bunch of toxic threads demeaning users who don't know everything. Instead I had ChatGPT 5 do all that for me and got to the exact same result with a tenth of the effort.
The "AI is useless" crowd truly makes me scratch my head.