Skip to main content

Posts

2025

Things I learned digging into 5090 perf
·1045 words
Various approaches to parallelizing Muon
·1960 words

2024

Testing the 4090 48GB
·1050 words
Visualizing 6D Mesh Parallelism
·3342 words
Why reduction precision matters
·1056 words