Skip to main content

Posts

2025

Various approaches to parallelizing Muon
·2019 words

2024

Testing the 4090 48GB
·1050 words
Visualizing 6D Mesh Parallelism
·3342 words
Why reduction precision matters
·1095 words