Implementing Transformer Attention from Scratch
A practical guide to reproducing attention kernels and validating correctness.
2026-01-20
Writing Archive
A practical guide to reproducing attention kernels and validating correctness.
2026-01-20
Why retrieval quality and ranking calibration dominate downstream generation quality.
2025-10-06