FlashGRPO

2025-10-09 — Written by Matchew

sidequests to make grpo faster

Replicating The Circuit Kings

2025-04-12 — Written by Matchew

replicating ‘circuit tracing: revealing computational graphs in language models’ by the absolute beasts over at anthropic

2025-03-15 — Written by Matchew

a bit of stream of consciousness poasting

2025-02-12 — Written by Matchew

Legal Mech Interp

2025-02-08 — Written by Matchew

verify the unverifiable

2025-01-12 — Written by Matchew

Exploring hard negative mining with BM25, self-selection, bandits, and Faiss

2024-12-06 — Written by Matchew

legalbench, a private legal benchmark

2024-11-15 — Written by Matchew

we do a little tree searchin’

2024-11-04 — Written by Matchew

why you explodin’ fam

2023-04-29 — Written by Matchew

freemium is better than premium