CrackedAI
Problems Tracks Learn JAX Roadmap Articles
Log in Sign up
Problems Tracks Learn JAX Articles Roadmap
Log in Sign up
Radio

We can't find the internet

Attempting to reconnect

Something went wrong!

Attempting to reconnect

Articles

Long-form theory: intuitions, derivations, and modern variants. Each article has questions sprinkled throughout โ€” click to reveal the answer when you've thought about it.

  • All About Attention

    A modern theory guide to attention โ€” soft lookup, scaled dot-product, multi-head, induction heads, RoPE and YaRN context extension, GQA/MQA/MLA, sliding window with sinks, paged KV cache, sparse, linear, FlashAttention's online softmax, and the recent generation of small but consequential tweaks.