Natural Language Processing
- RWKV: Reinventing RNNs for the Transformer Era, June 12 2023
- Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory, May 30 2023
- First-principles on AI scaling, May 14 2023
- Why didn’t we get GPT-2 in 2005?, May 14 2023
- The GPT-3 Architecture, on a Napkin
- What Does BERT Look At? An Analysis of BERT’s Attention
- Scaling Transformer to 1M tokens and beyond with RMT
- ChatGPT: Optimizing Language Models for Dialogue
- How to run your own LLM GPT
- Visualizing A Neural Machine Translation Model (Mechanics of Seq2seq Models With Attention)
- The Illustrated Transformer
- Google “We Have No Moat, And Neither Does OpenAI”