It's been a while, and I've been digging into machine learning.
I was watching the excellent video by Andrej Karpathy about how to write a GPT (of which GPT-3 is an example) from scratch, using the paper "Attention Is All You Need".
I implemented it from s...