I finally got some time to learn some of the concepts underlying ChatGPT, GPT-3, and transformers. This is a great summary.
Transformers, explained: Understand the model behind GPT, BERT, and T5:
Watch this video for a detailed description of the groundbreaking transformer paper “Attention Is All You Need”, plus other technical explanations of encoders and decoders, with examples.
Illustrated Guide to Transformers Neural Network: A step by step explanation: