coffee-gen-ai

Transformers

Transofrmers were first introduced in the paper “Attention Is All You Need”, Vaswani et al. (2017). This work was published by Google Brain at NeurIPS 2017.

From then this deep leaning architecture has become the main architecutre for many major breakthorughs to date. Here is the goole scholar page for this paper, and yet to date still being cited and work as a building block of majority of the NLP research.

Below are some of the great tutorials I find for learning about this amazing architecutre:

Here are some FAQs I often see people ask when they are introduced to transformers. This is also a great resource if you are trying to learn more about this architecture.