This is a brief introduction to transformer that simplifies the model’s structure to help build a quick understanding of what it is trying to realize. The original blogs and video where I learned about it have been posted below.

Continue reading

Author's picture

Xiaocao You

A Junior Student Majored in Statistics in Shanghai University of Finance and Economics

Shanghai, China