A Gentle Intro to Transformer Neural Networks
A Gentle Intro to Transformer Neural Networks


The field of Natural Language processing has been witnessing a rapid acceleration in model improvement in the last few years. The majority of the state-of-the-art models in the field are based on the Transformer architecture. Examples include models like BERT (which when applied to Google Search, resulted in what Google calls ""one of the biggest leaps forward in the history of Search"") and OpenAI's GPT2 and GPT3 (which are able to generate coherent text and essays).

This talk by the author of the popular ""Illustrated Transformer"" guide will introduce the Transformer architecture and its various applications. This will be a visual presentation accessible to people with various levels of ML experience.


Passionate analytical expert in building and scaling great Internet companies and products. Learns, codes, illustrates, and teaches machine learning topics at every opportunity. Jay’s hands-on expertise covers the entire product life cycle from initial research, focus groups, user experience design, product prototyping, user testing, up to product release, marketing, and acting on deep analytics insights.

Open Data Science




Open Data Science
One Broadway
Cambridge, MA 02142

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Consent to display content from - Youtube
Consent to display content from - Vimeo
Google Maps
Consent to display content from - Google