Transformer Series

In this series of articles we explore a special type of sequence-to-sequence model: Transformers. They are big architectures with a lot of parts, and they are used for language modeling, machine translation, image captioning and text generation....
Transformer with Python and TensorFlow 2.0 – Training

So far in our journey through the interesting architecture of the Transformer we have covered several topics. First, we had a chance to see what this huge system looks like from a higher level. We saw how this type of sequence-to-sequence model harnesses the same principles...
Transformer with Python and TensorFlow 2.0 – Attention Layers

In the previous article, we got familiar with the architecture of the Transformer. We also explored the architectures that came before it, such as Recurrent Neural Networks and Long Short-Term Memory networks, and the problems they had. We...
Introduction to Transformers Architecture

We know that we used the Transformers logo in the featured image, so if you are a toy, movie or cartoon fan, sorry to disappoint you. We won’t cover any of those topics in this blog post. However, if you are a data science and deep learning fan, you are in the right...