Vision Transformer Self-Attention

This interactive visualization demonstrates how self-attention works in Vision Transformers. Explore how each image patch "attends" to other patches, creating powerful visual understanding.

Image Patches & Attention

0
eye
1
2
3
eye
4
5
nose
6
mouth
7
8
9
10
11
12
13
14
15

Self-Attention Tutorial

Step 1 of 2

To begin, click on any patch in the image to see how it attends to other patches.

Tip: The image shows a simple face pattern in the top half. Try clicking on different facial features to see how they relate to each other!

Mastodon