A breakdown of how transformer models work (AlexNet, an image classifier) [edit: more about CNNs than transformers, I misunderstood]

BountifulEggnog [she/her] · edit-2 5 months ago

BountifulEggnog [she/her] · 5 months ago

Of course I messed it up. I thought the transformer paper was newer than 2012, but I remembered transformers being mentioned at the beginning of the video. I should have rewatched it to make sure I understood.

KnilAdlez [none/use name] · 5 months ago

Honestly, the video made it sound like CNNs were a part of transformers, so I'd blame the video before yourself.
