Original Post
More relevant than ever, after using it for so long, after diving into the bits and pieces (thanks to Andrey and Georgiy), and after teaching a student how to write and run Transformer kernels efficiently in GLSL and MLIR. I really understand what it's doing, how exactly it's trained, and why it works.
We're deluding ourselves when we claim that high-level predictive patterns amount to meaning. This thing needs quantum luck, or biological neurons attached to chips, to become sentient and meaningful.
Use it to improve your mathematical models. The Transformer is the biggest achievement in mathematics.
#ML #transformer #meaning #hype
https://lnkd.in/gqMTruRE