Researchers upend AI status quo by eliminating matrix multiplication in LLMs

BrikoX@lemmy.zip · 8 months ago

Researchers upend AI status quo by eliminating matrix multiplication in LLMs

The Snark Urge@lemmy.world · 8 months ago

NVIDIA 📉

Transient Punk · 8 months ago

SturgiesYrFase@lemmy.ml · 8 months ago

I don’t really want to stop, and admit it, you don’t want that either. ;)

bitfucker@programming.dev · edit-2 8 months ago

Good

Edit: Oh shit nvm. It still requires dedicated HW (FPGA). This is no different than say, an NPU. But to be fair, they also said the researcher tested the model on traditional GPU too and reduce memory consumption.

Pennomi@lemmy.world · 8 months ago

Only for maximum efficiency. LLMs already run tolerably well on normal CPUs and this technique would make it much more efficient there as well.