Pro@programming.dev to Programming@programming.devEnglish · 7 days agoSurprisingly Fast AI-Generated Kernels We Didn’t Mean to Publish (Yet)crfm.stanford.eduexternal-linkmessage-square5fedilinkarrow-up123arrow-down16cross-posted to: [email protected]
arrow-up117arrow-down1external-linkSurprisingly Fast AI-Generated Kernels We Didn’t Mean to Publish (Yet)crfm.stanford.eduPro@programming.dev to Programming@programming.devEnglish · 7 days agomessage-square5fedilinkcross-posted to: [email protected]
minus-squareSpicyToaster420@sopuli.xyzlinkfedilinkarrow-up4·5 days agoAwesome use of LLMs. I wonder they didn’t use FP8 quantization though, especially since their target hardware was an L40s.
Awesome use of LLMs. I wonder they didn’t use FP8 quantization though, especially since their target hardware was an L40s.