minus-squareSpicyToaster420@sopuli.xyztoProgramming@programming.dev•Surprisingly Fast AI-Generated Kernels We Didn’t Mean to Publish (Yet)linkfedilinkarrow-up4·2 days agoAwesome use of LLMs. I wonder they didn’t use FP8 quantization though, especially since their target hardware was an L40s. linkfedilink
minus-squareSpicyToaster420@sopuli.xyztoProgrammer Humor@programming.dev•Happens oftenlinkfedilinkarrow-up11·14 days agoCUDA woulda shoulda linkfedilink
Awesome use of LLMs. I wonder they didn’t use FP8 quantization though, especially since their target hardware was an L40s.