inari@piefed.zip to Technology@lemmy.worldEnglish · 1 day agoDeepSeek ditches Nvidia for Huawei chips in V4 launchcybernews.comexternal-linkmessage-square52linkfedilinkarrow-up1276arrow-down15
arrow-up1271arrow-down1external-linkDeepSeek ditches Nvidia for Huawei chips in V4 launchcybernews.cominari@piefed.zip to Technology@lemmy.worldEnglish · 1 day agomessage-square52linkfedilink
minus-squarebrucethemoose@lemmy.worldlinkfedilinkEnglisharrow-up4arrow-down1·24 hours agoI just meant for mass inference serving. Yeah, I haven’t seen much in the way of bitnet training savings yet, like regular old QAT. It does appear that Deepseek is finetuning their MoEs in a 4-bit format now, though.
I just meant for mass inference serving.
Yeah, I haven’t seen much in the way of bitnet training savings yet, like regular old QAT. It does appear that Deepseek is finetuning their MoEs in a 4-bit format now, though.