DeepSeek founder’s latest paper proposes new AI model training to bypass GPU limits

Source: Tech – South China Morning Post

A technical paper co-authored by Liang Wenfeng, the founder of Chinese artificial intelligence start-up DeepSeek, and a group of Peking University researchers has proposed a new model training technique, which they say can facilitate “aggressive parameter expansion” by bypassing graphics processing unit (GPU) memory constraints.
The development underscores the Hangzhou start-up’s continued focus on maximising cost efficiency amid a deficit in computational power relative to US industry leaders,…
