an-in-depth-look-at-deepseek:-deepseekmoe-and-deepseekmla,-cheap-v3-training,-the-us-chip-ban,-“distillation”-from-other-models,-nvidia-impact,-agi,-and-more-(ben-thompson/stratechery)

An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, “distillation” from other models, Nvidia impact, AGI, and more (Ben Thompson/Stratechery)

Ben Thompson / Stratechery:
An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, “distillation” from other models, Nvidia impact, AGI, and more  —  It’s Monday, January 27.  Why haven’t you written about DeepSeek yet?  —  I did!  I wrote about R1 last Tuesday.

Posted In :

Leave a Reply

Your email address will not be published. Required fields are marked *