DeepSeek-V3 Breaks New Ground: The World's Largest Open-Source AI Model!

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 2 days ago

JoeByeThen [he/him, they/them] · 17 hours ago

Ah yeah, I remember when the "We Have No Moat" article dropped. It's wild because for years I was on the cutting edge of what was going on: tinkering with Java-based neural network apps, then Python-based tensors. Right around when Transformers dropped I was pulled away from my hobbies for familial reasons, and I've been playing catch-up ever since. Everything is happening very fast and I've got so much to do that I just can't find the time to stay on top of it all. Or the money, tbh. But, yeah, there's a lot of potential that the Left (in these parts) have stuck their fingers in their ears about. Especially since resistance is moving in a more physical direction while the infrastructure of our oppression is built on the cloud.

I saw this interesting video the other day. Basically, since some of these mini-PCs share their memory with the onboard GPU, they can load up the 70B models. Slow as hell, but if you're running everything through a queue it'd be pretty handy.

https://www.youtube.com/watch?v=xyKEQjUzfAk
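
For a rough sense of why shared memory matters here, this is a back-of-the-envelope sketch of how much memory the weights of a 70B model take at a few precisions. The function name and the choice of 16/8/4-bit are just my assumptions for illustration, not anything from the video. It only counts weights, so real usage will be higher once you add the KV cache and runtime overhead.

```python
def weight_memory_gb(n_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes).

    Hypothetical helper for illustration: ignores KV cache, activations,
    and any runtime overhead, so treat the result as a lower bound.
    """
    return n_params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    # Assumed precisions: 16-bit (unquantized half precision), 8-bit, 4-bit.
    for bits in (16, 8, 4):
        gb = weight_memory_gb(70e9, bits)
        print(f"70B params @ {bits}-bit: ~{gb:.0f} GB for weights")
```

So a 4-bit 70B model wants roughly 35 GB for weights alone, which is more than most consumer GPUs carry but can fit on a mini-PC that lets the GPU use a large pool of system RAM.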

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 9 hours ago

I've kind of given up trying to keep up with the details as well; stuff is moving way too fast for that. I'm really encouraged by the fact that open source models have consistently managed to keep up with, and often outperform, commercial ones.

There's also stuff like Petals that's really exciting. It's basically a similar idea to SETI@home and torrents, where you have a big network doing the computing so you can amortize the work that way. This seems like a really good approach for running big models by leveraging volunteer resources.

https://github.com/bigscience-workshop/petals
