How does Lemmy feel about "open source" machine learning, akin to the Fediverse vs Social Media?

@[email protected] · edit-2 6 months ago

How does Lemmy feel about "open source" machine learning, akin to the Fediverse vs Social Media?

@[email protected] · 6 months ago

The splitting is 80% of the cool factor for me. Rather than bog down the one node that can handle those cooler models, and have more contribution opportunities.

I wonder honestly if a petals network could be a target host on horde lol

@[email protected] · edit-2 6 months ago

The problem is that splitting models up over a network, even over LAN, is not super efficient. The entire weights need to be run through for every half word.

And the other problem is that petals just can’t keep up with the crazy dev pace of the LLM community. Honestly they should dump it and fork or contribute to llama.cpp or exllama, as TBH no one wants to split up LLAMA 2 (or even llama 3) 70B, and be a generation or two behind for a base instruct model instead of a finetune.

Even the horde has very few hosts relative to users, even though hosting a small model on a 6GB GPU would get you lots of karma.

The diffusion community is very different, as the output is one image and even the largest open models are much smaller. Lora usage is also standardized there, while it is not on LLM land.

@[email protected] · edit-2 6 months ago

I guess to me be able to serve the 408b model even though I’m on a laptop is just awesome to me.

Also I saw Lora was an option for Petals but I haven’t messed with it at all.