How does Lemmy feel about "open source" machine learning, akin to the Fediverse vs Social Media?

@[email protected] · edit-2 5 months ago

How does Lemmy feel about "open source" machine learning, akin to the Fediverse vs Social Media?

@[email protected] · 5 months ago

I manually specify what models to pull. I’m not running anything too crazy. My largest model is gemma27B. But I’ve worked with dolphin-mistral which was fun.

@[email protected] · 5 months ago

If you have a 24GB card, just go straight to the most recent Command R, a 3.75bpw-4bpw quantization. It’s incredible, and you can do the full 131K context on a 24GB GPU easy.

Gemma 27B Is actually quite good, but “narrow.” Its super low context and seems to be hyper optimized for short chatbot-arena style questions.

@[email protected] · edit-2 5 months ago

Gemma 27B Is actually quite good, but “narrow.” Its super low context and seems to be hyper optimized for short chatbot-arena style questions.

This is the stuff I love to know so thanks for sharing. I will be pulling Command R tomorrow.

@[email protected] · 5 months ago

Good! So Command-R excels at “RAG” style tasks like asking questions about a huge document, continuing a long story or so on. You should also read up on its super intricate system prompt format, which can steer it quite well.

I dunno about code, I tend to use Mistral Code 22B (or deepseek v2 API) for that.

I am happy to ramble on about this stuff, just ask.