I’ve installed koboldcpp on a ThinkPad X1 with 32 GB RAM and an i7-1355U, no GPU. Sure, it’s only around 1 token/s, but for chat it’s still usable (about 15 s per reply). The setup was easier than expected.

  • KinkyThoughts
    5 months ago

    15 seconds per reply at just 1 token/s?! How short are your replies? And what’s the context size being processed? I get like 5 tokens per second on my GPU and still need 1–2 minutes per reply at 4k context.

    • raffaOP
      5 months ago

      Context size is the default 4096; replies are around 16 tokens or so.

      • KinkyThoughts
        5 months ago

        I mean the actual amount of context that has to be processed per message (chat history, character cards, world info, etc.). And which model are you running?
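
The numbers quoted in this thread can be sanity-checked with simple arithmetic: reply latency is roughly prompt-processing time plus generation time. A minimal sketch, where all figures (16-token replies, 1 and 5 tokens/s generation, and a hypothetical 10 tokens/s prompt-eval speed) are assumptions taken from or illustrating the posts above, not measurements:

```python
def reply_latency(reply_tokens, gen_tps, prompt_tokens=0, prompt_tps=None):
    """Estimated seconds per reply: optional prompt processing plus generation."""
    t = reply_tokens / gen_tps
    if prompt_tokens and prompt_tps:
        t += prompt_tokens / prompt_tps
    return t

# OP's case: ~16-token replies at ~1 token/s, prompt largely cached
# between turns, so generation dominates.
print(reply_latency(16, 1.0))  # ~16 s, matching the "about 15 s" claim

# If a full 4096-token context had to be reprocessed each turn at an
# assumed 10 tokens/s prompt eval, that step would dominate instead.
print(reply_latency(16, 5.0, prompt_tokens=4096, prompt_tps=10.0))  # ~413 s
```

This is why the follow-up question about the *actual* processed context matters: with prompt caching, short replies are cheap even at 1 token/s, but reprocessing a long context every turn would make replies far slower than 15 s.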