RE: Run even larger AI models locally with LM Studio

apshamilton in LeoFinance • 2 years ago

I'm getting 23 tokens per second using the 5-bit Mixtral 2.7 model.

themarkymark 2 years ago

Macs have a big edge for this.
I'd recommend the 4-bit quant; the 5-bit isn't much better and takes a lot more RAM. I'd stick with 4-bit, or go up to something like 8-bit if you have the memory for it.
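
For a rough feel of how bit width translates into RAM, here is a back-of-the-envelope sketch. It only counts the weights (parameters × bits ÷ 8) and ignores quantization scale overhead, the KV cache, and runtime buffers, so real usage will be higher. The function name is made up for illustration, and the 46.7 billion parameter count is an assumption for a Mixtral 8x7B-class model, not something stated in this thread.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in gigabytes (1 GB = 1e9 bytes).

    Counts weights only: no KV cache, no quantization metadata, no runtime overhead.
    """
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 1e9


if __name__ == "__main__":
    # ASSUMPTION: roughly 46.7 billion parameters (Mixtral 8x7B-class model).
    assumed_params = 46.7e9
    for bits in (4, 5, 8):
        size = weight_memory_gb(assumed_params, bits)
        print(f"{bits}-bit: ~{size:.1f} GB for weights alone")
```

Under these assumptions, each extra bit per weight costs about one eighth of a byte per parameter, so going from 4-bit to 5-bit adds roughly 25% to the weight footprint.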
