\
  The most prestigious law school admissions discussion board in the world.
BackRefresh Options Favorite

Just copped a 6000 Blackwell Pro. Which llm to install first?

...
Ocher Fortuitous Meteor Jew
  06/01/26
...
Ocher Fortuitous Meteor Jew
  06/02/26
lol there's nothing better than Qwen3.6 27b that will fit on...
dull round eye
  06/02/26
Wtf?? Which one am I missing out on that needs 256gb?
Ocher Fortuitous Meteor Jew
  06/02/26
https://huggingface.co/antirez/deepseek-v4-gguf https://h...
dull round eye
  06/02/26
Are you supposed to get a Mac ultra to run those? I read tha...
Ocher Fortuitous Meteor Jew
  06/02/26
People are still buying 3090s
dull round eye
  06/03/26
What do you have
Ocher Fortuitous Meteor Jew
  06/03/26
On one server I have a 5090, a 5080, a 5070 ti, and a 5060 t...
dull round eye
  06/03/26
just do your own paste my man, it's incredibly easy if you k...
razzmatazz party of the first part space
  06/03/26
take ur pick from any chinese model that fits from https://w...
stimulating pisswyrm institution
  06/02/26
...
Heady henna business firm
  06/02/26
Are the Chinese model superior at logical reasoning
Ocher Fortuitous Meteor Jew
  06/02/26
Should I keep this or sell it and get a 5090 or M4 ultra? Lo...
Ocher Fortuitous Meteor Jew
  06/03/26
there might be some M4s available for cheap soon when the M5...
stimulating pisswyrm institution
  06/03/26
Can you even buy M4 Ultras? I've been looking at Mac Studios...
dull round eye
  06/03/26
GPT/claude said Macs can run larger models but would be so s...
Ocher Fortuitous Meteor Jew
  06/03/26
Right. I think the ideal Mac has to be the Studio with an Ul...
dull round eye
  06/03/26
https://x.com/i/status/2062264195958141422
dull round eye
  06/03/26
What is this used for anyway?
Bright Bull Headed Nowag
  06/03/26
I would need at least 20 hours to explain this to you
Ocher Fortuitous Meteor Jew
  06/03/26
Can your graphics card speed that up?
Slap-happy twinkling uncleanness whorehouse
  06/03/26
Do you stick part of it up your ass?
Bright Bull Headed Nowag
  06/03/26
...
razzmatazz party of the first part space
  06/03/26


Poast new message in this thread



Reply Favorite

Date: June 1st, 2026 11:11 AM
Author: Ocher Fortuitous Meteor Jew



(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49909993)



Reply Favorite

Date: June 2nd, 2026 1:22 AM
Author: Ocher Fortuitous Meteor Jew



(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49910857)



Reply Favorite

Date: June 2nd, 2026 2:37 AM
Author: dull round eye

lol there's nothing better than Qwen3.6 27b that will fit on that card. You won't even use half your VRAM, but anything better requires more than 96gb. You either need 48gb or 256gb, and nothing in between really counts

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49910875)



Reply Favorite

Date: June 2nd, 2026 1:00 PM
Author: Ocher Fortuitous Meteor Jew

Wtf??

Which one am I missing out on that needs 256gb?

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49911186)



Reply Favorite

Date: June 2nd, 2026 4:41 PM
Author: dull round eye

https://huggingface.co/antirez/deepseek-v4-gguf

https://huggingface.co/unsloth/Step-3.7-Flash-GGUF

You could run the second one at Q2 or Q3 but what's the point?

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49911462)



Reply Favorite

Date: June 2nd, 2026 8:14 PM
Author: Ocher Fortuitous Meteor Jew

Are you supposed to get a Mac ultra to run those? I read that Mac m4 with max ram is still slow as fuck vs nvidia

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49911933)



Reply Favorite

Date: June 3rd, 2026 7:15 AM
Author: dull round eye

People are still buying 3090s

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49912466)



Reply Favorite

Date: June 3rd, 2026 12:51 PM
Author: Ocher Fortuitous Meteor Jew

What do you have

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49912844)



Reply Favorite

Date: June 3rd, 2026 12:59 PM
Author: dull round eye

On one server I have a 5090, a 5080, a 5070 ti, and a 5060 ti, but I'm only using the 5090 and 5080 to run my Hermes Agent right now, because I only need 48gb to run Qwen3.6 27b at Q8. I have two 3090s sitting in another server and I use them to do OCR, translation, image gen and shit like that. I just have a bunch of LXC containers running Ollama on the 3090s, and my Hermes Agent connects to whatever Ollama container it needs for a given task. I used to run a separate model for coding tasks, but Qwen3.6 does that well enough now. I only paid about $700 apiece for the 3090s plus another $100 to get them re-pasted by a shop, now they go for like $1200 on ebay with 6-year old paste lol

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49912852)



Reply Favorite

Date: June 3rd, 2026 9:49 PM
Author: razzmatazz party of the first part space

just do your own paste my man, it's incredibly easy if you know what you're doing

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49914109)



Reply Favorite

Date: June 2nd, 2026 3:33 AM
Author: stimulating pisswyrm institution

take ur pick from any chinese model that fits from https://willitrunai.com/

American OSS models in that range suck, I haven't tried the newer Mistral models but doubt they're competitive either

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49910878)



Reply Favorite

Date: June 2nd, 2026 1:03 PM
Author: Heady henna business firm



(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49911193)



Reply Favorite

Date: June 2nd, 2026 1:20 PM
Author: Ocher Fortuitous Meteor Jew

Are the Chinese model superior at logical reasoning

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49911219)



Reply Favorite

Date: June 3rd, 2026 2:24 AM
Author: Ocher Fortuitous Meteor Jew

Should I keep this or sell it and get a 5090 or M4 ultra? Looks like the price went up $2k again over the weekend

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49912354)



Reply Favorite

Date: June 3rd, 2026 2:17 PM
Author: stimulating pisswyrm institution

there might be some M4s available for cheap soon when the M5s get announced and all the openclaw retards want to upgrade their "secure agent hosts"

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49913125)



Reply Favorite

Date: June 3rd, 2026 2:23 PM
Author: dull round eye

Can you even buy M4 Ultras? I've been looking at Mac Studios for a few weeks, and they only let you spec it with a M3 Ultra or M4 Pro. And they won't let you add more than 96gb of RAM (which to be fair is plenty).

Anyway the Ultra models are never getting cheaper. The M5 is the same design as the M4 and M3, just made using a newer process at the TSMC foundry. You get insane memory bandwidth with all of them, it's just a matter of how many GPU cores you have to crunch on it. A M2 Ultra probably beats M5 Pro for AI just because it has more GPU cores.

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49913147)



Reply Favorite

Date: June 3rd, 2026 3:46 PM
Author: Ocher Fortuitous Meteor Jew

GPT/claude said Macs can run larger models but would be so slow that it would practically be unusable. Like waiting an hour for a response or 8 hours to do a full brief draft

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49913303)



Reply Favorite

Date: June 3rd, 2026 4:37 PM
Author: dull round eye

Right. I think the ideal Mac has to be the Studio with an Ultra -series and 64gb VRAM. That lets you use 48gb comfortably with room to spare. I wouldn't plan on using more than 48gb with a Mac and I can only see 96gb being useful if you want to have two models running concurrently. What I like about my GPU spread is the ease of serving up multiple LLMs concurrently l. My OCR -> vector embedding workflow is pretty ridic, and I have a server that translates media from any language and produces an English transcript.



(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49913438)



Reply Favorite

Date: June 3rd, 2026 5:33 PM
Author: dull round eye

https://x.com/i/status/2062264195958141422

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49913592)



Reply Favorite

Date: June 3rd, 2026 5:36 PM
Author: Bright Bull Headed Nowag

What is this used for anyway?

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49913600)



Reply Favorite

Date: June 3rd, 2026 6:54 PM
Author: Ocher Fortuitous Meteor Jew

I would need at least 20 hours to explain this to you

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49913797)



Reply Favorite

Date: June 3rd, 2026 7:09 PM
Author: Slap-happy twinkling uncleanness whorehouse

Can your graphics card speed that up?

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49913823)



Reply Favorite

Date: June 3rd, 2026 9:37 PM
Author: Bright Bull Headed Nowag

Do you stick part of it up your ass?

(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49914082)



Reply Favorite

Date: June 3rd, 2026 9:49 PM
Author: razzmatazz party of the first part space



(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49914106)