Just copped a 6000 Blackwell Pro. Which llm to install first?
| puce goyim | 06/01/26 | | puce goyim | 06/02/26 | | 180 stead | 06/02/26 | | puce goyim | 06/02/26 | | 180 stead | 06/02/26 | | puce goyim | 06/02/26 | | 180 stead | 06/03/26 | | puce goyim | 06/03/26 | | 180 stead | 06/03/26 | | crystalline nowag roast beef | 06/03/26 | | at-the-ready property | 06/02/26 | | Slap-happy Locus Jewess | 06/02/26 | | puce goyim | 06/02/26 | | puce goyim | 06/03/26 | | at-the-ready property | 06/03/26 | | 180 stead | 06/03/26 | | puce goyim | 06/03/26 | | 180 stead | 06/03/26 | | 180 stead | 06/03/26 | | garnet temple indirect expression | 06/03/26 | | puce goyim | 06/03/26 | | Marvelous station rigpig | 06/03/26 | | garnet temple indirect expression | 06/03/26 | | crystalline nowag roast beef | 06/03/26 |
Poast new message in this thread
 |
Date: June 3rd, 2026 12:59 PM Author: 180 stead
On one server I have a 5090, a 5080, a 5070 ti, and a 5060 ti, but I'm only using the 5090 and 5080 to run my Hermes Agent right now, because I only need 48gb to run Qwen3.6 27b at Q8. I have two 3090s sitting in another server and I use them to do OCR, translation, image gen and shit like that. I just have a bunch of LXC containers running Ollama on the 3090s, and my Hermes Agent connects to whatever Ollama container it needs for a given task. I used to run a separate model for coding tasks, but Qwen3.6 does that well enough now. I only paid about $700 apiece for the 3090s plus another $100 to get them re-pasted by a shop, now they go for like $1200 on ebay with 6-year old paste lol
(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2#49912852) |
|
|