R1 has multiple deployable model ranging from 1.5 billion parameters (weak) which even I can run on my system to 671b parameters model (needs 32 GB VRAM GPU and ~400 GB Storage). This one is the strongest, but takes a lot more resources to deploy. We just got a gaming GPU with that much VRAM 5090. $2K for a GPU is insane though 🤪
You are viewing a single comment's thread from:
We did it on a VPS with low specs, but it took too long to develop a response and gave false information. Perhaps it is because of the components
I have now checked the 7B model I had used.
That explains it. It's really weak with just 7 billion parameters compared to the best 671 billion parameters.