Recommended Models
All of these models are highly recommended for newer users: they are easy to use and use the chat template files from Twinz.
Model Size | Description | Links |
---|---|---|
7b | CPU Friendly, small, okay quality | https://huggingface.co/TheBloke/dolphin-2.6-mistral-7B-GGUF |
2x7b | Normal sized, good quality | Removed for the time being; the model was acting up |
8x7b | Big, great quality | https://huggingface.co/TheBloke/dolphin-2.7-mixtral-8x7b-GGUF |
70b | Large, hard to run, very high quality | https://huggingface.co/TheBloke/dolphin-2.2-70B-GGUF |
Quant Mode | Description |
---|---|
Q3 | Smallest, significant quality loss - not recommended |
Q4 | Medium, balanced quality |
Q5 | Large, very low quality loss - recommended for most users |
Q6 | Very large, extremely low quality loss |
Q8 | Extremely large, extremely low quality loss, hard to use - not recommended |
None | Extremely large, no quality loss, super hard to use - really not recommended |
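
To get a feel for why the higher quant modes are harder to run, here is a minimal sketch that estimates weight size from parameter count. The bits-per-weight values are nominal assumptions for illustration (3 for Q3 up to 8 for Q8, and 16 for "None", assuming unquantized means 16-bit weights). Real GGUF files also store scaling data, so actual files come out somewhat larger. The names `NOMINAL_BITS` and `estimate_size_gb` are hypothetical.

```python
# Nominal bits per weight for each quant mode. These are assumptions for
# illustration; real GGUF files store extra scale data and are somewhat larger.
NOMINAL_BITS = {"Q3": 3, "Q4": 4, "Q5": 5, "Q6": 6, "Q8": 8, "None": 16}


def estimate_size_gb(params_billions: float, quant: str) -> float:
    """Return a rough weight size in GB (1 GB = 10**9 bytes)."""
    bits = NOMINAL_BITS[quant]
    # (params_billions * 1e9 weights) * (bits / 8 bytes each) / 1e9 bytes per GB
    return params_billions * bits / 8


if __name__ == "__main__":
    for quant in NOMINAL_BITS:
        print(f"7b at {quant}: ~{estimate_size_gb(7, quant):.1f} GB")
```

Treat the result as a lower bound on the download size, not as a RAM requirement; running a model needs extra memory on top of the weights.
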
Rough estimates of the minimum RAM and VRAM requirements for each model size:
- 7b: System RAM: 10 GB / VRAM: 2 GB
- 2x7b: System RAM: 25 GB / VRAM: 8 GB
- 8x7b: System RAM: 55 GB / VRAM: 28 GB
- 70b: System RAM: 105 GB / VRAM: AI Card or better
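
The estimates above can be turned into a quick check of which model sizes a machine might handle. This is only a sketch: the numbers are copied from the list, the helper name `models_that_fit` is hypothetical, and the 70b entry is skipped because its VRAM requirement ("AI Card or better") is not a number that can be compared.

```python
# (minimum system RAM in GB, minimum VRAM in GB), copied from the list above.
# The 70b entry has no numeric VRAM figure, so it is stored as None.
REQUIREMENTS = {
    "7b": (10, 2),
    "2x7b": (25, 8),
    "8x7b": (55, 28),
    "70b": (105, None),
}


def models_that_fit(ram_gb: float, vram_gb: float) -> list[str]:
    """Return the model sizes whose rough minimums are met, smallest first."""
    fits = []
    for size, (min_ram, min_vram) in REQUIREMENTS.items():
        if min_vram is None:
            continue  # cannot be checked numerically; decide this one by hand
        if ram_gb >= min_ram and vram_gb >= min_vram:
            fits.append(size)
    return fits


if __name__ == "__main__":
    # Example: a desktop with 32 GB of RAM and an 8 GB GPU.
    print(models_that_fit(32, 8))
```

Since the figures are rough estimates, a model that only just fits may still run slowly.
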