Running a Vicuna-13B 4it model ?

I found this model : 
[[ggml-vicuna-13b-4bit](https://huggingface.co/eachadea/ggml-vicuna-13b-4bit)](https://huggingface.co/eachadea/ggml-vicuna-13b-4bit/tree/main) and judging by their online demo it's very impressive.
I tried to run it with llama.cpp latest version - the model loads fine, but as soon as it loads it starts hallucinating and quits by itself. 
Do I need to have it converted or something like that ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Running a Vicuna-13B 4it model ? #771

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Running a Vicuna-13B 4it model ? #771

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions