New Wizard coder model posted and quantized by TheBloke!

noneabove1182@sh.itjust.works · 1 year ago

New Wizard coder model posted and quantized by TheBloke!

LinuxFanatic@lemmy.world · 1 year ago

On The Bloke’s hugging face repo, it says the GGML quants are not compatible with llama.cpp, anyone know why?

Kerfuffle@sh.itjust.works · edit-2 1 year ago

It’s a different type of model. llama.cpp only supports LLaMA models while GGML (the machine learning library llama.cpp is based on) has examples of various models with different architectures. WizardCoder, MPT, Bloom, probably very soon Falcon. Also some separate projects use GGML to support other models (including some of the ones I listed). For example the Rust “llm” project can support LLaMA models, MPT, BLOOM.