My personal collection of interesting models I've quantized from the past week (yes, just one week)

noneabove1182 · 9 months ago

will_a113@lemmy.ml · 9 months ago

Do you do any kind of before/after testing of these to measure performance/accuracy changes? I’ve always wondered if there is some way to generalize the expected performance changes at different quantizations.

noneabove1182 · 9 months ago

You can get the resulting PPL (perplexity), but that's only going to give you a sanity check at best. In an ideal world there would be something like lmsys' chat arena that could compare unquantized vs. quantized models, but that doesn't exist yet.
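
For anyone curious what that PPL sanity check amounts to, here is a minimal sketch. It assumes you have already collected per-token natural-log probabilities for the same evaluation text from both the unquantized and the quantized model; the function names `perplexity` and `relative_ppl_increase` are made up for illustration and are not part of any quantization tool.

```python
import math


def perplexity(token_logprobs):
    """Perplexity = exp(-mean of the natural-log token probabilities)."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def relative_ppl_increase(base_logprobs, quant_logprobs):
    """Fractional PPL change of the quantized model vs. the unquantized one.

    Both lists are assumed to come from scoring the same tokens.
    """
    base = perplexity(base_logprobs)
    quant = perplexity(quant_logprobs)
    return (quant - base) / base
```

A small increase suggests the quant didn't break anything obvious, but it says little about how the model behaves in actual chat use, which is why it is only a sanity check.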

turkishdelight@lemmy.ml · 9 months ago

Upload them to ollama so we can use them too.

ffhein@lemmy.world · 8 months ago

Does ollama even support exl2?