MTP

#2
by RedDragonGecko - opened

Any chance or interest in making MTP enabled quants?

Owner

Since I've already shifted the weights to my slow storage and I'd have to re-make the BF16, I'll wait for MTP to be available upstream. Once that happens I'll redo the uploads with the MTP heads.

@AesSedai Any chance to revisit? 😀

AFAIK llama.cpp doesn't support the GLM-style MTP yet?

Ah, okay. Thank you :)

Sign up or log in to comment