Quansloth
GUI based on the implementation of Google's TurboQuant
Artificial Intelligence
GitHub
Tech
Software Engineering
Based on the implementation of Google's TurboQuant (ICLR 2026), Quansloth brings KV cache compression to local LLM inference. It is a fully private, air-gapped AI server that runs long-context models natively on consumer hardware. Repository: PacifAIst/Quansloth