MiniCPM-V 4.6
ストックにはログインが必要です
Ultra-efficient 1.3B vision-language model for mobile
Artificial Intelligence
GitHub
Open Source
MiniCPM-V 4.6 is an open MLLM for image and video understanding on phones and consumer hardware, with mixed 4x/16x visual token compression, iOS/Android/HarmonyOS demos, and support for vLLM, SGLang, llama.cpp, and Ollama.
投票数: 0