AI多模态识别VLM大模型分享：LLaVA++、Qwen-VL、CogVLM2、MiniCPM、Florence-2_florence2 github

作者：Guff_9hys | 2024-07-18 09:26:57

踩

florence2 github

算法榜单

参考：https://github.com/open-compass/VLMEvalKit
在这里插入图片描述
https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models

https://huggingface.co/spaces/opencompass/open_vlm_leaderboard
在这里插入图片描述

具体算法

1）LLaVA++
https://github.com/mbzuai-oryx/LLaVA-pp
https://bengal-eminent-wasp.ngrok-free.app/
在这里插入图片描述

2）Qwen-VL
https://huggingface.co/spaces/Qwen/Qwen-VL-Plus

3）CogVLM2
https://huggingface.co/THUDM/cogvlm2-llama3-chinese-chat-19B
https://github.com/THUDM/CogVLM2

在线体验：
http://36.103.203.44:7861/
在这里插入图片描述

4）MiniCPM-Llama3-V-2_5
https://huggingface.co/spaces/openbmb/MiniCPM-Llama3-V-2_5

在这里插入图片描述

5）Florence-2

https://huggingface.co/spaces/gokaygokay/Florence-2
在这里插入图片描述

识别描述
在这里插入图片描述
检测、分割：

在这里插入图片描述

声明：本文内容由网友自发贡献，不代表【wpsshop博客】立场，版权归原作者所有，本站不承担相应法律责任。如您发现有侵权的内容，请联系我们。转载请注明出处：https://www.wpsshop.cn/w/Guff_9hys/article/detail/844739