DeepSeek-VL2 - MoE vision-language, now easier to access

in steemhunt •  12 days ago 


Screenshots

(screenshot: zz.png)


Hunter's comment

DeepSeek made waves with their R1 language model, but their multimodal capabilities (especially image understanding) have lagged behind. They are evolving rapidly, though: DeepSeek-VL2, their new open-source family of Mixture-of-Experts (MoE) vision-language models, is a big step forward, achieving strong multimodal understanding with a much smaller activated parameter count thanks to its MoE design.

And the exciting news: there is a new Hugging Face Spaces demo, so you can now try these models without deploying them yourself (normally you would need more than 80GB of GPU memory, which is out of reach for most of us).

So check it out and see what DeepSeek brings next to surprise everyone.


Link

https://www.producthunt.com/posts/deepseek-vl2



Steemhunt.com

This is posted on Steemhunt - A place where you can dig products and earn STEEM.
View on Steemhunt.com
