Knowledge cutoff: 2025-04
Input modalities
Output modalities
Capabilities
Recent tweets and retweets from ModelScope
One image, controllable 3D output. TripoSplat from @tripoai is a lightweight image-to-3D Gaussian model for fast, controllable 3D asset generation.
🔗modelscope.ai/models/VAST-AI…
✅ Adjustable Gaussian count up to 262,144
✅ Two core files, ~2,000 LOC total, for lightweight…
Meet Cosmos3-Nano✨ Before Physical AI acts, it needs to imagine. NVIDIA Cosmos3-Nano makes that possible. 🧩
🧷modelscope.ai/models/nv-comm…
- 16B, split for reasoning + generation. Combines an 8B reasoner and an 8B generator to connect world understanding with world…
Singapore builders, we're hosting a ModelScope & Qwen meetup on June 10 🇸🇬
We'll talk about developers' commercial pathways with ModelScope and what people are actually building with Qwen, hear from the community, and have an open mic session for anyone who wants to…
Congrats to @PaddlePaddle on the open release of PaddleOCR-VL-1.6! 🚀
An upgraded compact document parsing model hitting a 96.33% score on OmniDocBench v1.6, outperforming top-tier VLMs.🤖
modelscope.ai/models/PaddleP…
📊 Benchmark Accuracy: Achieves 96.33% on OmniDocBench…
Say hello to OmniNFT, an RL fine-tuning framework that enables joint audio and video generation with better audio-visual synchronization. Built on LTX-2/2.3, with pretrained LoRA weights available.🚀
Collapsing multi-modal rewards into a single advantage causes conflicting…