Alibaba offers AI model for phone and laptop image and video processing
Alibaba Group Holding has launched Qwen2.5-Omni-7B, a multimodal AI model that processes text, images, audio, and video. With just 7bn parameters, the model is designed to run on mobile phones, tablets, and laptops, making advanced AI capabilities more accessible to everyday users. It accepts all of these input types and generates real-time responses as text or audio. Alibaba has released the model as open source, and it is available on Hugging Face, Microsoft’s GitHub, and Alibaba’s ModelScope. The model's versatility underscores the growing demand for AI systems that go beyond text generation.