【Mika Muroi Archives】

2025-06-26 13:53:32

On August 25, Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7-billion-parameter foundational language model, Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios, including image- and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
