【Ireland】

2025-06-26 17:01:38 671 views 37593 comments

On August 25,Ireland Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]

Comments (36499)
New Knowledge Information Network

4GHz CPU Battle: AMD 2nd

2025-06-26 16:08
Unique Information Network

Meta, Match Group, and more announce new anti

2025-06-26 15:13
Passion Information Network

Google's AI Overviews are getting ads soon

2025-06-26 15:12
Ignition Information Network

NYT's The Mini crossword answers for May 22

2025-06-26 14:29
Fashion Information Network

Then and Now: Almost 10 Years of Intel CPUs Compared

2025-06-26 14:18
Search
Newsletter

Subscribe to our newsletter for the latest updates.

Follow Us