Switch to Qwen3-VL #1167

@gvzdv

Description

The new Qwen vision model (Qwen3-VL) is out, and on paper it's better than the Qwen2.5-VL we use now.

I'd like to try it out, which means we'll need to set aside time to swap models on Pegasus and test the new one.
There are several model variants (thinking/non-thinking, various sizes). I'm considering a non-thinking (for speed), non-quantized 8B model; let's see if we can fit it on our GPU, especially if we can deprecate YOLO. Alternatives would be an 8-bit-quantized 8B model or an FP16 4B model.
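As a rough sanity check before swapping models on Pegasus, the weight footprint of each variant can be estimated as parameter count times bytes per parameter. This is a back-of-the-envelope sketch: the 20% headroom factor and the exact parameter counts are assumptions, not measured numbers, and it ignores the vision tower's real activation cost.

```python
def weight_gb(params_b: float, bytes_per_param: float) -> float:
    """Approximate weight memory in GB: params (billions) * bytes per param.
    1e9 params * N bytes == N GB (using 1 GB = 1e9 bytes)."""
    return params_b * bytes_per_param

# The three variants under discussion (sizes are nominal, not exact).
variants = {
    "8B FP16 (non-quantized)": weight_gb(8, 2),  # ~16 GB
    "8B 8-bit": weight_gb(8, 1),                 # ~8 GB
    "4B FP16": weight_gb(4, 2),                  # ~8 GB
}

for name, gb in variants.items():
    # Assumed ~20% headroom for activations, KV cache, and the vision encoder.
    print(f"{name}: ~{gb:.0f} GB weights, ~{gb * 1.2:.0f} GB with headroom")
```

By this estimate only the quantized 8B or the FP16 4B leaves room next to YOLO on a 24 GB card, which is why deprecating YOLO first makes the non-quantized 8B plausible.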
