Higher Vanilla results for Qwen3-VL-4B-Instruct & DMLR impairs performance

Hello, I tried to reproduce the Vanilla baseline. On MMVP with eager+float32, Vanilla scored **77.3**, while the score after adding DMLR was only **74.3**. It seems that DMLR has **_actually reduced the performance_** of Qwen3-VL-4B-Instruct. Could the authors share the Vanilla evaluation code so that the gain from DMLR can be verified?

<img width="690" height="670" alt="Image" src="https://github.com/user-attachments/assets/07d00e38-9d78-4480-ac7d-1ad18e6aae52" />

<img width="697" height="668" alt="Image" src="https://github.com/user-attachments/assets/e93a612a-edbd-44e9-9c82-2ad9eec59d69" />
