Julio's dev
📄 Paper

[Review] Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond ('23. 8)