Ovis

Under construction

Definition

XXXXXXXXX

French

Ovis 2,5

English

Ovis2.5

Ovis2.5 is a multimodal large language model designed to process images at their native resolutions and to reason reflectively. It targets two limitations of current vision-language systems: the degradation caused by fixed-resolution image processing, and the lack of reflective reasoning beyond simple chain-of-thought approaches.
By processing images at native resolution and adding self-corrective reasoning, Ovis2.5 is reported to improve substantially over previous models while staying efficient through an optimized training infrastructure.

Source

Source: Hugging Face