InternVL


Révision datée du 20 septembre 2025 à 10:10 par Pitpitt (discussion | contributions) (Page créée avec « ==en construction== == Définition == XXXXXXXXX == Français == ''' InternVL''' == Anglais == ''' InternVL''' A new family of open-source multimodal large language models that significantly advances capabilities in versatility, reasoning, and efficiency. The models range from 1B to 241B parameters and achieve state-of-the-art performance among open-source models while narrowing the gap with commercial systems like GPT-5. InternVL3.5 achieves impressive per... »)
(diff) ← Version précédente | Voir la version actuelle (diff) | Version suivante → (diff)

en construction

Définition

XXXXXXXXX

Français

InternVL

Anglais

InternVL

A new family of open-source multimodal large language models that significantly advances capabilities in versatility, reasoning, and efficiency. The models range from 1B to 241B parameters and achieve state-of-the-art performance among open-source models while narrowing the gap with commercial systems like GPT-5.
InternVL3.5 achieves impressive performance across multiple benchmarks. The largest model, InternVL3.5-241B-A28B, attains state-of-the-art results among open-source models and narrows the performance gap with GPT-5 to just 3.9% on general multimodal tasks. On reasoning benchmarks, the models show substantial improvements, with InternVL3.5-8B achieving 73.4 on MMMU and InternVL3.5-241B-A28B reaching 77.7. The Cascade RL framework provides up to 16.0% improvement in overall reasoning performance compared to the predecessor InternVL3.

Source

Source : huggingface

Contributeurs: wiki