Contributions by Pitpitt



7 June 2025

27 May 2025

26 May 2025

23 May 2025

  • 11:25, 23 May 2025 (−38) WEB-SHEPHERD: No edit summary
  • 11:21, 23 May 2025 (+3) WEB-SHEPHERD: No edit summary
  • 11:04, 23 May 2025 (+724) N WEB-SHEPHERD: Page created with « ==under construction== == Definition == XXXXXXXXX == French == ''' XXXXXXXXX ''' == English == '''WEB-SHEPHERD''' The first process reward model (PRM) specifically designed for web navigation tasks. It addresses the challenges of evaluating web agent trajectories at a step-by-step level, which is crucial for improving agent performance in long-horizon web tasks. WEB-SHEPHERD is designed as a process reward model that evaluates web navigation trajectories at... »

20 May 2025

  • 10:36, 20 May 2025 (+1,819) N Qwen: Page created with « ==under construction== == Definition == XXXXXXXXX == French == '''Qwen''' == English == '''Qwen''' The latest version of the Qwen model family. Qwen3 comprises a series of large language models (LLMs) designed to advance performance, efficiency, and multilingual capabilities. The Qwen3 series includes models of both dense and Mixture-of-Expert (MoE) architectures, with parameter scales ranging from 0.6 to 235 billion. A key innovation in Qwen3 is the int... »

17 May 2025

  • 20:51, 17 May 2025 (+1) MiMo: No edit summary
  • 20:48, 17 May 2025 (+578) N MiMo: Page created with « ==under construction== == Definition == XXXXXXXXX == French == '''MiMo-7B''' == English == '''MiMo-7B''' MiMo-7B, a large language model specifically designed for reasoning tasks. The model is optimized across both pre-training and post-training stages to unlock its reasoning potential. Despite having only 7 billion parameters, MiMo-7B achieves superior performance on mathematics and code reasoning tasks, outperforming even much larger models including Open... »

15 May 2025

13 May 2025

5 May 2025