OpenTrain AI
Maintained implementation availablepytorch

VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning

Yuqi Liu, Tianyuan Qu, Zhisheng Zhong, Bohao Peng, Shu Liu +2 more

May 17, 2025arXiv: 2505.12081
4 repos4,838 stars~a few hours to reproduce
arXiv PDF

Abstract

Large vision-language models exhibit inherent capabilities to handle diverse visual perception tasks. In this paper, we introduce VisionReasoner, a unified framework capable of reasoning and solving multiple visual perception tasks within a shared model. Specifically, by designing a unified reward mechanism and multi-object cognitive learning strategies, VisionReasoner enhances its reasoning capabilities to analyze v...

Results & Benchmarks

TaskDatasetMetricValue
ClassificationQwen2.5-1.5BAccuracy.46.3

Best Implementation

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

4.8k 367 Apr 2026 Apache-2.0
License
CI
Deps
Docker
  • Selected hiyouga/easyr1 as the strongest maintained implementation for new work.
  • Includes CI workflow signals.
  • Includes dependency/environment manifest signals.
  • Repository activity is within the last 24 months.

Reproduction Path

  1. 1

    Start with hiyouga/easyr1 and validate setup instructions in README.

  2. 2

    Reproduce the baseline result with the provided defaults before modifying hyperparameters.

  3. 3

    Log exact dependency versions and runtime environment for reproducibility.

Time to first repro: a few hoursNo repository-level red flags were detected, but paper-specific preprocessing and hyperparameter details may still be under-specified.

Additional Implementations

Official

  • Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"

    Stars: 621Forks: 29Last push: Jan 2026License: Apache-2.0

Community

  • Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"

    Stars: 621Forks: 29Last push: Jan 2026License: Apache-2.0

Hugging Face Artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts.