What is the best open-source implementation of "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"?

The best-maintained implementation is yxb-nku/se-gui, which has 102 stars on GitHub. Confidence: high. Reproducibility: limited.

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning

Q: How reproducible is "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"?

Estimated time to first reproduction: a few days. Risk flags: license metadata missing, no CI workflows detected, and dependency manifest missing. Start with yxb-nku/se-gui and validate the setup instructions in its README.

Q: What framework is used to implement "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"?

The primary implementation uses PyTorch.

Published: May 1, 2025

Best maintained implementation now

Evidence: Direct

Domain fit: AI-core

Verified repos: 3

Top repo stars: 102

Core AI workload signals detected from paper context and implementation/artifact evidence.

Framework: PyTorch

Time to first repro: a few days

3 risk flags

Technical details

Canonical key: arxiv-2505.12370

Cache status: Fresh

Generated at: Apr 29, 2026, 6:16 PM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: thin evidence

Results & Benchmarks

Freshness tier: hot

Direct + Inferred Evidence

Benchmark evidence drill-down

6 findings

Audit each benchmark finding before selecting an implementation path. Evidence refs map to the disclosure section below.

Task	Dataset	Metric	Value	Source	Evidence refs
Reinforcement learning	GPT-4o	ScreenSpot Accuracy	21.9	paper-derived	No explicit refs
Reinforcement learning	9.4	ScreenSpot-v2 Accuracy	22.5	paper-derived	No explicit refs
Reinforcement learning	22.2	ScreenSpot-v2 Accuracy	20.1	paper-derived	No explicit refs
Reinforcement learning	Qwen2-VL-7B	ScreenSpot Accuracy	50.3	paper-derived	No explicit refs
Reinforcement learning	27.4	ScreenSpot Accuracy	42.9	paper-derived	No explicit refs
Reinforcement learning	50.1	ScreenSpot-v2 Accuracy	39.8	paper-derived	No explicit refs

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning presents a self-evolutionary reinforcement learning method for improving the visual grounding of GUI agents.

Use This Implementation Because…

Confidence: high

yxb-nku/se-gui is the strongest maintained implementation based on ranking signals.

Reproduction Risks

License metadata missing
No CI workflows detected
Dependency manifest is missing

Hardware Notes

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Evidence disclosure

Evidence graph: 3 refs, 3 links.

Utility signals: depth 100/100, grounding 85/100, status high.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

yxb-nku/se-gui

best maintained

Maintenance: Stale risk

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 102
Last push: Oct 21, 2025 (191d ago)

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

likaixin2000/screenspot-pro-gui-grounding

alternative

Maintenance: Active

Confidence: Low

Reproducibility: Limited

Partial overlap with paper title keywords · Community adoption signal (367 stars)

Stars: 367
Last push: Apr 14, 2026 (16d ago)

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

Lyz103/LLM-Agent-Paper-daily

alternative

Maintenance: Active

Confidence: Low

Reproducibility: Strong

Matched via arXiv identifier search

Stars: 20
Last push: Apr 29, 2026 (1d ago)

CI · Dependencies

Risk flags

No tagged releases
No Docker setup
Low confidence match

Best implementation now

yxb-nku/se-gui

Confidence: High

Reproducibility: Limited

[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Stars: 102

Forks: 6

Last push: Oct 21, 2025

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Strong overlap with paper title keywords

Community adoption signal (102 stars)

License: not detected

CI: not detected

Deps: not detected

Docker: not detected

Selected yxb-nku/se-gui as the strongest maintained implementation for new work.
Repository activity is within the last 24 months.

Reproduction readiness

Major Work

Time to first repro: a few days

Last checked: Apr 29, 2026

No dependency manifest — manual reconstruction required

· yxb-nku/se-gui has no requirements.txt, environment.yml, pyproject.toml, or Dockerfile.
· You will need to reverse-engineer dependencies from import statements in the source code.
· Last push was 191 days ago.

Additional implementations

Official

No additional official repositories detected.

Community

YXB-NKU/SE-GUI
Confidence: Medium

[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Stars: 102

Last push: Oct 21, 2025

mlfoundations/Gelato
Confidence: Medium

🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents

Stars: 46

Last push: Dec 22, 2025