Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
- Stars
- 170
- Last push
- May 9, 2023 (1138d ago)
Risk flags
- No push in 12+ months
- No CI pipeline detected
- No tagged releases
Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.
No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms is the primary contribution described in this paper.
Stable-Baselines-Team/stable-baselines3-contrib is the closest maintained adjacent implementation (Community adoption signal (723 stars)). It is not paper-verified; validate algorithm and evaluation setup against the paper before trusting reported metrics. Community adoption signal: 723 GitHub stars.
Open vwxyzjn/invalid-action-maskingHardware Notes
Expect multi-day setup/compute for meaningful reproduction based on current guidance.
Evidence graph: 3 refs, 3 links.
Utility signals: depth 65/100, grounding 75/100, status medium.
Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
Matched via arXiv identifier search
Risk flags
Matched via arXiv identifier search
Risk flags
Only a historical official implementation is available.
Use with caution for new projects; verify against current tooling and maintained community alternatives.
Hardware requirements
Dependencies pinned, manual setup needed
Quick start
git clone https://github.com/vwxyzjn/invalid-action-masking.git
pip install -e . No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.
These are not paper-verified. Use them as reference points when no direct implementation is available.
Community adoption signal (723 stars)
No additional verified repositories beyond the primary recommendation.
These repositories had low-confidence matching signals and are hidden by default.
Showing top 6 by score. 1 additional low-confidence matches are hidden.
No trustworthy direct or curated related Hugging Face artifacts were found yet.
Continue with targeted Hugging Face searches derived from the paper title and method context:
Tip: start with models, then check datasets/spaces if you need evaluation data or demos.
Direct artifact matches are currently sparse. Use targeted Hugging Face searches to quickly locate candidate models, datasets, and demos.
Evaluation & Human Feedback Data
Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.
Open in HFEPXNeed human evaluators for your AI research? Scale annotation with expert AI Trainers.