Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
- Stars
- 245
- Last push
- Apr 21, 2026 (60d ago)
Risk flags
- No CI pipeline detected
- No tagged releases
- No Docker setup
Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.
No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement is the primary contribution described in this paper.
thu-coai/AISafetyLab is the strongest maintained implementation based on ranking signals. License is declared (MIT).
Open thu-coai/AISafetyLabHardware Notes
Expect multi-day setup/compute for meaningful reproduction based on current guidance.
Evidence graph: 4 refs, 4 links.
Utility signals: depth 65/100, grounding 85/100, status medium.
Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.
Hardware requirements
No dependency manifest — manual reconstruction required
No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.
No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.
Broaden model search
Broaden dataset search
No trustworthy demo spaces right now.
Search spaces on Hugging FaceEvaluation & Human Feedback Data
Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.
Open in HFEPXNeed human evaluators for your AI research? Scale annotation with expert AI Trainers.