Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
- Stars
- 2
- Last push
- Feb 13, 2025 (492d ago)
Risk flags
- No push in 12+ months
- No CI pipeline detected
- No tagged releases
Youpeng Ma, Tao Chen, Ke Li
Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.
As software systems become more complex and configurable, more performance problems tend to arise from the configuration designs. This has caused some configuration options to unexpectedly degrade performance which deviates from their original expectations designed by the developers. Such discrepancies, namely configuration performance bugs (CPBugs), are devastating and can be deeply hidden in the source code. Yet, e ...
fficiently testing CPBugs is difficult, not only due to the test oracle is hard to set, but also because the configuration measurement is expensive and there are simply too many possible configurations to test. As such, existing testing tools suffer from lengthy runtime or have been ineffective in detecting CPBugs when the budget is limited, compounded by inaccurate test oracle. In this paper, we seek to achieve significantly faster CPBug testing by neurally prioritizing the testing at both the configuration option and value range levels with automated oracle estimation. Our proposed tool, dubbed NDP, is a general framework that works with different heuristic generators. The idea is to leverage two neural language models: one to estimate the CPBug types that serve as the oracle while, more vitally, the other to infer the probabilities of an option being CPBug-related, based on which the options and the value ranges to be searched can be prioritized. Experiments on several widely-used systems of different versions reveal that NDP can, in general, better predict CPBug type in 87% cases and find more CPBugs with up to 88.88x testing efficiency speedup over the state-of-the-art tools.
No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.
As software systems become more complex and configurable, more performance problems tend to arise from the configuration designs.
Only historical official repository was found (ideas-labo/ndp).
Open ideas-labo/ndpHardware Notes
Expect multi-day setup/compute for meaningful reproduction based on current guidance.
Evidence graph: 2 refs, 1 links.
Utility signals: depth 60/100, grounding 58/100, status medium.
Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
Only a historical official implementation is available.
Use with caution for new projects; verify against current tooling and maintained community alternatives.
Hardware requirements
Dependencies pinned, manual setup needed
Quick start
git clone https://github.com/ideas-labo/ndp.git
pip install -r requirements.txt No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.
The dataset and code for the accepted ICSE25 paper "Faster Configuration Performance Bug Testing with Neural Dual-level Prioritization"
NDP repository
No additional community repositories detected yet.
No trustworthy direct or curated related Hugging Face artifacts were found yet.
Continue with targeted Hugging Face searches derived from the paper title and method context:
Tip: start with models, then check datasets/spaces if you need evaluation data or demos.
Direct artifact matches are currently sparse. Use targeted Hugging Face searches to quickly locate candidate models, datasets, and demos.
Tasks
None detected
Methods
Transformer
Domains
Natural Language Processing
Evaluation & Human Feedback Data
Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.
Open in HFEPXExplore Similar Papers
Jump to Paper2Code search queries derived from this paper's research context.
Need human evaluators for your AI research? Scale annotation with expert AI Trainers.