Parallel Tempering With a Variational Reference
Nikola Surjanovic, Saifuddin Syed, Alexandre Bouchard‐Côté, Trevor Campbell
No strong AI-core implementation/artifact signals were detected from current providers.
Sampling from complex target distributions is a challenging task fundamental to Bayesian inference. Parallel tempering (PT) addresses this problem by constructing a Markov chain on the expanded state space of a sequence of distributions interpolating between the posterior distribution and a fixed reference distribution, which is typically chosen to be the prior. However, in the typical case where the prior and poster ...
ior are nearly mutually singular, PT methods are computationally prohibitive. In this work we address this challenge by constructing a generalized annealing path connecting the posterior to an adaptively tuned variational reference. The reference distribution is tuned to minimize the forward (inclusive) KL divergence to the posterior distribution using a simple, gradient-free moment-matching procedure. We show that our adaptive procedure converges to the forward KL minimizer, and that the forward KL divergence serves as a good proxy to a previously developed measure of PT performance. We also show that in the large-data limit in typical Bayesian models, the proposed method improves in performance, while traditional PT deteriorates arbitrarily. Finally, we introduce PT with two references -- one fixed, one variational -- with a novel split annealing path that ensures stable variational reference adaptation. The paper concludes with experiments that demonstrate the large empirical gains achieved by our method in a wide range of realistic Bayesian inference scenarios.
Results & Benchmarks
No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.
Sampling from complex target distributions is a challenging task fundamental to Bayesian inference.
Implementation Evidence Summary
This is primarily a method paper. Reproduce it within a maintained framework baseline instead of chasing paper-specific repos.
Reproduction Risks
- No maintained paper-verified implementation is currently available
Evidence disclosure
Evidence graph: 2 refs, 1 links.
Utility signals: depth 60/100, grounding 58/100, status medium.
Implementation Status
There is no verified maintained implementation yet. Use this baseline plan to decide whether to prototype now or defer.
- This is primarily a method paper. Reproduce it within a maintained framework baseline instead of chasing paper-specific repos.
- Start with framework-native implementations (e.g. PyTorch optimizer module, Optax, or Transformers training loops).
- Replicate the paper ablation settings first, then compare against modern baselines.
Reproduction readiness
No verified implementation available
- · No maintained repository has been identified for this paper. Check adjacent implementations or HF artifacts below.
No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.
Hugging Face artifacts
No trustworthy direct or curated related Hugging Face artifacts were found yet.
Continue with targeted Hugging Face searches derived from the paper title and method context:
Tip: start with models, then check datasets/spaces if you need evaluation data or demos.
Direct artifact matches are currently sparse. Use targeted Hugging Face searches to quickly locate candidate models, datasets, and demos.
Research context
1
Citations
0
References
Tasks
Posterior probability, Parallel tempering, Computer science, Simulated annealing, Markov chain, Markov chain Monte Carlo, Physical Sciences
Methods
Algorithm, Bayesian probability, Mathematical optimization, Bayesian inference
Domains
Mathematics, Applied mathematics, Artificial Intelligence
Evaluation & Human Feedback Data
Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.
Open in HFEPXExplore Similar Papers
Jump to Paper2Code search queries derived from this paper's research context.
Related papers
-
Search on Paper2Code
Markov Chain Monte Carlo Algorithms (2014) Semantic similarity
-
Search on Paper2Code
Parallel tempering for dynamic generalized linear models (2016) Semantic similarity
-
Search on Paper2Code
Parallel tempering for the traveling salesman problem (2008) Semantic similarity
-
Search on Paper2Code
PARALLEL TEMPERING FOR THE TRAVELING SALESMAN PROBLEM (2009) Semantic similarity
-
Search on Paper2Code
Anti-tempered layered adaptive importance sampling (2017) Semantic similarity
Need human evaluators for your AI research? Scale annotation with expert AI Trainers.