Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
- Stars
- 161
- Last push
- Jul 25, 2025 (318d ago)
Risk flags
No obvious maintenance or reproducibility risks detected.
Gerrit J. J. van den Burg, Christopher K. I. Williams
Core AI workload signals detected from paper context and implementation/artifact evidence.
Change point detection is an important part of time series analysis, as the presence of a change point indicates an abrupt and significant change in the data generating process. While many algorithms for change point detection have been proposed, comparatively little attention has been paid to evaluating their performance on real-world time series. Algorithms are typically evaluated on simulated data and a small numb ...
er of commonly-used series with unreliable ground truth. Clearly this does not provide sufficient insight into the comparative performance of these algorithms. Therefore, instead of developing yet another change point detection method, we consider it vastly more important to properly evaluate existing algorithms on real-world data. To achieve this, we present a data set specifically designed for the evaluation of change point detection algorithms that consists of 37 time series from various application domains. Each series was annotated by five human annotators to provide ground truth on the presence and location of change points. We analyze the consistency of the human annotators, and describe evaluation metrics that can be used to measure algorithm performance in the presence of multiple ground truth annotations. Next, we present a benchmark study where 14 algorithms are evaluated on each of the time series in the data set. Our aim is that this data set will serve as a proving ground in the development of novel change point detection algorithms.
No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.
Change point detection is an important part of time series analysis, as the presence of a change point indicates an abrupt and significant change in the data generating process.
alan-turing-institute/TCPD is the strongest maintained implementation based on ranking signals. CI workflows are present. License is declared (MIT).
Open alan-turing-institute/TCPDEvidence graph: 4 refs, 4 links.
Utility signals: depth 55/100, grounding 85/100, status medium.
Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
No obvious maintenance or reproducibility risks detected.
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
Official implementation from Papers with Code · Repository link is mentioned in the paper metadata
Risk flags
The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms
Preserved for provenance. Not recommended as the default path for new builds.
Dependencies pinned, manual setup needed
Quick start
git clone https://github.com/alan-turing-institute/TCPD.git
pip install -r requirements.txt No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.
A simple flask application to collect annotations for the Turing Change Point Dataset, a benchmark dataset for change point detection algorithms
No additional community repositories detected yet.
These repositories had low-confidence matching signals and are hidden by default.
No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.
Broaden model search
No trustworthy dataset matches right now.
Search datasets on Hugging FaceNo trustworthy demo spaces right now.
Search spaces on Hugging FaceTasks
None detected
Methods
Transformer
Domains
None detected
Evaluation & Human Feedback Data
Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.
Open in HFEPXExplore Similar Papers
Jump to Paper2Code search queries derived from this paper's research context.
Need human evaluators for your AI research? Scale annotation with expert AI Trainers.