What is the best open-source implementation of "Are Language Models Actually Useful for Time Series Forecasting?"?

The best maintained implementation is bennytmt/llmsfortimeseries with 161 stars on GitHub. Confidence: high. Reproducibility: Limited.

How reproducible is "Are Language Models Actually Useful for Time Series Forecasting?"?

Estimated time to first reproduction: a few days. Risk flags: License metadata missing, No CI workflows detected, Dependency manifest is missing. Start with bennytmt/llmsfortimeseries and validate setup instructions in README.

Are there pretrained models available for "Are Language Models Actually Useful for Time Series Forecasting?"?

Yes, 1 Hugging Face model found. The top result is keras-io/timeseries_forecasting_for_weather with 31 downloads.

What framework is used to implement "Are Language Models Actually Useful for Time Series Forecasting?"?

The primary implementation uses pytorch.

Are Language Models Actually Useful for Time Series Forecasting?

Published: Jun 1, 2024

Best maintained implementation now

Evidence: Direct

Domain fit: AI-core

Verified repos: 2

Top repo stars: 161

Core AI workload signals detected from paper context and implementation/artifact evidence.

Framework: pytorch

Time to first repro: a few days

3 risk flags

arXiv PDF

Technical details

Canonical key: arxiv-2406.16964

Cache status: Fresh

Generated at: Jun 17, 2026, 12:44 PM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: missing

Time to repro: a few days

3 risk flags

pytorch

Results & Benchmarks

Freshness tier: cold

Direct + Inferred Evidence

No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.

Are Language Models Actually Useful for Time Series Forecasting? is the primary contribution described in this paper.

Use This Implementation Because…

Confidence: high

bennytmt/llmsfortimeseries is the strongest maintained implementation based on ranking signals.

Open bennytmt/llmsfortimeseries

Reproduction Risks

License metadata missing
No CI workflows detected
Dependency manifest is missing

Hardware Notes

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 65/100, grounding 85/100, status medium.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

bennytmt/llmsfortimeseries

best maintained

Maintenance: Stale risk

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 161
Last push: Jun 25, 2025 (358d ago)

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

bennytmt/ts_models

historical official

Maintenance: Stale risk

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 161
Last push: Jun 25, 2025 (358d ago)

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

thuml/AutoTimes

alternative

Maintenance: Stale risk

Confidence: Low

Reproducibility: Moderate

Strong overlap with paper title keywords · Community adoption signal (268 stars)

Stars: 268
Last push: Jul 22, 2025 (331d ago)

Dependencies

Risk flags

No CI pipeline detected
No tagged releases
No Docker setup

Best implementation now

bennytmt/llmsfortimeseries

Confidence: High

Reproducibility: Limited

BennyTMT/LLMsForTimeSeries

Stars: 161

Forks: 23

Last push: Jun 25, 2025

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Community adoption signal (161 stars)

License –

CI –

Deps –

Docker –

Selected bennytmt/llmsfortimeseries as the strongest maintained implementation for new work.
Repository activity is within the last 24 months.
Official repository is preserved separately as historical context.

Historical official implementation

Preserved for provenance. Not recommended as the default path for new builds.

bennytmt/ts_models

Stars: 161

Last push: Jun 25, 2025

Reproduction readiness

Major Work

Time to first repro: days

Last checked: Jun 17, 2026

Hardware requirements

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

No dependency manifest — manual reconstruction required

· bennytmt/llmsfortimeseries has no requirements.txt, environment.yml, pyproject.toml, or Dockerfile.
· You will need to reverse-engineer dependencies from import statements in the source code.
· Last push was 358 days ago.

Open bennytmt/llmsfortimeseries

No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.

Additional implementations

No additional verified repositories beyond the primary recommendation.

Possible but unverified matches (1)

These repositories had low-confidence matching signals and are hidden by default.

thuml/AutoTimes

Confidence: Low

Stars: 268

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Models

keras-io/timeseries_forecasting_for_weather

Curated Related

Downloads: 31

Likes: 21

Broaden model search

Transformer Natural Language Processing Transformer language models actually useful

Datasets

Wenyan0110/Multimodal-Dataset-Image_Text_Table_TimeSeries-for-Financial-Time-Series-Forecasting

Curated Related

Downloads: 1,516

Likes: 10

Updated: Jun 6, 2025
t4tiana/store-sales-time-series-forecasting

Curated Related

Downloads: 504

Likes: 6

Updated: Jul 5, 2023

Broaden dataset search

Transformer Natural Language Processing dataset Transformer dataset language models actually useful dataset

Spaces

No trustworthy demo spaces right now.

Search spaces on Hugging Face

Explore on Hugging Face

Search models Search datasets Search spaces

Research context

Tasks

None detected

Methods

Transformer

Domains

Natural Language Processing

Evaluation & Human Feedback Data

Open this paper in HFEPX to review benchmark signals, evaluation modes, and human-feedback protocol context.

Open in HFEPX

Explore Similar Papers

Jump to Paper2Code search queries derived from this paper's research context.

Transformer Natural Language Processing

Need human evaluators for your AI research? Scale annotation with expert AI Trainers.

Post a Job Get a Quote