What is the best open-source implementation of "Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services"?

The best maintained implementation is gwdg/chat-ai with 103 stars on GitHub. Confidence: high. Reproducibility: Limited.

How reproducible is "Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services"?

Estimated time to first reproduction: a few days. Risk flags: No CI workflows detected, Dependency manifest is missing. Start with gwdg/chat-ai and validate setup instructions in README.

Are there pretrained models available for "Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services"?

Yes, 1 Hugging Face model found. The top result is facebook/seamless-m4t-v2-large with 76,246 downloads.

What framework is used to implement "Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services"?

The primary implementation uses none.

Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services

Ali Doosthosseini, Jonathan Decker, Hendrik Nolte, Julian M. Kunkel

Published: Jun 27, 2024

Best maintained implementation now

Evidence: Direct

Domain fit: AI-core

Verified repos: 3

Top repo stars: 103

Core AI workload signals detected from paper context and implementation/artifact evidence.

Framework: none

Time to first repro: a few days

2 risk flags

arXiv PDF

The widespread adoption of large language models (LLMs) has created a pressing need for an efficient, secure and private serving infrastructure, which allows researchers to run open source or custom fine-tuned LLMs and ensures users that their data remains private and is not stored without their consent. While high-performance computing (HPC) systems equipped with state-of-the-art GPUs are well-suited for training LL ...

Read full abstract

Ms, their batch scheduling paradigm is not designed to support real-time serving of AI applications. Cloud systems, on the other hand, are well suited for web services but commonly lack access to the computational power of HPC clusters, especially expensive and scarce high-end GPUs, which are required for optimal inference speed. We propose an architecture with an implementation consisting of a web service that runs on a cloud VM with secure access to a scalable backend running a multitude of LLM models on HPC systems. By offering a web service using our HPC infrastructure to host LLMs, we leverage the trusted environment of local universities and research centers to offer a private and secure alternative to commercial LLM services. Our solution natively integrates with the HPC batch scheduler Slurm, enabling seamless deployment on HPC clusters, and is able to run side by side with regular Slurm workloads, while utilizing gaps in the schedule created by Slurm. In order to ensure the security of the HPC system, we use the SSH ForceCommand directive to construct a robust circuit breaker, which prevents successful attacks on the web-facing server from affecting the cluster. We have successfully deployed our system as a production service, and made the source code available at \url{https://github.com/gwdg/chat-ai}

Technical details

Canonical key: arxiv-2407.00110

Cache status: Fresh

Generated at: May 26, 2026, 2:43 AM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: missing

Time to repro: a few days

2 risk flags

none

Results & Benchmarks

Freshness tier: cold

Direct + Inferred Evidence

No concrete benchmark grounding is available yet. Treat the page as context or an implementation starting point only.

Use This Implementation Because…

Confidence: high

gwdg/chat-ai is the strongest maintained implementation based on ranking signals. License is declared (GPL-3.0).

Open gwdg/chat-ai

Reproduction Risks

No CI workflows detected
Dependency manifest is missing

Hardware Notes

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 65/100, grounding 85/100, status medium.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

gwdg/chat-ai

best maintained

Maintenance: Active

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 103
Last push: May 21, 2026 (5d ago)

Releases

Risk flags

No CI pipeline detected
No Docker setup
Dependency manifest missing

gwdg/saia-hub

historical official

Maintenance: Active

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 18
Last push: May 8, 2026 (18d ago)

Releases

Risk flags

No CI pipeline detected
No Docker setup
Dependency manifest missing

gwdg/saia-hpc

alternative

Maintenance: Active

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 12
Last push: May 8, 2026 (18d ago)

Releases

Risk flags

No CI pipeline detected
No Docker setup
Dependency manifest missing

Best implementation now

gwdg/chat-ai

Confidence: High

Reproducibility: Limited

gwdg/chat-ai

Stars: 103

Forks: 13

Last push: May 21, 2026

License: GPL-3.0

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Community adoption signal (103 stars)

License ✓

CI –

Deps –

Docker –

Selected gwdg/chat-ai as the strongest maintained implementation for new work.
Repository activity is within the last 24 months.
Official repository is preserved separately as historical context.

Historical official implementation

Preserved for provenance. Not recommended as the default path for new builds.

gwdg/saia-hub

Stars: 18

Last push: May 8, 2026

Reproduction readiness

Major Work

Time to first repro: days

Last checked: May 26, 2026

Hardware requirements

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

No dependency manifest — manual reconstruction required

· gwdg/chat-ai has no requirements.txt, environment.yml, pyproject.toml, or Dockerfile.
· You will need to reverse-engineer dependencies from import statements in the source code.

Open gwdg/chat-ai

No benchmark numbers could be verified. You will not be able to validate reproduction correctness against published numbers.