How reproducible is "Latte: Latent Diffusion Transformer for Video Generation"?

Estimated time to first reproduction: a few hours. No risk flags identified. Start with maxin-cn/Latte and validate setup instructions in README.

Are there pretrained models available for "Latte: Latent Diffusion Transformer for Video Generation"?

Yes, 2 Hugging Face models found. The top result is maxin-cn/Latte-1 with 117 downloads.

What framework is used to implement "Latte: Latent Diffusion Transformer for Video Generation"?

The primary implementation uses pytorch.

Latte: Latent Diffusion Transformer for Video Generation

Q: What is the best open-source implementation of "Latte: Latent Diffusion Transformer for Video Generation"?

The best maintained implementation is maxin-cn/Latte with 35 stars on GitHub. Confidence: high. Reproducibility: Strong.

Published: Jan 1, 2024

Best maintained implementation now

Evidence: Direct

Domain fit: AI-core

Verified repos: 2

Top repo stars: 35

Core AI workload signals detected from paper context and implementation/artifact evidence.

Framework: pytorch

Time to first repro: a few hours

No risk flags

arXiv PDF

Technical details

Canonical key: arxiv-2401.03048

Cache status: Fresh

Generated at: Jun 4, 2026, 3:32 AM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: not_generated

LLM model: n/a

LLM generated: Unknown

LLM content type: n/a

HF policy: hf-relevance-v27

implementation starting point

Benchmarks: thin evidence

Time to repro: a few hours

pytorch

Results & Benchmarks

Freshness tier: cold

Direct + Inferred Evidence

Transformer

Latte-S

D 𝐷 D italic_D

384

Source: paper fulltext

Transformer

Latte-B

D 𝐷 D italic_D

768

Source: paper fulltext

Benchmark evidence drill-down

2 findings

Audit each benchmark finding before selecting an implementation path. Evidence refs map to the disclosure section below.

Task	Dataset	Metric	Value	Source	Evidence refs
Transformer	Latte-S	D 𝐷 D italic_D	384	paper-derived	No explicit refs
Transformer	Latte-B	D 𝐷 D italic_D	768	paper-derived	No explicit refs

Latte: Latent Diffusion Transformer for Video Generation presents a transformer method.

Use This Implementation Because…

Confidence: high

maxin-cn/Latte is the strongest maintained implementation based on ranking signals. CI workflows are present. License is declared (Apache-2.0).

Open maxin-cn/Latte

Reproduction Risks

No repository-level red flags were detected, but paper-specific preprocessing and hyperparameter details may still be under-specified.

Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 90/100, grounding 95/100, status high.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

maxin-cn/Latte

best maintained

Maintenance: Stale

Confidence: High

Reproducibility: Strong

Official implementation from Papers with Code · Strong overlap with paper title keywords

Stars: 35
Last push: Feb 26, 2025 (464d ago)

CIDependencies

Risk flags

No push in 12+ months
No tagged releases
No Docker setup

vchitect/latte

alternative

Maintenance: Stale risk

Confidence: Low

Reproducibility: Strong

Strong overlap with paper title keywords · Community adoption signal (1940 stars)

Stars: 1,940
Last push: Oct 30, 2025 (217d ago)

CIDependencies

Risk flags

No tagged releases
No Docker setup
Low confidence match

mindspore-lab/mindone

alternative

Maintenance: Recently updated

Confidence: Low

Reproducibility: Strong

Community adoption signal (463 stars)

Stars: 463
Last push: Jan 14, 2026 (141d ago)

CIReleasesDependencies

Risk flags

No Docker setup
Low confidence match

Best implementation now

maxin-cn/Latte

Confidence: High

Reproducibility: Strong

The official implementation of Latte: Latent Diffusion Transformer for Video Generation.

Stars: 35

Forks: 3

Last push: Feb 26, 2025

License: Apache-2.0

Official implementation from Papers with Code

Strong overlap with paper title keywords

Community adoption signal (35 stars)

License ✓

CI ✓

Deps ✓

Docker –

Selected maxin-cn/Latte as the strongest maintained implementation for new work.
Includes CI workflow signals.
Includes dependency/environment manifest signals.
Repository activity is within the last 24 months.

Reproduction readiness

Setup Required

Time to first repro: hours

Last checked: Jun 4, 2026

Dependencies pinned, manual setup needed

· maxin-cn/Latte has environment.yml but requires manual environment setup.
· Last push was 464 days ago — expect possible dependency version conflicts.
· No Dockerfile — you will set up the environment manually.

Open maxin-cn/Latte

Quick start

git clone https://github.com/maxin-cn/Latte.git
conda env create -f environment.yml && conda activate <env-name>

Additional implementations

Official

No additional official repositories detected.

Community

explainingai-code/VideoGeneration-PyTorch
Confidence: Medium

This repo implements Video generation model using Latent Diffusion Transformers(Latte) in PyTorch and provides training and inference code on Moving mnist dataset and UCF101 dataset

Stars: 19

Last push: Jan 6, 2025

License: MIT