OpenTrain AI
Maintained implementation available | pytorch

F-LMM: Grounding Frozen Large Multimodal Models

June 1, 2024 | arXiv: 2406.05821
2 repos | 109 stars | ~a few days to reproduce
arXiv PDF

Abstract

Results & Benchmarks

Task | Dataset | Metric | Value
Grounding Frozen Large Multimodal Models | SpaCy Parser | Recall | 97.3
Grounding Frozen Large Multimodal Models | Linear Keyword Selector | Recall | 96.6

Hardware Requirements

  • Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Best Implementation

[CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models

Stars: 109 | Forks: 1 | Last push: May 2025 | License: NOASSERTION
  • Selected wusize/f-lmm as the strongest maintained implementation for new work.
  • Repository activity is within the last 24 months.

Reproduction Path

  1. Start with wusize/f-lmm and validate setup instructions in README.

  2. Reproduce the baseline result with the provided defaults before modifying hyperparameters.

  3. Log exact dependency versions and runtime environment for reproducibility.
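
The logging step can be sketched in Python using only the standard library. This is a minimal illustration, not part of wusize/f-lmm: the function names, the output file name, and the default package list (torch, torchvision, transformers) are assumptions, so replace the list with whatever the README actually asks you to install.

```python
import json
import platform
import sys
from importlib import metadata


def snapshot_environment(packages=("torch", "torchvision", "transformers")):
    """Collect interpreter, OS, and package versions as a plain dict.

    The default package names are an assumption for illustration only.
    """
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None  # package is not installed here
    return {
        "python": sys.version.split()[0],
        "platform": platform.platform(),
        "machine": platform.machine(),
        "packages": versions,
    }


def write_snapshot(path="env_snapshot.json", packages=None):
    """Write the snapshot to a JSON file (hypothetical file name) and return it."""
    snap = snapshot_environment(packages) if packages else snapshot_environment()
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(snap, fh, indent=2, sort_keys=True)
    return snap
```

Calling write_snapshot() once before the baseline run and keeping the JSON file next to the results gives a record to compare against if a later run diverges. Because the listing reports that the dependency manifest is missing, such a snapshot may be the only version record available.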

Time to first repro: a few days | No CI workflows detected | Dependency manifest is missing

Additional Implementations

Official

No additional official repositories detected.

Community

  • wusize/F-LMM | Confidence: low

    [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models

    Stars: 109 | Forks: 1 | Last push: May 2025 | License: NOASSERTION

Hugging Face Artifacts

No trustworthy direct or curated related Hugging Face artifacts were found yet.

Continue with targeted Hugging Face searches: