OpenTrain AI
Maintained implementation availablenonePretrained Models Available

AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents

Shuyuan Xu, Zelong Li, Kai Mei, Yongfeng Zhang

May 11, 2024arXiv: 2405.06907
3 repos5,452 stars~a few hours to reproduce
arXiv PDF

Abstract

Since their inception, programming languages have trended towards greater readability and lower barriers for programmers. Following this trend, natural language can be a promising type of programming language that provides great flexibility and usability and helps towards the democracy of programming. However, the inherent vagueness, ambiguity, and verbosity of natural language pose significant challenges in developi...

Results & Benchmarks

TaskDatasetMetricValue
Instruction tuningTask 1 (CLIP Score)Mixtral as LLM interpreter0.0

Best Implementation

AIOS: AI Agent Operating System

5.5k 747 Jan 2026 NOASSERTION
License
CI
Deps
Docker
  • Selected agiresearch/aios as the strongest maintained implementation for new work.
  • Includes CI workflow signals.
  • Includes dependency/environment manifest signals.
  • Repository activity is within the last 24 months.

Reproduction Path

  1. 1

    Start with agiresearch/aios and validate setup instructions in README.

  2. 2

    Reproduce the baseline result with the provided defaults before modifying hyperparameters.

  3. 3

    Log exact dependency versions and runtime environment for reproducibility.

Time to first repro: a few hoursNo repository-level red flags were detected, but paper-specific preprocessing and hyperparameter details may still be under-specified.

Additional Implementations

Official

  • agiresearch/coreConfidence: low

    LLM as Interpreter for Natural Language Programming, Pseudo-code Programming and Flow Programming of AI Agents

    Stars: 47Forks: 7Last push: Jul 2024License: Apache-2.0

Community

No additional community repositories detected yet.

Hugging Face Artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts.