MAMUT: A Novel Framework for Modifying Mathematical Formulas for the Generation of Specialized Datasets for Language Model Training
Jonathan Drechsel, Anja Reusch, Steffen Herbold
Abstract
Mathematical formulas are a fundamental and widely used component in various scientific fields, serving as a universal language for expressing complex concepts and relationships. While state-of-the-art transformer models excel in processing and understanding natural language, they encounter challenges with mathematical notation, which involves a complex structure and diverse representations. This study focuses on the...
Results & Benchmarks
| Task | Dataset | Metric | Value |
|---|---|---|---|
| Retrieval / indexing | MATH | Accuracy | 93.98 |
| Retrieval / indexing | MP BERT | Recall | 99.5 |
| Retrieval / indexing | MP BERT -random-falses | Recall | 99.7 |
| Retrieval / indexing | MP BERT -constant-falses | Recall | 99.2 |
Best Implementation
A computer algebra system written in pure Python with a randomized LaTeX Formula Generator
- Selected jdrechsel13/sympy-random-latex as the strongest maintained implementation for new work.
- Includes CI workflow signals.
- Includes dependency/environment manifest signals.
- Repository activity is within the last 24 months.
Reproduction Path
- 1
Start with jdrechsel13/sympy-random-latex and validate setup instructions in README.
- 2
Reproduce the baseline result with the provided defaults before modifying hyperparameters.
- 3
Log exact dependency versions and runtime environment for reproducibility.
Additional Implementations
Official
- aieng-lab/transformer-math-evaluationConfidence: low
aieng-lab/transformer-math-evaluation
Stars: 2Forks: 0Last push: Jul 2025License: Apache-2.0 - aieng-lab/transformer-math-pretrainingConfidence: low
Framework to pretrain mathematical aware transformer models using MAMUT datasets
Stars: 1Forks: 0Last push: Jul 2025License: Apache-2.0
Community
No additional community repositories detected yet.
Hugging Face Artifacts
No direct paper-linked artifacts were found. Showing strongest curated related artifacts.