OpenTrain AI
Maintained implementation availablenone

Roaring Bitmaps: Implementation of an Optimized Software Library

Daniel Lemire, Owen Kaser, Nathan Kurz, Luca Deri, Chris O'Hara +2 more

September 22, 2017arXiv: 1709.07821
2 repos1,799 stars~a few days to reproduce
arXiv PDFDOI

Abstract

Compressed bitmap indexes are used in systems such as Git or Oracle to accelerate queries. They represent sets and often support operations such as unions, intersections, differences, and symmetric differences. Several important systems such as Elasticsearch, Apache Spark, Netflix's Atlas, LinkedIn's Pinot, Metamarkets' Druid, Pilosa, Apache Hive, Apache Tez, Microsoft Visual Studio Team Services and Apache Kylin rel...

Results & Benchmarks

Benchmark data is not yet available for this paper.

Hardware Requirements

  • Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Best Implementation

Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, Alibaba Tair, Redpanda, YDB and StarRocks

1.8k 312 Apr 2026 NOASSERTION
License
CI
Deps
Docker
  • Selected RoaringBitmap/CRoaring as the strongest maintained implementation for new work.
  • Includes CI workflow signals.
  • Repository activity is within the last 24 months.
  • Official repository is preserved separately as historical context.

Reproduction Path

  1. 1

    Start with RoaringBitmap/CRoaring and validate setup instructions in README.

  2. 2

    Reproduce the baseline result with the provided defaults before modifying hyperparameters.

  3. 3

    Log exact dependency versions and runtime environment for reproducibility.

Time to first repro: a few daysDependency manifest is missing

Additional Implementations

No additional verified repositories beyond the primary recommendation.

Hugging Face Artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts.