Skip to content
@Infini-AI-Lab

Infini-AI-Lab

Next Generation AI algorithms and systems

Popular repositories Loading

  1. Sequoia Sequoia Public

    scalable and robust tree-based speculative decoding algorithm

    Python 366 37

  2. TriForce TriForce Public

    [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

    Python 276 17

  3. MagicPIG MagicPIG Public

    [ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation

    Python 246 17

  4. MagicDec MagicDec Public

    [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

    Python 137 9

  5. UMbreLLa UMbreLLa Public

    LLM Inference on consumer devices

    Python 128 15

  6. Multiverse Multiverse Public

    Python 110 9

Repositories

Showing 10 of 30 repositories
  • vortex_torch Public

    Vortex: A Flexible and Efficient Sparse Attention Framework

    Infini-AI-Lab/vortex_torch’s past year of commit activity
    Python 43 Apache-2.0 2 0 0 Updated Jan 18, 2026
  • jackpot Public
    Infini-AI-Lab/jackpot’s past year of commit activity
    0 0 0 0 Updated Dec 7, 2025
  • Infini-AI-Lab/vortex-technical-blog’s past year of commit activity
    HTML 0 0 0 0 Updated Nov 30, 2025
  • Infini-AI-Lab/ai-environment-architect’s past year of commit activity
    HTML 0 0 0 0 Updated Nov 26, 2025
  • FCV Public
    Infini-AI-Lab/FCV’s past year of commit activity
    Python 10 0 0 0 Updated Nov 3, 2025
  • M2PO Public
    Infini-AI-Lab/M2PO’s past year of commit activity
    Python 28 Apache-2.0 2 3 0 Updated Oct 8, 2025
  • Multiverse Public
    Infini-AI-Lab/Multiverse’s past year of commit activity
    Python 110 Apache-2.0 9 5 0 Updated Sep 13, 2025
  • OLMo Public Forked from allenai/OLMo

    Modeling, training, eval, and inference code for OLMo

    Infini-AI-Lab/OLMo’s past year of commit activity
    Python 0 Apache-2.0 712 0 0 Updated Sep 10, 2025
  • Kinetics Public

    Kinetics: Rethinking Test-Time Scaling Laws

    Infini-AI-Lab/Kinetics’s past year of commit activity
    Python 85 3 0 0 Updated Jul 11, 2025
  • sglang Public Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Infini-AI-Lab/sglang’s past year of commit activity
    Python 0 Apache-2.0 4,112 0 0 Updated Jul 9, 2025