Exla

An SDK to run transformer models anywhere

Winter 2025
B2B / B2B -> Engineering, Product and Design
Edge Computing Semiconductors
Computer Vision
AI

About

Exla aggressively quantizes AI models to minimize memory usage and maximize inference speed. Whether you're deploying LLMs, VLMs, VLAs, or custom models, Exla reduces memory footprint by up to 80% and accelerates inference by 3–20x, all with just a few lines of code. Schedule a call: https://cal.com/exla-ai/schedule
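
The listing does not describe Exla's SDK or its quantization scheme, so the following is only a generic illustration of why quantization shrinks a model's memory footprint. It is a minimal sketch of symmetric per-tensor int8 quantization in plain Python; the function names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical and are not part of any Exla API.

```python
from array import array


def quantize_int8(weights):
    """Symmetric per-tensor quantization: one shared scale, values stored as int8.

    Hypothetical helper for illustration only; not an Exla SDK function.
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round each weight to the nearest step and clamp to the int8 range.
    q = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return q, scale


def dequantize_int8(q, scale):
    """Recover approximate float values from the int8 codes."""
    return [v * scale for v in q]


# Made-up sample weights stored as 32-bit floats.
weights = array("f", [0.12, -0.5, 0.33, 0.9, -0.07, 0.0, -0.81, 0.44])
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

fp32_bytes = weights.itemsize * len(weights)  # 4 bytes per value
int8_bytes = q.itemsize * len(q)              # 1 byte per value
reduction = 1 - int8_bytes / fp32_bytes
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print(f"fp32: {fp32_bytes} B, int8: {int8_bytes} B, reduction: {reduction:.0%}")
print(f"largest round-trip error: {max_error:.5f} (step size {scale:.5f})")
```

Going from 32-bit floats to 8-bit integers cuts weight storage by 75%, at the cost of a rounding error of at most half a quantization step per weight. By the same arithmetic, a reduction near 80% from 16-bit weights would imply roughly 3 to 4 bits per weight; whether Exla reaches its figure this way is not stated in the listing.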

Founders (2)

Pranav Nair
Co-Founder
Viraat Das
Founder

Details

Status
Active
Stage
Early
Regions
Unspecified