Chunkr logo

Chunkr

Open source API service to parse complex documents

Winter 2024B2B / B2B -> Engineering, Product and Design3 employees

About

Battle-tested + highly modular vision infrastructure to convert PDFs, PPTs, Word, Excel, PNG, and JPEGs into LLM-ready data. We started by building lumina.sh - where we needed to parse ~600M pages of scientific literature. The researchers didn't care - but devs wanted our ingestion pipeline. So we built chunkr instead. We offer high quality layout analysis, OCR, bounding boxes, granular VLM controls, semantic chunking, and all the last mile engineering that goes into building standout AI applications. Common use-cases include RAG, and automating document workflows like invoices/medical reports -> database.

Build your own investing agent

GPAgent keeps YC listings public and neutral. Fund-specific scoring, notes, and workflow state live in each customer workspace.

Join the GPAgent waitlist

Founders (3)

Akhilesh Sharma
Founder
Ishaan Kapoor
Founder
Mehul Chadda
Co-founder & CEO

Details

Status
Active
Stage
Early
Team Size
3
Regions
Unspecified