What is Sarvam Akshar?

Akshar is a document digitization product. It reads complex layouts, tables, and Indic scripts, and returns structured output in HTML, JSON, or Markdown with layout and reading order intact.

What is the difference between Akshar API and Akshar Platform?

The API is for batch processing. Send documents, get structured output, no manual step. The Platform adds a visual interface where you can review, edit, and correct output before exporting.

Which languages does Akshar support?

Twenty-two Indic languages plus English: Assamese, Bengali, Bodo, Dogri, Gujarati, Hindi, Kannada, Kashmiri, Konkani, Maithili, Malayalam, Manipuri, Marathi, Nepali, Odia, Punjabi, Sanskrit, Santali, Sindhi, Tamil, Telugu, and Urdu.

What output formats are available?

HTML, JSON, and Markdown. All three preserve the original layout and reading order.

How do I get access to the Akshar Platform?

Contact us. We will walk through your use case and set up access.

Sarvam Akshar

Intelligent Document Digitisation,
built for India.

A document digitisation platform that reads, understands, and extracts knowledge from real-world documents: scanned archives, handwritten notes, ancient scripts, complex tables, and dense scripts.

Where digitisation falls short

Most OCR tools weren't built for Indian documents. Complex layouts, dense tables, and scripts where a single matra changes the word.

Complex Layouts Break Apart

Multi-column pages and mixed-format documents come out garbled. Layout structure is lost before extraction even begins.

Indic Scripts Are Misread

Conjuncts, matras, and diacritics get confused. Low-resource scripts fare even worse. A single matra changes the word.

Tables Lose Their Structure

Rows merge, columns shift, and cell boundaries vanish. What was once a structured table becomes an unreadable wall of text.

No Way to Fix Errors

One-shot output with no review or correction step. If the OCR gets it wrong, you have to start over from scratch.

Akshar turns complex documents into
structured, usable data

Accurate even when documents are not

Spots paragraphs, headers, tables, footnotes, and figures in any document structure.

Traces the correct reading path across columns and sections, regardless of layout complexity.

Handles 22 Indian languages and English, including multilingual pages in a single pass.

Delivers HTML, JSON, or Markdown with layout and reading order preserved.

Review and correct before export

Every block, paragraph, and table cell is linked to its location in the source document. You always see where things came from.

Describe what you want changed. The agent applies it across the entire document.

Click any extracted element to see its position in the original scan.

Fix text, relabel blocks, and restructure layout with the source document alongside.

23 languages, every script natively understood

हिन्दीHindi · hi-IN

বাংলাBengali · bn-IN

தமிழ்Tamil · ta-IN

తెలుగుTelugu · te-IN

मराठीMarathi · mr-IN

ગુજરાતીGujarati · gu-IN

ಕನ್ನಡKannada · kn-IN

മലയാളംMalayalam · ml-IN

অসমীয়াAssamese · as-IN

اردوUrdu · ur-IN

संस्कृतम्Sanskrit · sa-IN

नेपालीNepali · ne-IN

डोगरीDogri · doi-IN

बड़ोBodo · brx-IN

ਪੰਜਾਬੀPunjabi · pa-IN

ଓଡ଼ିଆOdia · od-IN

कोंकणीKonkani · kok-IN

मैथिलीMaithili · mai-IN

سنڌيSindhi · sd-IN

कॉशुरKashmiri · ks-IN

মৈতৈলোন্Manipuri · mni-IN

ᱥᱟᱱᱛᱟᱲᱤSantali · sat-IN

EnglishEnglish · en-IN

For every kind of document

Designed to handle the full range of documents, across industries and use cases.

Government & public records

Convert administrative files, forms, and historical
records into searchable, structured formats.

Publishing & archives

Turn scanned books, backlists, and out-of-print
titles into accessible e-books.

Finance & legal

Process contracts, statements, court records, and compliance docs.

Research & education

Extract text from manuscripts, newspapers, textbooks, and primary sources.

Developers

Add digitization to your product with the Akshar API.

From extraction to action, powered by agents

Move from raw documents to reliable output with agents handling each step.

No credit card needed. Get started for free.

Start with extraction

Documents are processed to capture text, layout, and structure. Agents identify and fix common errors.

Apply instructions automatically

Instructions provided at upload are executed across every page, consistently.

Proofread with context

Issues can be reviewed and corrected across the document or within specific sections through a simple interface.

Take actions and retain context

Agents perform tasks and maintain memory, improving consistency across workflows over time.

Questions? Answers.

Try Akshar. See the structured output.

Try Akshar.

See the structured output.

Intelligent Document Digitisation,
built for India.