Document Full-Text Search
Full-text search across native PDFs, Word, Excel, PowerPoint and OCR-extracted text from scanned PDFs and images. Filter by counterparty, period, document type, status. Match-in-context returns the file plus the page where the term appears.
How it works
Native PDF text is indexed directly. Scanned PDFs and images run through OCR; text in Word, Excel and PowerPoint files is extracted from the native format. The index is structured so that search returns precise matches.
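The routing described above can be sketched in a few lines of Python. This is an illustration only, not the product's implementation: the function name `choose_pipeline`, the pipeline labels, the extension lists and the `has_text_layer` flag are all assumptions made for the example.

```python
from pathlib import Path

# Hypothetical extension groups; the real product may support other formats.
OFFICE_EXTENSIONS = {".docx", ".xlsx", ".pptx"}
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def choose_pipeline(filename, has_text_layer=False):
    """Pick a text-extraction route for an uploaded file.

    has_text_layer is an assumed flag saying whether a PDF already
    carries embedded text (native) or is only page images (scanned).
    """
    ext = Path(filename).suffix.lower()
    if ext == ".pdf":
        # Native PDFs are indexed directly; scanned PDFs need OCR first.
        return "native_pdf_text" if has_text_layer else "ocr"
    if ext in IMAGE_EXTENSIONS:
        return "ocr"
    if ext in OFFICE_EXTENSIONS:
        return "office_native_text"
    raise ValueError(f"unsupported file type: {filename}")


print(choose_pipeline("msa_acme.pdf", has_text_layer=True))
print(choose_pipeline("challan_scan.pdf"))
print(choose_pipeline("board_resolution.docx"))
```

In this sketch the only decision that needs inspection of the file itself is whether a PDF has a text layer; every other route follows from the extension.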
A search for "limitation of liability" returns every contract with the matching clause. The result shows the file, the surrounding context, the page number and a direct link.
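As a rough illustration of match-in-context, the sketch below scans page-level text for a phrase and returns the file, the page number and a snippet around the match. It assumes text is stored per page; the `Hit` type, the `search_phrase` function and the sample file names are hypothetical, and a production index would use an inverted index instead of this linear scan.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    file: str
    page: int
    context: str


def search_phrase(pages, phrase, width=30):
    """Return one Hit per page whose text contains the phrase.

    pages maps (file, page_number) to that page's extracted text.
    Matching is case-insensitive; width is the number of characters
    of context kept on each side of the match.
    """
    needle = phrase.lower()
    hits = []
    for (file, page), text in sorted(pages.items()):
        pos = text.lower().find(needle)
        if pos == -1:
            continue
        start = max(0, pos - width)
        end = min(len(text), pos + len(needle) + width)
        hits.append(Hit(file, page, text[start:end].strip()))
    return hits


# Made-up sample data for the example.
pages = {
    ("msa_acme.pdf", 1): "This Master Services Agreement is made between the parties.",
    ("msa_acme.pdf", 7): "12. Limitation of Liability. Neither party shall be liable for indirect loss.",
    ("nda_globex.pdf", 2): "Confidential information excludes public data.",
}

for hit in search_phrase(pages, "limitation of liability"):
    print(hit.file, hit.page, hit.context)
```

Keeping the page number alongside the text at index time is what lets a result point at the page instead of only the file.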
Filter by counterparty, period, document type (contract, KYC, challan, board resolution) and status. Combining filters narrows the results to the right document quickly.
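Combined filters behave like a logical AND over document metadata. The sketch below shows one way to express that, assuming each document carries counterparty, period, document type and status fields; `DocMeta`, `apply_filters` and the sample records are hypothetical names and data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DocMeta:
    file: str
    counterparty: str
    period: str
    doc_type: str
    status: str


def apply_filters(docs, **criteria):
    """Keep documents whose metadata equals every supplied criterion."""
    return [
        doc for doc in docs
        if all(getattr(doc, key) == value for key, value in criteria.items())
    ]


# Made-up sample data for the example.
docs = [
    DocMeta("msa_acme.pdf", "Acme", "FY 2024", "contract", "active"),
    DocMeta("kyc_acme.pdf", "Acme", "FY 2024", "KYC", "active"),
    DocMeta("msa_globex.pdf", "Globex", "FY 2023", "contract", "expired"),
]

narrowed = apply_filters(docs, counterparty="Acme", doc_type="contract")
print([doc.file for doc in narrowed])
```

Each additional criterion can only shrink the result set, which is why stacking filters reaches a single document quickly.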
Save a search to re-run later. Set an alert so that the right person is notified automatically when a new document matches the search criteria.
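A saved search with an alert can be thought of as a stored query that is re-evaluated against each new upload. The following is a minimal sketch under that assumption; `SavedSearch`, `alerts_for_upload`, the `notify` field and the email addresses are invented for illustration and do not describe the product's actual API.

```python
from dataclasses import dataclass, field


@dataclass
class SavedSearch:
    name: str
    phrase: str
    filters: dict = field(default_factory=dict)
    notify: str = ""  # recipient of the alert; empty means no alert

    def matches(self, text, metadata):
        """True when the phrase occurs in the text and all filters match."""
        if self.phrase.lower() not in text.lower():
            return False
        return all(metadata.get(k) == v for k, v in self.filters.items())


def alerts_for_upload(saved_searches, text, metadata):
    """Return (recipient, search name) pairs triggered by a new document."""
    return [
        (search.notify, search.name)
        for search in saved_searches
        if search.notify and search.matches(text, metadata)
    ]


# Made-up sample data for the example.
searches = [
    SavedSearch("Liability clauses", "limitation of liability",
                {"doc_type": "contract"}, "legal@example.com"),
    SavedSearch("KYC refresh", "kyc", {"doc_type": "KYC"}, "compliance@example.com"),
]

new_text = "12. Limitation of Liability. Neither party shall be liable for indirect loss."
new_meta = {"doc_type": "contract", "counterparty": "Acme"}
print(alerts_for_upload(searches, new_text, new_meta))
```

Running the saved criteria at upload time, instead of on a schedule, is what makes the notification arrive as soon as the document is indexed.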
What the system does
| Capability | Input | Output |
|---|---|---|
| Native text indexing | PDF, Word, Excel, PowerPoint | Searchable text index |
| OCR for scans | Scanned PDFs + images | Extracted text in index |
| Match-in-context | Search query | File + page + surrounding context |
| Filter combinations | Counterparty + period + type + status | Narrowed result set |
| Saved searches | Query + criteria | Re-runnable, alert-able query |
Compliance + integrations
Search is scoped to the documents the user has permission to see. A controller sees their entity's documents; the auditor sees what the engagement scope allows. The search index honours every access control.
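One way to picture permission-scoped search is to filter the candidate documents by the user's scope before any matching happens, so out-of-scope documents can never appear in results. The sketch below assumes a scope made of allowed entities and, optionally, allowed periods; `Doc`, `Scope` and `scoped_search` are hypothetical names, not the product's data model.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Doc:
    file: str
    entity: str
    period: str
    text: str


@dataclass(frozen=True)
class Scope:
    entities: frozenset
    periods: Optional[frozenset] = None  # None means every period is allowed


def scoped_search(docs, scope, phrase):
    """Apply the permission boundary first, then match the phrase."""
    visible = [
        doc for doc in docs
        if doc.entity in scope.entities
        and (scope.periods is None or doc.period in scope.periods)
    ]
    needle = phrase.lower()
    return [doc.file for doc in visible if needle in doc.text.lower()]


# Made-up sample data for the example.
docs = [
    Doc("a_contract.pdf", "Entity A", "FY 2024", "Limitation of liability applies."),
    Doc("b_contract.pdf", "Entity B", "FY 2024", "Limitation of liability applies."),
    Doc("a_old.pdf", "Entity A", "FY 2023", "Limitation of liability applies."),
]

controller = Scope(entities=frozenset({"Entity A"}))
auditor = Scope(entities=frozenset({"Entity A", "Entity B"}),
                periods=frozenset({"FY 2024"}))

print(scoped_search(docs, controller, "limitation of liability"))
print(scoped_search(docs, auditor, "limitation of liability"))
```

Filtering before matching, and not after, means result counts and snippets never leak the existence of documents outside the user's scope.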
Regulations we work within
DPDP Act 2023
Personal-data fields are hidden in search results when the user lacks consent.
Rule 11(g), Companies (Audit and Auditors) Rules, 2014
Search activity is logged for the audit trail.
Document Full-Text Search FAQ
The OCR pipeline supports English plus major Indian languages, including Hindi, Tamil, Marathi, Bengali, Gujarati, Kannada, Malayalam, Telugu and Punjabi. Mixed-language documents, which are common in Indian B2B contexts, are handled.
The search index honours permissions per user. A controller for entity A sees only entity A documents in search results. An auditor with engagement scope on FY 2024 sees only that scope. The permission boundary is applied before results are returned.
Typical queries across 100,000 documents return in under a second. For larger archives (1 million+ documents), responses come back in under 3 seconds. The index is updated incrementally on upload, so new documents are searchable within minutes.
More in Document Management
MSAs, addendums, NDAs, service orders. Searchable. Renewal alerts.
See Contract Management
Per-filing evidence packs auto-assembled. One-click pack export with hash-verified chain.
See Audit Evidence Packs
Replace a document; the old version stays. Audit trail of every upload, edit and delete.
See Document Versioning
Free trial. Upload contracts and challans. Search for any term. The result lands on the matching clause, the matching invoice line or the matching board resolution within seconds.