AI.DI by imkore — Document Intelligence Platform

Overview · Tab 01

The Document Intelligence Category Just Changed.

For thirty years, document management meant storage, organization, and search. AI.DI redefines the category entirely: Document Intelligence means every document is certified, extracted, scored, distributed, and queryable by any AI system — automatically, continuously, at scale. No incumbent has built this. We have.

Full Platform Architecture — Reproduced from AI.DI Statement of Outcomes

The Intelligence Flywheel — Why It Compounds

Compounding Platform Value

Documents flow into Document Gateway. Abstract.DI extracts intelligence from every one. Sentry fingerprints and certifies them. The Document Warehouse stores all of it as structured, queryable data. The Warehouse improves Abstract.DI model accuracy. Better accuracy improves Sentry signals. Better signals make Document Gateway more valuable. More value drives more documents. After 18 months, switching costs are effectively permanent — and accuracy measurably exceeds any out-of-the-box alternative.

Platform Layer Stack — Complete Feature Inventory

Document Gateway — Exchange & Distribution Engine

Check-In Studio · Distribution Studio · Transaction Rooms · ML Learning Studio · 200+ components · 29 edge functions

Core OSAny industry

The central operating system for every document. Replaces Box, SharePoint, Egnyte as the primary system of record while connecting to any of them as migration sources. React/TypeScript/Vite + Supabase + Deno Edge Functions + Cloudflare R2 storage.

Check-In Studio

AI-powered document intake with AbstractIQ auto-classification, batch template mode, session history, rejection/resubmission pipeline, and external submitter portal.

Distribution Studio

Unified hub replacing 6 legacy distribution workflows. Standing distributions, serialized delivery, access tracking, client branding engine, full audit trail.

ML Learning Studio

30 self-improving AI engines across 6 capability tiers. Org-specific model weights. 4 map views. Continuous accuracy improvement.

Abstract.DI — AI Extraction Engine

Any doc type · 94% day-one confidence · 100K batch chunks · GPU OCR · Anomaly detection · Custom schema builder

AI-nativeCore moat

Reads every document and converts it into structured, queryable intelligence. Multi-pass pipeline: OCR → classification → extraction → confidence scoring → anomaly detection → warehouse write. Different models optimized per document type.

Batch Engine

Process entire archives in 100K-document chunks. ZIP, Box, SharePoint, S3. Output to Excel, JSON, CSV, or Warehouse.

Any Document Type

Custom schema builder for proprietary types — hours not months. Pre-built schemas for legal, financial, compliance, real estate.

GPU OCR

DocTR engine. CPU and GPU — 10x–50x speedup on GPU. Selective OCR for maximum cost efficiency.

Sentry Document Assurance — Trust & Compliance Monitor

~10,000 fingerprints · Zero doc storage · Patent pending · GDPR/HIPAA/SEC by architecture · 10-100x faster search v2

TrustCompliance

Deterministic mathematical fingerprinting. Zero document storage — only immutable fingerprints. Three types: Document Content, Document Data, Trusted Data Fingerprints (unique in market — fingerprint individual database rows, find every document referencing that entity).

Duplicate Elimination

40%+ industry average duplicate rate = 40% wasted AI spend. 30-50% LLM cost reduction immediately.

Cross-System Search

Search SharePoint, OneDrive, Windows Share, email archives, ERP, FileStar simultaneously. No tags. No training.

PII Redaction

Auto-detects and redacts SSNs, financial IDs, tax IDs before fingerprint storage. GDPR data minimization by mathematics.

AI.DI Document Warehouse — Structured Intelligence Layer

PostgreSQL · SQL/GraphQL/REST · Snowflake · Databricks · MCP · Vector embeddings · 6 query views

Data moatBI connectors

The biggest differentiator in the market — and it doesn't exist anywhere else. Every document Abstract.DI processes becomes structured rows in PostgreSQL. Every extracted field is queryable data. Every AI signal is persisted as a structured record.

6 Query Views

List · Library · Cube (pivot) · Time series · Schema · Scientist mode. Every dimension instantly explorable.

BI Connectors

Snowflake Data Share, Databricks, Tableau, Power BI, dbt, BigQuery, Python SDK. Zero ETL overhead.

Event Streaming

Webhook Manager fires events on every platform action. Real-time pipeline triggers for any downstream system.

AI Orchestration & Agent Gateway

LLM-agnostic · Live MCP server · RAG foundation · OAuth2/OIDC · Zero hallucination

AI agentsRAG substrate

The AI layer that makes every enterprise LLM investment actually work. Not competing with LLMs — the prerequisite. Works with Copilot, GPT-4, Claude, Gemini, Llama, or any custom LLM. Live MCP server callable from Claude, Cursor, LangChain, AutoGen.

Live MCP Server

Production-ready. Callable from any MCP-compatible environment. No custom integration layer required.

LLM-Agnostic

Client chooses the AI model; AI.DI provides the trusted foundation. No vendor lock-in to any LLM.

RAG Foundation

Every chunk provenance-tracked, every answer traceable to a specific certified document version. Zero hallucination.

Millennia FileStar — Document Governance & Fabric

45 enterprise clients · 7,500 users · ~$1M ARR · 20+ year trust base · Zero-CAC upsell on-ramp

Installed baseOn-ramp

The governance engine and most defensible competitive moat. 45 enterprise clients with 7,500 users cannot be cold-called by any competitor. FileStar governs document lifecycle and syncs all metadata to the AI.DI Warehouse.

Document Fabric

Structured lifecycle governance for any document type. Configurable routing, approval chains, escalation paths.

AI.DI Integration

FileStar governs; Warehouse stores; Sentry certifies; Abstract.DI extracts — automatically on every FileStar doc.

Installed Base

8–10 Phase 1 upgrade targets. 20+ year institutional trust that no competitor can replicate regardless of funding.

Platform Combinations — Entry to Full Suite

DG alone

Day-one value. Replaces Box.

Document storage, hierarchy, roles, and lifecycle management for any organization. 30-day deployment. No implementation project.

"We're better than Box at organizing documents for your specific org structure. Same price. Setup in a week."

DG+Abstract.DI

The intelligence upgrade.

Every document uploaded turns into structured, searchable data automatically. No manual tagging. 94% accuracy on day one.

"Every document you upload now tells us what it says, who it's about, and when it expires — without you doing anything."

DG+Sentry

The compliance overlay.

Sentry wraps any existing repository — Box, SharePoint, Egnyte — and adds continuous compliance monitoring without replacing storage.

"Keep Box. We add the trust layer that tells you what's missing, what's expired, and what's been altered."

All six engines

The platform. The moat.

All six engines create a compounding intelligence flywheel. The platform becomes permanently irreplaceable.

"Your documents are now a structured, AI-certified, queryable intelligence asset. This is a different category."

"M-Files has been building their AI product since 2019. Box added AI features in 2022. SharePoint Copilot launched in 2023. None of them started from scratch. None of them can. We did. That is the only advantage that cannot be copied."

— Platform architecture principle

Overview · Tab 02

AI Intelligence — Built Ground-Up. Not Bolted On.

Every incumbent document platform bolted AI on top of a 20-year-old data model. AI.DI was architected in 2024 with AI as the primary actor — not the afterthought. 27 engines running in production. 30 self-improving ML models. A live MCP server connectable to any LLM today. This is not a roadmap.

The Competitive Barrier — Stated Plainly

Competitors have slides about AI. AI.DI has 200+ React/TypeScript components, 29 live serverless edge functions, an ML Learning Studio with 30 self-improving engines, a live MCP server, an AI Agent Gateway connecting to Claude/Copilot/GPT-4/Gemini, and a production AI.DI Studio running 27 active AI engines. The gap between what competitors promise and what we have already shipped is measured in years of engineering. This is the unfair advantage that cannot be purchased with a VC round.

The AI.DI Studio — 27 Active Engines Running in Production

documentgateway.ai

Click to enlarge

AI.DI Studio — Real-Time Intelligence Infrastructure

AI Intelligence · 27 Active Engines · 5 Capability Domains

27 AI engines running simultaneously across 5 capability domains — not planned, not in beta, not slides — running in production right now against real documents. Each column is a domain: AI Core handles classification, extraction, confidence scoring, and the HITL Reduction meta-engine that makes the entire platform self-improving. Intelligence manages deep document comprehension, cross-document validation, expiry detection, and the document type registry that routes every document to the right schema. Process automates the operational pipeline — OCR, obligation extraction, approval routing, distribution rule execution, workflow management, and industry-specific feature configuration. Trust & Security enforces immutability through blockchain anchoring, continuous tamper detection, and database-level access control. Data/Integration maintains the warehouse sync, API gateway, registry, and storage optimization engines that keep the entire data pipeline running at scale. Every node shows live metrics: documents processed, classifications made, conflicts detected, connections active. This is the intelligence infrastructure. No competitor has built it. No competitor can.

HITL Reduction — The Self-Improving Loop

Human-In-The-Loop Reduction Architecture

The HITL Reduction AI engine monitors all other engines' human review rates and autonomously moves classifications to auto-approve when confidence consistently exceeds configurable thresholds. Standard document types trend toward zero human intervention at 12 months. Novel or edge-case documents always retain human oversight — the goal is the right humans reviewing the right exceptions, not zero humans.

Legacy Platform HITL at 12 Months

Standard contracts65%

Insurance certificates55%

Financial statements70%

Fixed model weights. No production learning. Same cost and error rate at month 12 as month 1.

AI.DI HITL at 12 Months

Standard contracts8%

Insurance certificates5%

Financial statements12%

Continuous production learning. Every human correction retrains the model automatically. No ML engineers required.

Blockchain Engine — Immutable Audit Trail

documentgateway.ai

AI.DI Studio — Blockchain Engine · On-Chain Document Integrity

Click to enlarge

AISENTRY

AI.DI Studio — Blockchain Engine · On-Chain Document Integrity

AI Intelligence · Trust Engine · Ethereum / Hedera / Polygon

2,814 documents have been anchored on-chain through this engine — each one generating a Merkle tree hash committed to Ethereum, Hedera, or Polygon as an immutable proof of existence and content at a specific point in time. The HITL Reduction panel shows 100% automated — meaning the Blockchain Engine requires zero human review at this tenant after 12 months, because blockchain anchoring is deterministic: if the fingerprint matches, it anchors; no judgment required. The Pages Powered By This Engine panel shows exactly where this engine's certifications surface in the UI: Document Vault, Asset Vault, and Verification Portal. This is not a compliance checkbox — it is the infrastructure that allows a document to be presented to a regulator, counterparty, or auditor with cryptographic proof that its content has not changed since a specific timestamp. No document management platform on the market ships this out of the box.

Integration Studio — Connect Any AI Agent

documentgateway.ai

Integration Studio — Live AI Agent Gateway

Click to enlarge

AIORCHESTRATION

Integration Studio — Live AI Agent Gateway

AI Orchestration · MCP Server + 3 Connected AI Systems

This screen represents the moment enterprise AI deployment becomes real. Three AI systems are live and connected: Claude.ai, ChatGPT/OpenAI, FileStar. Each has read-only, row-level-security-enforced access to the entire certified document corpus through 6 production tools. The MCP Server URL is a live endpoint — paste it into Claude, Cursor, LangChain, or any MCP-compatible environment and the AI immediately gains the ability to search certified documents, check compliance status, retrieve obligations, query the warehouse, navigate the hierarchy, and retrieve signed document URLs. Keys are tenant-scoped, revocable instantly, and enforce the same RLS policies as the UI. The AI Agent Gateway panel on the left shows Microsoft Copilot, Gemini/Google, and Grok/xAI as available-to-add connections — meaning your entire AI vendor portfolio can query the same trusted document foundation. This is the infrastructure that makes every LLM investment in your organization actually work.

Integration Ecosystem — 28 Connectors

documentgateway.ai

Integration Studio — 28 Connectors Across Every Enterprise System

Click to enlarge

AIORCHESTRATION

Integration Studio — 28 Connectors Across Every Enterprise System

AI Orchestration · Full Connector Ecosystem

The integration architecture eliminates the "but we already use X" objection entirely. 28 connectors span every category an enterprise might already have in production. AI Agent Platforms: Claude.ai and ChatGPT are live-connected today; Microsoft Copilot, Grok/xAI, and Gemini are configured with setup wizards. CRE & PE Platforms: Yardi Voyager pulls leases and rent rolls directly, MRI Software delivers financial reports and lease abstracts, Argus Enterprise connects valuation models, Salesforce manages CRM documents and contracts, Juniper Square handles fund admin and investor documents, Dealpath delivers deal pipeline docs. Document Management: SharePoint, Google Drive, Box, OneDrive, Dropbox all connect as document sources. Dev & Monitoring: Sentry, Datadog, GitHub. Data & Warehouses: Snowflake, Databricks, BigQuery, Redshift, Tableau, Power BI. The AI Wizard button walks through each connection in guided steps — no IT project, no professional services, no custom integration work.

Document IQ — Conversational AI Over Your Certified Corpus

documentgateway.ai

Document IQ — AI-Powered Document Intelligence Assistant

Click to enlarge

AIORCHESTRATION

Document IQ — AI-Powered Document Intelligence Assistant

AI Orchestration · Conversational AI · Portfolio-Wide Access

Document IQ is what happens when you give an AI system access to a certified, structured document corpus instead of raw PDFs. With access to 18 assets across a portfolio, it can answer questions that would take a human analyst days: "What's missing from the vault?" surfaces every gap across every asset simultaneously. "Show critical risk items" aggregates all violation flags and expiry warnings into a single prioritized view. "Expiring in the next 30 days" is a precise query against structured expiry dates in the Warehouse — not a keyword search, not an approximation. Upload any file and Document IQ cross-references it against vault data in real time: upload a rent roll and it identifies which tenants aren't in the vault, which leases are missing, which figures don't match the abstracted data. This is not a chatbot bolted onto a document management system — it is an AI with structured, trusted data access that no general-purpose LLM can replicate without the Warehouse underneath it.

ML Learning Studio — 30 Engines, 6 Tiers

The Self-Improvement Architecture

Every legacy DMS has fixed classification models requiring expensive, time-consuming retraining. AI.DI's ML Learning Studio inverts this entirely — 30 engines improving continuously from production data, automatically, without engineering intervention. AI.DI gets cheaper and more accurate at scale. Every competitor's cost stays flat or increases.

Tier	Focus	Example Engines	HITL Trajectory
Tier 1 — Foundation	Document type classification	CRE Type Classifier, PE Type Classifier, Legal Type Classifier	Near-zero for covered types
Tier 2 — Entity	Named entity extraction	Party Extractor, Property Identifier, Fund/Entity Linker	5–15% at 6 months
Tier 3 — Date & Validity	Temporal signal extraction	Expiration Detector, Effective Date Parser, Renewal Classifier	Near-zero for standard formats
Tier 4 — Financial	Financial data extraction	Loan Terms Extractor, Rent Roll Parser, Appraisal Value Extractor	10–20% at 6 months
Tier 5 — Compliance	Compliance validation	Coverage Gap Detector, Compliance Flag Engine, Signature Validator	15–25% — domain expertise retained
Tier 6 — Cross-Document	Cross-document consistency	Portfolio Benchmark Engine, Anomaly Correlator, Reconciliation Engine	Complex analysis — strategic HITL

"We didn't build a document platform and add AI. We built an AI platform that happens to manage documents. The difference is not semantic. It is architectural. And architecture determines destiny."

— AI.DI platform design principle

Overview · Tab 03

The Honest Battle Table — Where We Win and Why It's Structural

We do not win on every dimension today. What matters is the architecture. An incumbent can add a feature. No incumbent can add a clean data model, a zero-legacy stack, or an AI engine designed in from the first line of code.

Why Legacy Platforms Cannot Catch Up

Box cannot rebuild their data model for AI without breaking 150,000 customers. SharePoint's incentive is to preserve Teams and Office revenue, not cannibalize Copilot. M-Files is 2–3 years behind on the data model and the Warehouse layer. Egnyte wins on storage reliability but has no awareness of what documents contain. Every dollar these platforms invest in AI is constrained by the need to not break existing products. That constraint does not exist for AI.DI.

Full Capability Matrix

Capability	AI.DI Platform	Box	SharePoint	M-Files	Egnyte
Architecture & Philosophy
AI-native architecture (built for AI, not adapted)	Win 2024-2025. Zero compromise. AI is core, not a wrapper.	Bolt-on	Copilot wrapper	Aino — improving but bolt-on	Minimal
Zero legacy technical debt	Win No codebase older than 18 months.	2005 origin	2001 origin	2003 origin	2009 origin
Edge compute architecture	Win All compute at edge. Scale to zero or infinity.	None	Azure Functions (partial)	None	None
Modular adoption (standalone or full suite)	Win Every engine has standalone value.	Partial	Module-based but complex	Partial	Partial
AI & Document Intelligence
Structured data extraction from documents	Win Abstract.DI — any type, 94% day-one, 100K batch.	None	Basic Copilot extraction	Aino — requires training	None
Day-one extraction accuracy (no training)	Win 94%+ on pre-built schemas. No training required.	N/A	N/A	Months of training	N/A
GPU-accelerated OCR pipeline	Win DocTR — 10-50x speedup on GPU.	None	Azure OCR (limited)	Basic OCR	Basic OCR
Batch processing (100K+ archives)	Win 100K-chunk batch. ZIP, Box, SharePoint, S3.	None	None	Limited batch	None
30 self-improving ML engines	Win Continuous production learning. No ML engineers.	None	Generic Copilot	Limited self-learning	None
HITL Reduction AI (autonomous meta-engine)	Win Autonomous promotion of high-confidence classifications.	None	None	None	None
Trust, Compliance & Security
Document fingerprinting (deterministic, patent-pending)	Win ~10,000 fingerprint catalog. Zero doc storage.	None	None	None	None
Zero document storage compliance model	Win Only fingerprints stored. GDPR minimization by math.	Full storage	Full storage	Full storage	Full storage
PII auto-detection and redaction pipeline	Win Tokenization pipeline auto-redacts at ingestion.	None	Purview (partial)	None	DLP (partial)
Fraud / document manipulation detection	Win Deterministic — single character change detectable.	None	None	None	None
Blockchain audit trail	Win On-chain anchoring. 2,814+ documents on chain.	None	None	None	None
Data & AI Infrastructure
Structured document intelligence warehouse	Win Every extracted field is a queryable row. Unique.	None	None	None	None
Snowflake Data Share (zero-ETL)	Win Zero-copy. Join doc intelligence with financial data.	None	None	None	None
Live MCP server for AI agents	Win Production MCP. Claude, Cursor, LangChain — no wrapper.	None	None	None	None
Vector embeddings on certified chunks	Win Tied to certified versions. pg_vector native.	None	Azure AI Search (partial)	None	None
CTR Score (Continuous Transaction Readiness)	Win Live composite readiness score. Portfolio-wide.	None	None	None	None
27 active AI engines in production	Win AI.DI Studio — live engine map with real-time status.	None	None	None	None
Deployment & Integration
Unlimited hierarchy depth (any org structure)	Win Enterprise → Group → Entity → Asset → Unit. Any depth.	Folders only	Sites/subsites	Metadata-based	Folders/workspaces
30-day deployment (no implementation project)	Win 30 days from contract to live. M-Files runs 3–6 months.	Weeks–months	Months–years	3–6 months typical	Weeks–months
Installed base / existing trust relationships	Win 45 FileStar enterprise clients. 20+ year relationships. Zero-CAC.	Large (hard to access)	Large (bundled)	Existing clients	Existing clients

One-Line Positioning Per Competitor

vs. Box

"Box stores it. We understand it."

Never compete with Box on storage. Lead with the batch engine demo — upload a folder of contracts, produce a structured Excel workbook in 4 hours. Box produces a list of file names. Do not ask them to cancel Box. Ask what Box actually tells them about their documents.

vs. SharePoint

"Keep SharePoint. Add intelligence."

Never demand they cancel SharePoint. "SharePoint manages collaboration. AI.DI manages the intelligence layer — extraction, assurance, readiness scoring — running on top of whatever you already have. You do not have to change anything to start."

vs. M-Files

"Day-one intelligence, not month-six."

M-Files Aino requires months of training. AbstractIQ delivers 90%+ confidence on day one for any document type — because schemas are pre-built and AI classification runs from the first upload. "You bring the documents. We bring the intelligence."

vs. Egnyte

"You know where your files are. We know what they say."

Egnyte wins on hybrid storage reliability and IT governance. It has no awareness of what documents contain. AbstractIQ connects to an Egnyte repository as an intelligence overlay. "Add intelligence to Egnyte" — the displacement happens organically once the intelligence layer is live.

"We are not trying to be a better Box. We are trying to make Box irrelevant — the same way Salesforce made Act! irrelevant. Not by being louder. By being categorically different."

— imkore category strategy · 2025

Overview · Feature Gallery

Platform Screenshots — Every Screen. Real Production Data.

Every screenshot is from the live production environment at documentgateway.ai. No mockups. No renders. This is what the platform looks like, running today, with real data. 22 screens across 7 product areas. Click any screenshot to enlarge.

documentgateway.ai

Check-In Studio — Intelligent Document Intake

Click to enlarge

GATEWAYAI

Check-In Studio — Intelligent Document Intake

Document Gateway · AI-Powered Intake Engine

Files dropped here enter a fully autonomous AI pipeline. AbstractIQ classifies document type, extracts fields, scores confidence at the field level, checks for duplicates against the ~10,000-fingerprint Sentry catalog, detects anomalies versus portfolio patterns, and routes to the correct steward queue — before any human sees it. Required documents surface as named cards organized by packet template: what's needed, what's fulfilled, what's outstanding, what was rejected with AI-identified reasons. As the corpus grows, the HITL Reduction engine autonomously promotes high-confidence document types to auto-approve. The platform learns continuously from every correction. Standard document types trend toward zero human review at 12 months.

documentgateway.ai

Check-In Engine — Per-Tenant AI Configuration

Click to enlarge

GATEWAYAI

Check-In Engine — Per-Tenant AI Configuration

Document Gateway · ML Thresholds & Live Performance

Every AI parameter is independently configurable per tenant. Auto-certify threshold (85%) means any document classified above that confidence approves without human review. Review threshold (65%) routes mid-confidence results to steward queue. Below that triggers rejection with AI-generated explanation. The AI Model Performance panel shows live production metrics: 78% auto-classify rate, 91% average confidence, 47 human corrections in 30 days — each correction is permanent training data. Model last retrained March 12 against a 6,421-document corpus, with no ML engineer involvement. The platform learns from its own production use.

documentgateway.ai

Click to enlarge

GATEWAYDEVELOPERS

Check-In API — Full Programmatic Ingest

Document Gateway · Developer Interface

The same AI pipeline powering the visual Check-In Studio is fully accessible via REST API. POST to /v1/checkin/bulk submits ZIP bundles of up to 10,000 documents in one call. The response returns a job ID; the full AbstractIQ pipeline runs asynchronously and fires webhook events at every stage: classified, extracted, named, certified, review_required, rejected — each with full payload including document type, confidence score, extracted fields, routing decision, and anomaly flags. Your existing systems get real-time notification at every pipeline stage without polling, enabling full downstream automation. The platform is not just a UI — it is a document intelligence API.

documentgateway.ai

Distribution Studio — Unified Distribution Hub

Click to enlarge

GATEWAY

Distribution Studio — Unified Distribution Hub

Document Gateway · Transaction Rooms · Packages · Share Links

This screen replaced six separate legacy distribution workflows. Transaction Rooms are secure, deal-type-aware environments tracking who accessed which documents, for how long, from which organization — with CTR Score progress, expiry countdown, and phase completion visible at a glance. Every distribution event is immutable: timestamped, version-locked, recipient-specific, fully auditable. The engagement data tells you more about counterparty interest than any conversation — which documents they spent the most time on, which sections they returned to, which they never opened.

documentgateway.ai

Distribution Builder — Three Guided Wizard Modes

Click to enlarge

GATEWAY

Distribution Builder — Three Guided Wizard Modes

Document Gateway · Distribution Wizard

The Builder pre-configures deal type, counterparty structure, access controls, expiry settings, NDA gate options, and QA thread settings automatically based on your distribution type selection. Transaction Room launches a 7-step wizard (6 deal types, section-level access matrix, Abstract.DI engagement signal integration, CTR Score gap alerts). Document Package builds curated bundles with custom cover letters and per-recipient permissions. Share Documents sends direct, tracked links in 2 guided steps. No configuration guesswork — the platform knows the right settings for each distribution type.

documentgateway.ai

Distribution Analytics — Counterparty Behavioral Intelligence

Click to enlarge

GATEWAYVALUE

Distribution Analytics — Counterparty Behavioral Intelligence

Document Gateway · Deal Analytics

Distribution data becomes deal intelligence. Room engagement by deal type shows objective counterparty interest metrics that no email thread can provide. Phase completion rates reveal where deals stall portfolio-wide. The document access heatmap shows which document types drive the most engagement per counterparty per room — signaling where interest is concentrated before a conversation happens. This is not analytics. It is a behavioral signal layer over every transaction.

documentgateway.ai

Abstract.DI — Structured Extraction from Any Document

Click to enlarge

ABSTRACTAI

Abstract.DI — Structured Extraction from Any Document

Abstract.DI · AI Extraction Engine · Any Document Type

Any document becomes a structured database record. The left panel shows the original document as received. The right panel shows every field extracted: parties, dates, financial terms, obligations, conditions, signatures, execution status — organized into typed field groups with individual confidence scores. Every extracted field is a live source link: click any field and the document scrolls to the exact source text it derived from. All extracted data is immediately written to the Document Warehouse as queryable PostgreSQL rows, available to any BI tool, API, or AI agent the moment extraction completes. This turns a folder of any document type into a structured database — automatically.

documentgateway.ai

Click to enlarge

AI.DI Studio — 27 Active Engines in Production

AI Intelligence · Real-Time Intelligence Infrastructure

27 AI engines running simultaneously across 5 capability domains. AI Core handles classification, extraction, confidence scoring, and the HITL Reduction meta-engine. Intelligence manages deep document comprehension, cross-document validation, and expiry detection. Process automates OCR, obligation extraction, approval routing, and workflow management. Trust & Security enforces immutability through blockchain anchoring and continuous tamper detection. Data/Integration maintains warehouse sync, API gateway, and storage optimization. Every node shows live metrics. No competitor has built this.

documentgateway.ai

Blockchain Engine — On-Chain Document Integrity

Click to enlarge

AISENTRY

Blockchain Engine — On-Chain Document Integrity

AI Intelligence · Trust Engine · Ethereum / Hedera / Polygon

2,814 documents anchored on-chain with Merkle tree hashes committed to Ethereum, Hedera, or Polygon — creating cryptographic proof that a document's content has not changed since a specific timestamp. HITL Reduction shows 100% automated: blockchain anchoring is deterministic, requiring zero human judgment. Surfaces in Document Vault, Asset Vault, and Verification Portal. This is not a compliance feature — it is infrastructure that allows any document to be presented to a regulator, counterparty, or auditor with mathematical proof of integrity.

documentgateway.ai

Click to enlarge

ORCHESTRATIONAI

Integration Studio — Live AI Agent Gateway

AI Orchestration · MCP Server + 3 Connected AI Systems

Three AI systems live and connected: Claude.ai, ChatGPT/OpenAI, FileStar. 6 production tools available: search_documents, get_compliance_status, get_obligations, query_warehouse, get_hierarchy, get_document_url. The MCP Server URL is a live endpoint — paste it into Claude, Cursor, or LangChain and the AI immediately gains certified document access. Keys are tenant-scoped, revocable instantly, and enforce the same RLS policies as the UI. Microsoft Copilot, Gemini, and Grok are configured and ready to add. No other document platform ships a production MCP server.

documentgateway.ai

Integration Studio — 28 Enterprise Connectors

Click to enlarge

ORCHESTRATIONAI

Integration Studio — 28 Enterprise Connectors

AI Orchestration · Full Connector Ecosystem

28 connectors across AI Agent Platforms, CRE & PE industry systems (Yardi, MRI, Argus, Salesforce, Juniper, Dealpath), Document Management (SharePoint, Google Drive, Box, OneDrive, Dropbox), Dev & Monitoring (Sentry, Datadog, GitHub), and Data Warehouses. The AI Wizard guides each connection in steps — no custom code, no IT project. AI.DI adds intelligence to your existing infrastructure rather than requiring you to abandon it.

documentgateway.ai

Click to enlarge

ORCHESTRATIONAI

Document IQ — AI-Powered Document Intelligence Assistant

AI Orchestration · Conversational AI · Portfolio-Wide Access

AI with access to a structured, certified corpus instead of raw PDFs. "What's missing from the vault?" is a real-time completeness calculation across every asset simultaneously. "Show critical risk items" aggregates all violation flags and expiry warnings portfolio-wide in seconds. Upload any file and Document IQ cross-references it against vault records — identifying discrepancies, missing documents, and data conflicts without manual comparison reports. This is what AI looks like when it has trusted data underneath it.

documentgateway.ai

Click to enlarge

WAREHOUSE

AI.DI Document Warehouse — Your Corpus as Structured Data

Warehouse · Live Query Interface · 9,857 Documents

Every row is a record in PostgreSQL with typed columns for every field Abstract.DI extracted. The Abstract.DI query bar accepts natural language and returns structured results — because the underlying data is structured, not a keyword search across unstructured text. Six view modes let a compliance officer, data analyst, AI engineer, and CFO each see the view that matches their workflow. This is the first document system where documents are a side effect — the real product is a continuously enriched, AI-maintained structured database of everything your organization has ever received, produced, or executed.

documentgateway.ai

Warehouse Studio — Zero-ETL BI Connectors

Click to enlarge

WAREHOUSEDEVELOPERS

Warehouse Studio — Zero-ETL BI Connectors

Warehouse · Snowflake · Databricks · 9 Connectors

Snowflake receives rows on a 15-minute incremental sync via Data Share — zero-copy, zero ETL. Databricks connects via Delta Lake. BigQuery, Redshift, Tableau, Power BI, dbt Cloud, and a Python SDK are all one-click configuration. The Warehouse is not just queryable — it is the most current, most complete, most structured view of your document estate available to every analytics system you operate simultaneously.

documentgateway.ai

Data Lineage Map — End-to-End Data Provenance

Click to enlarge

WAREHOUSEDEVELOPERS

Data Lineage Map — End-to-End Data Provenance

Warehouse · 15 Nodes · 17 Connections

Every piece of intelligence has a traceable origin. Source documents flow through ingestion, deduplication, fingerprinting, Abstract.DI extraction, and warehouse storage before reaching consumers (MCP Server, Snowflake, Webhooks). Stale nodes are visually flagged proactively. This is the audit trail that answers "where did this AI answer come from?" for every query, extraction, and alert the platform produces. Provenance is not an afterthought — it is the foundation.

documentgateway.ai

Document Type Studio — Complete Document Vocabulary

Click to enlarge

GATEWAYADMIN

Document Type Studio — Complete Document Vocabulary

Document Gateway · AI-Powered Taxonomy

The document type taxonomy is the foundation everything else is built on — classification, extraction schemas, routing rules, compliance requirements, and CTR Score calculations all reference this vocabulary. Pre-built libraries for every major industry. AI Generate suggests new types from your own document corpus patterns. Essential types drive CTR Score calculations; Elective types are tracked but don't penalize. The complete document vocabulary for your organization — maintained automatically.

documentgateway.ai

Hierarchy Studio — Any Org Structure, Any Depth

Click to enlarge

GATEWAYADMIN

Hierarchy Studio — Any Org Structure, Any Depth

Document Gateway · Organization Architecture

Not a folder structure — a typed, dimensional organizational model where every node carries permissions, document requirements, CTR Score calculations, process templates, library assignments, and AI extraction schemas. Any industry org structure configures without code or professional services. Every node becomes a first-class citizen in the Document Warehouse — queryable, scoreable, and connectable to any AI agent via the MCP server.

documentgateway.ai

Process Library — 11 Pre-Built Transaction Workflows

Click to enlarge

GATEWAYADMIN

Process Library — 11 Pre-Built Transaction Workflows

Document Gateway · Workflow Automation

Institutional knowledge codified into launchable templates. Every complex document transaction follows a predictable phase sequence — the Process Library captures that sequence and makes it repeatable, measurable, and improvable. Launching a process creates an active tracking instance with phase-by-phase document completeness, automatic CTR Score updates as documents are fulfilled, and distribution-ready packaging when all phases complete. Ad-hoc scramble becomes a managed, auditable workflow.

documentgateway.ai

Platform Masters — Document Status Workflow Engine

Click to enlarge

GATEWAYADMIN

Platform Masters — Document Status Workflow Engine

Document Gateway · Status Configuration

Document statuses are workflow triggers, not labels. Submitted auto-routes to review queue. Approved fires the Approval Engine. Expired triggers the Violation Engine. Sentry Certified records an immutable fingerprint — can only be assigned by the fingerprinting pipeline, never manually. Terminal flags prevent override. The AI Suggestions panel detects gaps in your current status workflow and proposes additions based on your document patterns. The platform configures itself.

documentgateway.ai

Roles & Permissions — 9 Roles × 138 Features

Click to enlarge

GATEWAYADMINDEVELOPERS

Roles & Permissions — 9 Roles × 138 Features

Document Gateway · Identity & Access Control

138 features. 9 roles. 4 tiers. Row Level Security enforced at the database layer — not the application layer — meaning direct API access and MCP agent connections respect the same access boundaries as the UI. A Steward at a specific hierarchy node sees only their assigned assets. No orphaned permissions. No over-provisioned service accounts. Security is structural, not configured.

documentgateway.ai

Storage Manager — Automated Document Lifecycle

Click to enlarge

GATEWAYADMIN

Storage Manager — Automated Document Lifecycle

Document Gateway · Storage Intelligence

Automated lifecycle policies eliminate the operational overhead of storage management at scale. Documents move from Hot to Warm to Archive automatically based on access patterns, certification status, and file characteristics. Certified documents are immutable — hot storage is wasteful, so they archive automatically. Compliance documents are retained for the regulatory minimum automatically. Storage cost optimization runs continuously without any human intervention.

documentgateway.ai

White Label Branding — Full Enterprise Identity

Click to enlarge

GATEWAYADMIN

White Label Branding — Full Enterprise Identity

Document Gateway · Enterprise Branding

Every customer-facing surface carries your organization's identity. Document Banner, Login Page, Email Templates, Certificates of Authenticity, Shared Viewer links — all branded to your organization. Master Logos propagate automatically to all surfaces on upload. Each surface can be individually overridden. For institutional clients sharing documents with investors, lenders, or regulators, the platform presents entirely as their own product — no "Powered by imkore" in any counterparty-facing experience.

Products · Tab 05

Document Gateway — The Operating System for Every Document

The central hub where documents live, roles are configured, compliance is monitored, and every other AI.DI engine plugs in. 200+ React/TypeScript components. 29 live serverless edge functions. Replaces Box, SharePoint, Egnyte as the primary system of record.

200+

React/TS Components

Live Edge Functions

30 days

Avg. Deployment

5-tier

Org Hierarchy

ML Engines

Role Types

Check-In Studio — AI-Powered Document Ingestion

documentgateway.ai

Click to enlarge

GATEWAYAI

Check-In Studio — Intelligent Document Intake

Document Gateway · The AI Intake Engine

Every file dropped here enters a multi-stage AI pipeline that runs entirely without human instruction. AbstractIQ classifies the document by type, extracts key fields, scores confidence, checks for duplicates, detects anomalies, and routes to the correct steward queue — all before a human sees it. Required documents are surfaced as named cards organized by packet template, so a steward's view is not a list of files but a structured set of obligations: what's needed, what's fulfilled, what's outstanding, and what was rejected with AI-identified reasons. The HITL Reduction AI continuously monitors which document types consistently reach auto-certify confidence and promotes them to bypass human review entirely. As your document corpus grows, the percentage of documents requiring human attention trends toward zero for standard types. This is not document management — it is an autonomous compliance engine that happens to accept file uploads.

Check-In Engine — AI Thresholds & Real-Time Performance

documentgateway.ai

Check-In Engine Settings — Configurable AI Per Tenant

Click to enlarge

GATEWAYAI

Check-In Engine Settings — Configurable AI Per Tenant

Document Gateway · Per-Tenant ML Configuration

Every AI parameter visible here is independently configurable per tenant — the same platform serves a 500-asset institutional portfolio and a 10-asset regional operator with different accuracy thresholds, different OCR pipelines, and different ML feedback settings. The auto-certify threshold (85%) means any document the AI classifies with 85%+ confidence is automatically approved without human review. The review threshold (65%) sends mid-confidence results to the steward queue. Below 65% triggers rejection with AI-generated explanation. The AI Model Performance panel shows live production metrics: auto-classify rate 78%, average confidence 91%, 47 human corrections logged in 30 days, model last retrained March 12 with a 6,421-document corpus. Those 47 corrections become training data — each human decision permanently improves future accuracy for that document type. No ML engineers. No retraining pipeline. The platform learns from its own production use continuously.

Check-In API & Webhook Integration

documentgateway.ai

Check-In API — Full Programmatic Document Ingest

Click to enlarge

GATEWAYDEVELOPERS

Check-In API — Full Programmatic Document Ingest

Document Gateway · Developer Interface

The same AI pipeline that powers the visual Check-In Studio is fully accessible through a REST API — meaning any internal system, any existing workflow, any document management tool can push files directly into the AI.DI pipeline without a user interface. POST to /v1/checkin/ingest for a single file; POST to /v1/checkin/bulk for ZIP bundles of up to 10,000 documents in one call. The response returns a job ID for status polling — the full AbstractIQ pipeline runs asynchronously and fires webhook events at every stage: classified, extracted, named, certified, review_required, rejected. Each event carries the full payload — document type, confidence score, extracted fields, routing decision, DG name, anomaly flags. This means your existing systems get real-time notification the moment a document reaches any status, enabling downstream automation without polling. The platform is not just a UI — it is a document intelligence API that happens to have an excellent UI.

Distribution Studio — Studio View

documentgateway.ai

Distribution Studio — The Unified Distribution Hub

Click to enlarge

GATEWAY

Distribution Studio — The Unified Distribution Hub

Document Gateway · Transaction Rooms · Packages · Share Links

This screen replaced six separate legacy distribution workflows — Transaction Rooms, Document Packages, Standing Distributions, Share Links, Distribution Rules, and Distribution Templates — all now visible and manageable from a single hub. Each Transaction Room is a secure, deal-type-aware environment that tracks exactly who has accessed which documents, for how long, from which organization — with CTR Score progress, expiry countdown, and phase completion all visible at a glance. Document Packages are curated bundles sent to specific recipients with custom cover letters, watermarks, NDA gates, and per-recipient download permissions. Every distribution event is immutable — timestamped, version-locked, recipient-specific, and fully auditable. The engagement data flowing from these rooms tells you more about your counterparty's interest level than any conversation: which documents they spent the most time on, which sections they returned to repeatedly, and which they never opened.

Distribution Studio — Builder & Templates

documentgateway.ai

Distribution Builder — Three Wizard Modes

Click to enlarge

GATEWAY

Distribution Builder — Three Wizard Modes

Document Gateway · Distribution Wizard

The Builder asks one question: what kind of distribution is this? The answer determines everything — deal type, counterparty structure, access controls, expiry configuration, NDA gate options, and QA thread settings — all pre-configured by the wizard based on the selection. Transaction Room launches a 7-step guided workflow covering deal type (CRE Refinancing, M&A/PE, LP Reporting, Asset Sale, JV Formation, General), counterparty org hierarchy, phase-based document structure, section-level access matrix, Abstract.DI engagement signal integration, QA thread with room-level threading, and CTR Score gap alert configuration. No guesswork. No "what settings should I use?" — the platform knows.

documentgateway.ai

Distribution Analytics — Counterparty Intelligence

Click to enlarge

GATEWAYVALUE

Distribution Analytics — Counterparty Intelligence

Document Gateway · Deal Analytics

The Analytics tab transforms distribution data into deal intelligence. Room engagement by deal type shows views per room, CTR percentage achieved, and QA threads open — giving deal teams an objective read on counterparty engagement that no email thread can provide. Phase completion rates from NDA & Setup through Closing show exactly where deals stall portfolio-wide. The document access heatmap reveals which document types drive the most views per counterparty per room — signaling where interest is concentrated before a conversation happens. When a counterparty spends 3 hours on the indemnification clause and then asks for a 15-day extension, you already know why. This is not analytics — it is a behavioral signal layer over every transaction.

Process Library — Pre-Built Transaction Workflows

documentgateway.ai

Click to enlarge

GATEWAYADMIN

Process Library — 11 Pre-Built Transaction Workflows

Document Gateway · Workflow Automation

Every complex document transaction — refinancing, acquisition, compliance audit, tenant coordination, property sale — follows a predictable sequence of document requirements organized in phases. The Process Library codifies that institutional knowledge into launchable templates. The Refinance/Lender Request template shown here has 7 phases (Legal & Ownership → Property & Physical → Leasing & Tenants → Financial Performance → Valuation → Existing Debt → Package & Delivery), 22 required documents across 24 total, and 45–60 day estimated duration. Launching a process from this library creates an active tracking instance with phase-by-phase document completeness, automatic CTR Score update as documents are fulfilled, and distribution-ready packaging when all phases are complete. The process library is the institutional knowledge layer that turns an ad-hoc scramble into a repeatable, measurable, improvable workflow.

Document Type Studio & Hierarchy Studio

documentgateway.ai

Click to enlarge

GATEWAYADMIN

Document Type Studio — Complete Document Vocabulary

Document Gateway · Document Taxonomy

The document type taxonomy is the foundation everything else is built on — classification, extraction schemas, routing rules, compliance requirements, and CTR Score calculations all reference this vocabulary. The Document Type Studio shows the full hierarchy: Commercial Real Estate and Private Equity libraries with subcategories, the Diligence library alone contains 62 document types (43 essential, 19 elective). Essential types drive CTR Score calculations — missing an Essential document drops the score. Elective types are tracked but don't penalize. The AI Generate button uses your existing document corpus to suggest new types your organization actually uses that aren't in the default catalog. The platform ships pre-built taxonomies for every major industry — configurable, extensible, and learnable from your own document patterns.

documentgateway.ai

Click to enlarge

GATEWAYADMIN

Hierarchy Studio — Any Org Structure, Any Depth

Document Gateway · Organization Architecture

The hierarchy is not a folder structure — it is a typed, dimensional organizational model where every node carries permissions, document requirements, CTR Score calculations, process templates, library assignments, and AI extraction schemas. The Traditional Private Equity template shown here (Enterprise GP → Fund → LP/Investor → Portfolio Company → Subsidiary/Division) maps exactly to how a PE fund operates — but the studio can configure any structure for any industry without code or professional services. CRE operators, corporate legal departments, financial institutions, and government agencies all configure different hierarchies from the same studio. Every node created here becomes a first-class citizen in the Document Warehouse — queryable, scoreable, and connectable to any AI agent via the MCP server.

Platform Configuration

documentgateway.ai

Click to enlarge

GATEWAYADMIN

Platform Masters — Document Status Workflow Engine

Document Gateway · Status Configuration

Document statuses are not labels — they are workflow triggers. Each status in this table drives a specific system behavior: Submitted auto-routes to review queue, Approved fires the Approval Engine, Expired triggers the Violation Engine, Sentry Certified records an immutable fingerprint in vault_records. The drag-to-reorder interface sets the logical default sequence, but the real power is in the terminal and certified flags — terminal statuses cannot be manually overridden, and certified statuses can only be assigned by the Sentry fingerprinting pipeline, never by a user. The AI Suggestions panel on the right uses your industry and document patterns to propose status additions — "Add AI Flagged for low-confidence classifications" appears because the system detected classifications below the review threshold that currently fall through to Needs Revision without a distinct routing path. This is the platform configuring itself.

documentgateway.ai

Click to enlarge

GATEWAYADMINDEVELOPERS

Roles & Permissions — 9 Roles × 138 Features

Document Gateway · Identity & Access

138 features. 9 roles. 4 tiers. This is enterprise access control with the granularity that regulated industries require. The Role Matrix shows exactly which features each role can access — filtered by Actions, Data, or Pages — with color-coded permission states (full access, limited, read-only, none). The 4-tier structure (System, Tenant, Hierarchy, Node) means a Steward at a specific hierarchy node can only see documents and actions relevant to their assigned assets — not portfolio-wide. Row Level Security enforcement happens at the database layer via Supabase RLS, not at the application layer — which means even direct API access or MCP agent connections respect the same access boundaries. No orphaned permissions. No over-provisioned service accounts. Security is structural, not configured.

Storage Management — Intelligent Lifecycle Automation

documentgateway.ai

Storage Manager — Automated Document Lifecycle Policies

Click to enlarge

GATEWAYADMIN

Storage Manager — Automated Document Lifecycle Policies

Document Gateway · Storage Intelligence

Documents cost money to store, process, and query — and most organizations keep everything in hot storage indefinitely because moving things manually never happens. Storage Manager automates the entire lifecycle through policy rules that run on configurable schedules. Auto-Warm After Inactivity moves documents not accessed in 30 days from Hot to Warm storage automatically. Archive Certified Docs moves Sentry-certified documents to Archive on an hourly schedule — certified documents are immutable by definition, so hot storage is wasteful. Hot Retention for Active keeps any document accessed in the last 7 days in Hot tier regardless of other rules. Each tier has a defined retention schedule: Hot is indefinite for active docs, Warm moves certified docs to Archive after 180 days, Archive retains compliance documents for 7 years minimum. The platform manages storage cost at scale without operational overhead.

White Label Branding

documentgateway.ai

Click to enlarge

GATEWAYADMIN

White Label Branding — Full Enterprise Identity Control

Document Gateway · Enterprise Branding

Every customer-facing surface of the platform — Document Banner, Login Page, Email Templates, Certificates of Authenticity, Shared Viewer links — carries your organization's identity, not imkore's. Upload Master Logos once (light mode and dark mode variants) and they propagate automatically to all surfaces. Each surface can also be individually overridden with a custom logo if different contexts require different branding. The Logo Across Surfaces panel on the right shows a real-time preview of exactly how your logo appears on each surface before you publish. For institutional clients sharing documents with investors, lenders, or regulatory bodies, the platform presents entirely as their own product. This is the infrastructure that allows a GP to present a Transaction Room to an LP counterparty with full institutional branding — no "Powered by imkore" anywhere in the counterparty experience.

Products · Tab 06

Abstract.DI — The Engine That Reads Your Documents

Abstract.DI does not summarize documents. It comprehends them — classifying every document type, extracting every meaningful field, scoring confidence at the field level, detecting anomalies against portfolio patterns, and writing all of it as structured, queryable data into the Document Warehouse. Any document. Any industry. 94%+ confidence out of the box. No training required.

ANY

Document Type

94%+

Day-One Confidence

100K

Batch Chunk Size

10–50x

GPU OCR Speedup

Day 1

Accuracy (not Month 6)

Abstract.DI in Action — AI-Powered Document Comprehension

documentgateway.ai

Abstract.DI — Structured Field Extraction from Any Document

Click to enlarge

ABSTRACTAI

Abstract.DI — Structured Field Extraction from Any Document

Abstract.DI · AI Extraction Engine · Any Document Type

What you are seeing here is a document — any document — being converted into structured, queryable intelligence in seconds. The left panel shows the original document exactly as it arrived. The right panel shows every field Abstract.DI extracted: parties, dates, financial terms, obligations, conditions, signatures, execution status — organized into typed field groups with individual confidence scores. Every highlighted passage in the document is a live link: click any extracted field and the document scrolls to the exact source text it was derived from. This is not an AI summary — it is a structured database record created from an unstructured document, with full provenance tracing from field value back to source text. The 94%+ confidence score is field-level, not document-level — you know exactly which fields the AI is certain about and which need review. All extracted data is immediately written to the Document Warehouse as queryable PostgreSQL rows, available to any BI tool, API consumer, or AI agent the moment extraction completes. This is the engine that turns a folder of PDFs into a structured database.

Multi-Pass Extraction Pipeline

Abstract.DI — Document to Intelligence Pipeline

Step 1

Document Ingestion

→

PDF, DOCX, XLSX, PPTX, MSG/EML, CSV, ZIP, JPEG/PNG/TIFF, DB records

Step 2

Selective OCR

→

DocTR engine · GPU 10–50x speedup · Multilingual · 8s timeout with fallback

Step 3

Classification

→

Any doc type · Claude Haiku inference · "AI.DI Named" badge · Confidence scoring

Step 4

Field Extraction

→

Type-specific schemas · Dates · Parties · Amounts · Obligations · Conditions

Step 5

Anomaly Detection

→

Cross-document consistency · Version comparison · Portfolio baseline deviations

↓

AI.DI Document Warehouse — Structured PostgreSQL Rows

All extracted fields stored as structured, queryable data. Available to BI tools, APIs, and AI agents instantly.

"Box shows you a file. We show you what the file says. Run both side by side. The demo closes itself."

— Abstract.DI positioning principle

Products · Tab 06

Sentry Document Assurance — "Shazam for Documents"

Deterministic mathematical fingerprinting — patent pending. Zero document storage. Zero PII exposure. GDPR, HIPAA, SEC, and APA compliant by architecture, not configuration. ~10,000 pre-built fingerprints. Three unique fingerprint types including Trusted Data Fingerprints — unique in the market.

~10K

Fingerprint Catalog

Documents Stored

10-100x

Search Speed (v2)

30-50%

LLM Cost Reduction

Patent

Pending Architecture

Three Fingerprint Types

Type 01

Document Content Fingerprints

Full textual and structural content of any document. Two identical documents always produce identical fingerprints. Any change — single word, date, number, comma — produces a measurably different fingerprint. Deterministic mathematics. Zero false positives. Used for certification, version tracking, and fraud detection.

Type 02

Document Data Fingerprints

Structured data extracted from document fields fingerprinted independently of full document content. Field-level matching — find every document containing the same lease term, coverage limit, or financial figure — without requiring full-text identity. Supports cross-document data validation.

Type 03

Trusted Data Fingerprints

Unique in the market. No equivalent exists anywhere. Fingerprint individual rows from Excel files and database tables — a tenant record, vendor entry, or asset row — then find every document in the enterprise that references that specific record. Given any entity: instantly surface every connected document across the entire corpus.

30–50% AI Cost Reduction — Immediate

Industry average: 40%+ of files are duplicates. A corpus with 40% duplicates means 40% of every LLM bill computes the same content twice. Sentry identifies all duplicates, consolidates to canonical records, preserves all metadata from every duplicate instance, then suppresses duplicates from AI queues. LLM compute costs drop 30–50% immediately — without changing a single prompt or model.

"The fingerprint never lies. The document might. Sentry tells you the difference."

— Sentry Document Assurance design principle

Products · Tab 08

AI.DI Document Warehouse — The Data Moat That Doesn't Exist Anywhere Else

Every document Abstract.DI processes becomes structured rows in PostgreSQL. Every extracted field is queryable data. Every AI signal is persisted as a structured record. No competitor has built this. No competitor can build it without starting over.

Document Warehouse — Live Query Interface

documentgateway.ai

Click to enlarge

WAREHOUSE

AI.DI Document Warehouse — Your Entire Document Corpus as Structured Data

Warehouse · 9,857 Documents · Live Query Interface

What looks like a document list is actually a live query interface into a structured database. Every row here is not a file reference — it is a record in PostgreSQL with typed columns for every field Abstract.DI extracted: classification type, confidence score, execution status, expiry date, party names, financial terms, compliance flags, and dozens more depending on document type. The Abstract.DI query bar at the top accepts natural language — "show all contracts expiring in Q1 with coverage below minimum" returns structured results because the underlying data is structured, not a keyword search across unstructured text. The six view modes (List, Gallery, Library, Cube, Time Series, Schema, Scientist) let you slice the same corpus as a compliance officer, a data analyst, an AI engineer, or a CFO — each seeing exactly the view that matches their workflow. This is the first document management system where the documents are a side effect of the real product: a continuously enriched, AI-maintained structured database of everything your organization has ever received, produced, or executed.

Warehouse Studio — BI Connectors

documentgateway.ai

Warehouse Studio — Zero-ETL Connections to Every Data Stack

Click to enlarge

WAREHOUSEDEVELOPERS

Warehouse Studio — Zero-ETL Connections to Every Data Stack

Warehouse · Snowflake · Databricks · 9 Connectors

The Document Warehouse is not a destination — it is a source of truth that feeds every analytics system your organization already uses. Snowflake receives 891 rows on a 15-minute incremental sync via Data Share — zero-copy, zero ETL, no pipeline to build or maintain. Databricks connects via Delta Lake for full and incremental refresh, enabling document intelligence to join with financial models, risk systems, and ML pipelines in the same compute environment. Webhooks fire on every document event — ingest, certify, extract, expire — enabling real-time triggers to any downstream HTTP endpoint. BigQuery, Redshift, Tableau, Power BI, dbt Cloud, and a Python SDK are available with one-click configuration. The data model is fully documented — every extracted field, every confidence score, every audit event — so data engineers can join document intelligence with any other enterprise dataset without discovering the schema by trial and error. The Warehouse is not just queryable. It is the most current, most complete, most structured view of your document estate that has ever existed.

Data Intelligence — Full Data Lineage Observability

documentgateway.ai

Data Lineage Map — Complete Provenance from Source to Consumer

Click to enlarge

WAREHOUSEDEVELOPERS

Data Lineage Map — Complete Provenance from Source to Consumer

Warehouse · Data Intelligence · 15 Nodes · 17 Connections

Every piece of intelligence in the platform has a traceable origin. The Data Lineage Map visualizes the complete path a document takes from source system through ingestion, processing, warehouse storage, and consumer delivery — with live status on every node. PDF Documents (693 ingested) flow through the Ingest Pipeline (1,000 processed), Deduplication (exact + fuzzy matching), and Fingerprinting (3 fingerprint types) before Abstract.DI extracts 13 abstraction fields. Those fields become Extraction Fields (queryable warehouse layer), Document Metadata (cross-asset search index), and Query Engine (PostgreSQL + custom SQL). Consumers — MCP Server, Snowflake, Webhooks — pull from the warehouse in real time. Stale nodes are visually flagged (Bridge Sync shows 0 fields synced, triggering immediate attention). This is not just observability — it is the audit trail that answers "where did this AI answer come from?" for every query, every extraction, and every alert the platform produces.

"The Finance director ran one query. Five years of contract values from 3,000 documents into Excel in 30 seconds. He looked up and said: 'We've been paying people to do this manually for twenty years.' That was the moment the platform sale closed itself."

— Warehouse Scientist Mode proof moment

Products · Tab 08

Millennia FileStar — The Installed Base That Cannot Be Replicated

45 enterprise clients. 7,500 active users. ~$1M ARR. 20+ years of institutional trust. The FileStar installed base is the most defensible competitive moat in the AI.DI portfolio — a client list that cannot be cold-called by any competitor, at any funding level.

Enterprise Clients

7,500

Active Users

~$1M

Current ARR

20+

Years of Trust

8-10

Phase 1 Upgrade Targets

Document Fabric & Lifecycle Governance

Structured lifecycle governance for any document type. Every document has a defined lifecycle: creation → review → approval → distribution → monitoring → archival. Configurable routing rules, approval chains, and escalation paths enforce this lifecycle.

Version control with full history. Compliance monitoring with expiry tracking. Audit trail on every action. Role-based access aligned with hierarchy.

The AI.DI Integration Pathway

FileStar governs documents. AI.DI makes them intelligent. FileStar-managed documents automatically flow through Sentry certification and Abstract.DI extraction without any workflow change for existing users. All FileStar metadata syncs to the AI.DI Warehouse.

Every FileStar client is one conversation away from the full AI.DI platform. No rip-and-replace. No migration project. No change management crisis.

The Permanent Competitive Advantage

Every FileStar client that upgrades to AI.DI is a client that could not have been won by a cold-start competitor — regardless of product quality, VC funding, or pricing. A startup raising $20M today cannot replicate a 20-year enterprise trust relationship with a client's CFO. This is structural moat, not temporary advantage.

Products · Tab 10

AI Orchestration & Agent Gateway — The Infrastructure That Makes LLMs Actually Work

AI.DI is not a competitor to LLMs. It is their prerequisite. Every enterprise deploying Copilot, GPT-4, Claude, or Gemini faces the same problem: the AI is only as good as the documents it reasons from. If documents are uncertified and unstructured — your AI hallucinates. AI.DI is the trusted document foundation that makes any LLM enterprise-grade.

Live MCP Server — AI Agent Gateway

documentgateway.ai

Click to enlarge

ORCHESTRATIONAI

Integration Studio — Live AI Agent Gateway

AI Orchestration · MCP Server + Connected AI Systems

This is the screen that enterprise AI teams have been waiting for. A live production MCP server exposes 6 certified tools to any MCP-compatible AI system — Claude, Cursor, LangChain, AutoGen, or any agent framework. The moment Claude.ai connects to this URL, it can search your certified document corpus, check compliance status on any asset, retrieve all obligations from any document set, run structured queries against the full Warehouse, navigate your org hierarchy, and retrieve signed access URLs for specific document versions. Every query enforces row-level security at the database layer — the AI agent cannot access documents the connected user is not authorized to see. Keys are revocable instantly. Usage is logged. This is not middleware or a wrapper — it is a purpose-built enterprise document intelligence API that treats your LLM as a trusted, auditable consumer of certified data rather than a summarizer of raw PDFs.

28 Connectors — Every System You Already Use

documentgateway.ai

Click to enlarge

ORCHESTRATIONAI

Integration Studio — 28 Enterprise Connectors

AI Orchestration · Full Connector Ecosystem

The most common objection to any new platform is "we already use X." AI.DI answers that objection by connecting to every X simultaneously. Yardi Voyager delivers leases and rent rolls directly into the ingestion pipeline. MRI Software pushes financial reports and lease abstracts. Argus Enterprise connects valuation models. Salesforce ingests CRM documents and contracts. The document management connectors (SharePoint, Google Drive, Box, OneDrive, Dropbox) mean AI.DI adds intelligence to your existing storage rather than replacing it. Dev and monitoring connectors (Sentry, Datadog, GitHub) ingest operational documents as first-class records. Data warehouse connectors (Snowflake, Databricks, BigQuery) make extracted document intelligence immediately available in every analytics environment you operate. The AI Wizard guides each connection in steps — no custom integration code, no IT project, no professional services. The platform meets your existing infrastructure where it lives.

Document IQ — AI-Powered Portfolio Intelligence

documentgateway.ai

Document IQ — Conversational AI Over a Certified Corpus

Click to enlarge

ORCHESTRATIONAI

Document IQ — Conversational AI Over a Certified Corpus

AI Orchestration · Portfolio-Wide AI Query Interface

Document IQ demonstrates what changes when AI has access to structured, certified data instead of raw files. A question like "What's missing from the vault?" is not a keyword search — it is a completeness calculation across every asset in the portfolio, comparing current documents against required schemas, returning a prioritized gap list with asset, document type, and days overdue. "Show critical risk items" aggregates violation flags, expiry warnings, and compliance alerts across the entire portfolio into a single prioritized view that would take a compliance analyst days to compile manually. The file upload capability takes any document — a counterparty rent roll, a vendor certificate, a financial statement — and cross-references it against vault records in real time, identifying discrepancies, missing correlations, and data conflicts without a human pulling comparison reports. This is the AI experience that becomes the reason nobody opens a legacy platform again.

Data Lineage — Full Provenance for Every AI Answer

documentgateway.ai

Click to enlarge

ORCHESTRATIONWAREHOUSE

Data Intelligence — Data Lineage Map

AI Orchestration · End-to-End Data Provenance

When an AI agent answers a question using AI.DI data, every element of that answer has a traceable origin. The Data Lineage Map shows the complete pipeline from source document to consumer — enabling any data engineer, compliance officer, or auditor to trace exactly how a specific piece of intelligence was produced, what transformations it passed through, and which source document it ultimately came from. This is the infrastructure that eliminates LLM hallucination risk: every answer the AI returns is backed by a certified document, a specific extraction, a confidence score, and a provenance chain. The stale node indicators (Bridge Sync showing 0 fields synced) surface data freshness issues proactively — you know before an AI answer is delivered whether the underlying data is current. Provenance is not an afterthought in AI.DI. It is the foundation.

Value & Strategy · Tab 10

Continuous Transaction Readiness™ — The Score That Runs Your Document Strategy

CTR is not a feature. It is a category-defining concept. It means your organization is always prepared to respond to a capital call, close an acquisition, satisfy a regulator, or distribute to an investor — because AI.DI monitors, scores, routes, and maintains your entire document estate in real time.

The Primary Value Statement

AI.DI gives your organization Continuous Transaction Readiness — the state where every document is accessible, authentic, aligned, and actionable at all times. Enterprises that achieve this state lower their cost of capital, reduce audit risk, deploy AI with confidence, and close transactions faster.

How CTR Is Calculated

Sample Asset CTR Score

84/100

Near Ready — Minor Gaps

23/26

Docs Present

Expiring Soon

Violation

4.2d

Avg Response

Five Weighted Dimensions

Document Completeness88/100

23 of 26 required document types present and valid

Document Validity & Freshness76/100

2 insurance certificates expire within 45 days

Compliance & Regulatory Status71/100

1 active violation: Certificate of Occupancy version mismatch

Distribution Readiness92/100

Document package deliverable to counterparty within 2 hours

Access & Permissioning Health97/100

All role assignments current. No orphaned access detected.

Score Interpretation

Score	Status	Typical Situation	Time to Transact
90–100	Transaction Ready	All documents present, current, valid. No violations.	48 hours
75–89	Near Ready	1–3 documents missing or expiring. No active violations.	1–5 business days
55–74	Attention Required	Multiple gaps or 1–2 violations.	2–4 weeks
35–54	Not Ready	Significant document gaps. Will not survive buyer diligence.	30–60 days
0–34	Critical	Severely incomplete or non-compliant documentation.	90+ days

Value & Strategy · Tab 11

For Data Scientists — The Document Intelligence Stack You've Been Waiting For

You've been asked to build AI on enterprise documents. You know what that means: unstructured PDFs, no provenance, wrong versions, 40% duplicates, PII everywhere, no reliable way to trace an LLM answer to a specific document. AI.DI is the infrastructure layer that solves every one of those problems — through every interface you already use.

What You're Actually Getting

AI.DI is not a document management UI with an API bolted on. It is a document intelligence data platform: a PostgreSQL warehouse of structured document intelligence, a live MCP server, a webhook event stream, a REST/GraphQL API, Snowflake Data Share, JDBC/ODBC direct access, vector embeddings on certified document chunks, and a 30-engine ML pipeline that improves continuously. Every document becomes structured, provenance-tracked, certified data — available to any model, pipeline, or analytics tool you're running.

The Data Model — What You're Querying

Table	Contents	Key Fields	Primary Use
`document_records`	Every document processed	id, original_name, document_type, workflow_status, asset_id, classification_confidence, storage_path	Document inventory, classification analysis
`extracted_fields`	Structured extraction from Abstract.DI	document_id, field_name, field_value, confidence_score, extraction_model, extraction_timestamp	Contract analytics, financial extraction
`sentry_fingerprints`	Cryptographic fingerprint records	document_id, fingerprint_hash, fingerprint_type, certified_at, version_chain, similarity_scores	Certification, duplicate detection, fraud monitoring
`hierarchy_nodes`	Full org hierarchy	id, parent_id, node_type, node_name, industry, ctr_score, completeness_pct	Portfolio analytics, CTR aggregation
`document_activity_log`	Every action on every document	document_id, event_type, actor_id, actor_role, timestamp, metadata	Audit trail, access pattern analysis
`vector_embeddings`	Embeddings on certified chunks	document_id, chunk_id, certified_version_hash, embedding_vector, model_version	Semantic search, RAG retrieval, clustering
`ctr_score_history`	CTR Score time series	node_id, score, dimension_scores, calculated_at, delta_from_prior	Readiness trending, portfolio benchmarking

Python SDK — Example Patterns

from aidi import DocumentWarehouse

client = DocumentWarehouse(api_key="YOUR_KEY", tenant_id="YOUR_TENANT")

# Query all Q1 2027 lease expirations across a portfolio — certified docs only

expirations = client.extractions.query(

  document_type="commercial_lease", field="expiration_date",

  date_range=("2027-01-01", "2027-03-31"), certified_only=True

)

# Get version-locked embeddings for RAG pipeline

embeddings = client.vectors.get_certified_chunks(

  document_ids=expirations.document_ids(), version_locked=True

)

# Subscribe to certification events for real-time model retraining

@client.events.on("document.certified", document_type="financial_statement")

async def on_new_financial_statement(event):

  extracted = await client.abstractions.get_fields(event.document_id)

  await my_model.retrain_incremental(extracted.to_feature_vector())

Value & Strategy · Tab 12

Any Industry. Any Complexity. Built for Scale.

AI.DI was not built for one vertical and adapted for others. The same engine that certifies a Blackstone real estate portfolio is equally compelling for a PE firm's data room, a hospital's compliance records, or a law firm's contract vault. The document problem is universal. So is the solution.

Unlimited Org Depth

Enterprise → Group → Entity → Asset → Unit → Counterparty. Any depth, any width, any industry. A 500-asset CRE fund, a 15-portfolio-company PE firm, a 200-branch bank — all map to the same hierarchy model with zero configuration overhead.

Edge Compute at Scale

Deno edge functions scale to zero when idle and to any volume on demand — same code handles 10 documents and 10 million. No ops team. No provisioning. No performance cliffs at scale.

Any File Type. Zero Exceptions.

PDF, DOCX, XLSX, PPTX, MSG/EML, CSV, ZIP, JPEG/PNG/TIFF scans, database records. No conversion required. No pre-processing. Whether a scanned fax or a native Word contract — AI.DI ingests, classifies, and extracts from all of it.

Commercial Real Estate

Asset managers, sponsors, operators across multifamily, office, industrial, retail, and mixed-use

Primary Market

Document Types

Title policies & ALTA surveys
Lease abstracts & leases
Insurance certificates
Environmental studies (Phase I/II)
Appraisals & BPOs
Loan documents & notes
Certificates of occupancy
Property management agreements

Key Use Cases

Acquisition due diligence
Loan closing packages
Lender covenant compliance
Insurance renewal management
Portfolio disposition readiness
LP reporting distributions

CTR Impact

Diligence prep: 6 weeks → 48 hours
Eliminate insurance gap incidents
Pre-qualify assets 12 months early
LP reports in 1 click, not 2 weeks
Close refinancings in half the time

Private Equity

GPs, fund managers, and portfolio operations teams managing company-level and fund-level documentation

Primary Market

Document Types

Fund formation documents
LP subscription agreements
Cap tables & equity agreements
Material contracts
Audited financials
Board minutes & resolutions
Exit transaction documents

Key Use Cases

Portfolio company exit readiness
LP capital call packages
Annual audit preparation
Co-investor reporting
Secondary transfer docs

CTR Impact

Exit prep starts 18 months early
LP Q&A response under 24 hours
Audit cycle cut 70%
Deal team on diligence, not hunting docs

Department-Level Entry Points

Department	Acute Pain	AI.DI Entry Product	Expansion Path
Legal / GC	Contract version disputes, discovery liability, GDPR compliance	Sentry certification + Document Gateway distribution	Full Document Warehouse for corporate legal corpus
Finance / Accounting	Audit prep fire drills, financial document reconciliation	Abstract.DI batch (financial extraction) + Blueprint audit	Sentry certification + Warehouse integration to ERP
Compliance / Risk	Regulatory filing tracking, compliance gaps, audit exposure	Sentry + Warehouse (compliance corpus) + CTR Score	Full platform across regulated document types
Transactions / Deal Team	Due diligence prep time, data room chaos	Document Gateway + Distribution Studio + Transaction Rooms	Abstract.DI batch for portfolio-wide extraction
IT / Data Engineering	Unstructured data not in Snowflake; LLM hallucinations	Document Warehouse + Snowflake + MCP Server	Full platform as enterprise document intelligence backbone
Operations / HR	Employee records, policy tracking, onboarding compliance	FileStar lifecycle governance + Abstract.DI HR extraction	Sentry certification + Document Gateway policy distribution

Get Started · Tab 13

Start with One Department. Get the Whole Platform.

AI.DI is not a pilot program with limited features. From your very first document, you have access to the complete platform — every engine, every view, every integration. We believe in earning your full commitment by delivering full capability from day one. Start small if you want. The platform is built for all — enormous portfolios and single-department deployments run on the exact same infrastructure.

The imkore Philosophy — Do Some or Do It All

The world's largest institutional real estate portfolios run on the same platform as a 12-asset regional operator starting their first compliance program. A single compliance officer in one department gets the same AI intelligence, the same CTR Score, the same Warehouse, the same MCP server as a 500-person investment management firm running 20 funds. We built for scale from day one — which means the smallest client gets the most powerful platform available at any price point. No feature tiers. No locked capabilities. No "upgrade to get the real thing."

Three Ways to Start — All Paths Lead to the Same Platform

Entry Path 01

Start with One Document Type

Pick your most painful document type — insurance certificates, leases, vendor contracts, financial statements. Run Abstract.DI on everything you have. Get a CTR Score on that category in 72 hours. See exactly what's missing, expiring, or wrong. The rest of the platform is right there when you're ready.

"We started with just our COIs. In three days we knew which assets were exposed. We hadn't done that audit in two years." — Property Operations Director

Entry Path 02

Start with One Department

Give legal, compliance, finance, or your deal team a standalone deployment. They get the full platform — just scoped to their hierarchy node and document types. No IT project. No enterprise rollout required. One steward, one asset group, full capability. When they prove ROI, the next department asks to join.

"Legal started it. Then finance wanted in. Then the deal team. We never ran a rollout — it spread itself." — Chief Operating Officer, PE Firm

Entry Path 03

Start with One Asset or Fund

Run a complete AI.DI deployment on a single asset or fund as a proof of concept with real production data. CTR Score goes live in 72 hours. Abstract.DI processes your existing archive in the first week. Distribution Studio sends your first LP package before the end of month one.

"We ran one asset. The CTR Score told us things we didn't know. That asset closed three months faster. Then we did the whole portfolio." — Managing Director, CRE Manager

imkore Blueprint — The Highest-Confidence Entry Point

Advisory Service · $50–150K · 60–90 Days

imkore Blueprint — Document Intelligence Audit & Readiness Roadmap

Blueprint evaluates your entire document ecosystem — every repository, every system, every process — and delivers a scored readiness assessment and a prioritized AI.DI product roadmap. Blueprint invariably reveals exactly which products the client needs and why. The roadmap we deliver IS the AI.DI implementation plan for your organization.

Discover

Map all document repositories across all systems

→

Assess

Evaluate governance, structure, and integrity

→

03-04

Classify + Validate

Standardize taxonomies, confirm authenticity, remove duplicates

→

05-06

Structure + Certify

Apply metadata conventions, establish certified records

→

07-08

Enable + Optimize

Prepare for AI and automation, maintain Continuous Transaction Readiness

Plans

Tier 01

Foundation

For teams managing one department, fund, or asset group getting organized and transaction-ready for the first time.

Custom

Based on asset count and user seats.

+Up to 25 assets — full platform

+Check-In Studio with Abstract.DI

+CTR Score Dashboard

+Distribution Studio

+Up to 10 user seats

Contact Sales