AI Data Unification

Unify your HRIS, CRM, ticketing, and warehouse into one knowledge graph every enterprise agent reasons over — on-premise, over MCP, with no data extraction. You own all the code and data.

No Data ExtractionQueries your systems in place over MCP
Read-Only & Role-ScopedEntra ID identity, 7-test safety suite
Open Source, You Own ItMIT-licensed, self-hosted in your network
1.6M+Platform Users
400+Organizations
Syracuse UniversitySUNYGoogleNVIDIAKaplan

MCP Architecture Guide for Enterprise

How to connect your corporate systems (HRIS, LMS, CRM, ticketing, knowledge base) to AI agents — step by step.

Read the Guide

AI Data Unification for Corporate L&D — One Knowledge Layer Every Agent Reasons Over

For enterprise teams, the records that AI agents need are scattered across an HRIS, a CRM, a ticketing system, a knowledge base, and a data warehouse. AI Data Unification maps them into one living knowledge graph — an ontology agents reason over — exposed over the Model Context Protocol (MCP) without extracting a single record to the cloud.

You own all the code and the data. The unified layer runs inside your network, behind your firewall and identity provider, and every agent — ibl.ai, Claude, Cursor, or your own apps — queries it scoped to the caller's role.

What This Is

AI Data Unification connects Snowflake, Salesforce, Workday, ServiceNow, Jira, Confluence, and Okta into a single ontology — a structured representation of your organization that agents can reason over.

It has two layers: a semantic layer (the nouns — entities, attributes, relationships) and an operational layer (the verbs — actions, functions, permissions). Together they become the single source of truth every agent reads from.

Unify once, reuse everywhere

Model an entity once and the tenth agent costs a fraction of the first, because it inherits the same definitions, the same data, and the same governance.

The implementation is open source at github.com/iblai/ontology — MIT-licensed, Python 3.11+, 163 passing tests. ibl.ai is family-owned and operated from New York, NY: a long-term partner, not a vendor that ships a license and disappears.

The Two Layers

The Semantic Layer — the nouns

Entity types model the real-world objects in your organization — employees, teams, accounts, deals, tickets, and projects. Attributes capture their characteristics, and relationships define how they connect. An employee belongs to a team; a deal belongs to an account; an approval is an action with rules.

This is the shared vocabulary your domain experts already use, made machine-readable so agents reason over the same reality your people do.

The Operational Layer — the verbs

Actions define the permissible changes an agent can make. Functions encode logic. Permissions govern who — human or agent — can do what.

Agents inherit the same permissions as the users they serve, so they can only act within your existing rules. Every action is captured as data, so audit trails are built in, not bolted on.

Five Core Principles

Shared understanding at scaleTeams operate from unified definitions instead of siloed, per-system data. Local decisions gain global context because every agent and every department reads from the same graph.
Human-readable by designThe ontology is expressed in familiar business terminology, not a technical schema. Domain experts can explore and validate it directly, bridging technical and non-technical teams.
Build once, reuse everywhereA properly modeled entity serves every application that needs it. The cost of launching the tenth agent is a fraction of launching the first, because the knowledge is already there.
Capture decisions as dataActions are recorded and become queryable knowledge. Audit trails and decision history are a property of the system, not an afterthought you instrument later.
Ground AI in organizational realityAgents operate on the same knowledge humans use. Grounding answers in your real entities and relationships removes the translation layer where hallucinations creep in.

How the Unified Layer Works

Inbound connections

Databases connect through the Google MCP Toolbox; REST-based systems connect through custom MCP servers, defined in tools.yaml. For enterprise teams, that means Snowflake, Salesforce, Workday, ServiceNow, Jira, Confluence, and Okta.

Source credentials never leave your network. Each inbound connector has isolated credential scope, with secrets encrypted at rest.

Knowledge materialization

Data is synced into three layers: text memories in Markdown, a Postgres cache, and vector embeddings for semantic search.

The unified graph stays fresh on a schedule you control.

Outbound — one MCP server

The unified layer exposes itself as a single MCP server over HTTPS, behind your firewall and identity provider.

Any authorized runtime — ibl.ai, Claude, Cursor, or your own apps — connects to it, and results are scoped to the caller's identity.

Security & Governance

Read-only enforcement

Version 1 is read-only by design. Before provisioning, a seven-test safety suite verifies that every write attempt is denied; if any write succeeds, provisioning halts and emits remediation SQL.

Agents can read your systems — they cannot mutate them.

Identity & accessEvery MCP request carries the user's Microsoft Entra ID JWT. The gateway validates it and resolves the caller's role from roles.yaml, so an agent sees exactly what that user is allowed to see — row-level scoping included, mapped to your SOC 2 obligations.
Permissions inherited from your peopleSchema and data permissions are governed centrally. Agents inherit the same permissions as the users they serve, so access control is consistent whether a human or an agent is asking.

Built-In Service Catalog

Pre-configured connectors for Corporate L&DThe catalog ships defaults for Snowflake, Salesforce, HubSpot, Workday, ServiceNow, Jira, Confluence, GitHub, Okta, Slack, Zendesk, and Zoom. Seed any of them with one command (ontology service add --from <key>), then test and provision from the CLI.
Provision from the CLIA single CLI covers the whole lifecycle — service discovery and testing, schema analysis, role validation, scheduled sync, and Docker Compose deployment. Stand the layer up, test it, and ship it from your terminal.

Full Ownership

Your code & dataThe entire unified knowledge layer is open source. Records stay in the systems you already run — the layer queries them in place; it never copies them to ibl.ai or any third party. "You will have your data, and we may not even have access to it."
Your infrastructureThe MCP server, the Postgres cache, and the vector index all run inside your perimeter. Deploy it in the cloud you choose, in your VPC, on-premise, or fully air-gapped.
Built on Agentic OSAI Data Unification is the knowledge foundation the rest of the ibl.ai platform reasons over. Pair it with Agentic OS to deploy agents that act on the unified graph — all owned and self-hosted by you.

Get Started

Review the code:Read the implementation, the safety suite, and the service catalog at github.com/iblai/ontology — open source and MIT-licensed.
Talk to us:Tell us which of your corporate l&d systems you want to unify, and we will map the ontology and provisioning plan with you.

Your systems, your graph, your agents — one unified knowledge layer, owned end to end and never extracted to the cloud.

What our partners say about us

Chris Gabriel

Chris Gabriel | Google

Lorena Barba

Lorena Barba | George Washington University

Dr. Juana Mendenhall

Dr. Juana Mendenhall | Morehouse College

Juile Diop

Juile Diop | MIT

Adam Tetelman

Adam Tetelman | Nvidia

Jason Dom

Jason Dom | American Public University System

Benjamin Breyer

Benjamin Breyer | Columbia University

Ken Fujiuchi

Ken Fujiuchi | SUNY

Erika Digirolamo

Erika Digirolamo | Monroe College

David Flaten

David Flaten | SUNY

David Vise

David Vise | Modern States Education Alliance

Linda Wood

Linda Wood | ARM Institute (U.S. Department of Defense)

Chris Gabriel

Chris Gabriel | Google

Lorena Barba

Lorena Barba | George Washington University

Dr. Juana Mendenhall

Dr. Juana Mendenhall | Morehouse College

Juile Diop

Juile Diop | MIT

Adam Tetelman

Adam Tetelman | Nvidia

Jason Dom

Jason Dom | American Public University System

Benjamin Breyer

Benjamin Breyer | Columbia University

Ken Fujiuchi

Ken Fujiuchi | SUNY

Erika Digirolamo

Erika Digirolamo | Monroe College

David Flaten

David Flaten | SUNY

David Vise

David Vise | Modern States Education Alliance

Linda Wood

Linda Wood | ARM Institute (U.S. Department of Defense)

Frequently Asked Questions