MIT: The AI Agent Index
The MIT AI Agent Index is a public database that catalogs agentic AI systems (tools capable of planning and executing tasks with minimal human oversight) by detailing their technical components, applications, and risk management practices. It reveals that most systems are developed in the USA, mainly by companies, and that most specialize in software engineering or computer use. While many projects release code and documentation, information on safety policies and external evaluations remains limited.
Summary
The AI Agent Index is a newly created public database documenting agentic AI systems. These systems, which plan and execute complex tasks with limited human oversight, are increasingly being deployed in various domains.
The index details each system’s technical components, applications, and risk management practices based on public data and developer input. An analysis of the data shows ample information on agentic systems' capabilities and applications. However, the authors found limited transparency regarding safety and risk mitigation.
The authors aim to provide a structured framework for documenting agentic AI systems and to improve public awareness. The index sheds light on the geographical spread, academic versus industry development, openness, and risk management of agentic systems.
The five most important takeaways from the AI Agent Index, with added details, are:
- The AI Agent Index is a public database designed to document key information about deployed agentic AI systems. It covers each system’s components, application domains, and risk management practices. The index aims to fill a gap by providing a structured framework for documenting the technical, safety, and policy-relevant features of agentic AI systems. The AI Agent Index is available at https://aiagentindex.mit.edu/.
- Agentic AI systems are being deployed at an increasing rate. Systems that meet the inclusion criteria have had initial deployments dating back to early 2023, with approximately half of the indexed systems deployed in the second half of 2024.
- Most indexed systems are developed by companies located in the USA, and most specialize in software engineering and/or computer use. Of the 67 indexed agents, 45 were created by developers in the USA, and 74.6% specialize in either software engineering or computer use. While most agentic systems are developed by companies, a significant fraction come from academia: 18 (26.9%) are academic, while 49 (73.1%) are from companies.
- Developers are relatively forthcoming about details related to usage and capabilities. The majority of indexed systems have released code and/or documentation. Specifically, 49.3% release code, and 70.1% release documentation. Systems developed as academic projects are released with a high degree of openness, with 88.8% releasing code.
- There is limited publicly available information about safety testing and risk management practices. Only 19.4% of indexed agentic systems disclose a formal safety policy, and fewer than 10% report external safety evaluations. Most of the systems that have undergone formal, publicly-reported safety testing are from a small number of large companies.
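The percentages in the takeaways above can be cross-checked against the stated total of 67 agents. The following is a minimal sketch, not part of the index itself: the function names `share` and `implied_count` are hypothetical, and the implied counts assume each percentage was computed over all 67 indexed systems, which the summary does not state explicitly.

```python
TOTAL_AGENTS = 67  # total number of indexed agents stated in the summary

# Counts given explicitly in the summary.
reported_counts = {
    "developed in the USA": 45,
    "academic": 18,
    "company": 49,
}

# Percentages given without a count in the summary.
reported_percentages = {
    "software engineering or computer use": 74.6,
    "release code": 49.3,
    "release documentation": 70.1,
    "disclose a formal safety policy": 19.4,
}


def share(count, total=TOTAL_AGENTS):
    """Return count as a percentage of total, rounded to one decimal."""
    return round(100 * count / total, 1)


def implied_count(percentage, total=TOTAL_AGENTS):
    """Return the nearest whole number of systems implied by a percentage.

    Assumption: the percentage was computed over all `total` systems.
    """
    return round(percentage / 100 * total)


for label, count in reported_counts.items():
    print(f"{label}: {count}/{TOTAL_AGENTS} = {share(count)}%")

for label, pct in reported_percentages.items():
    print(f"{label}: {pct}% is about {implied_count(pct)} of {TOTAL_AGENTS}")
```

Under that assumption, the academic and company shares reproduce the 26.9% and 73.1% figures quoted above, and the remaining percentages correspond to whole-number counts of systems.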