Make.com banner

ServiceNow Launches EnterpriseOps-Gym for AI Models

This article was generated with the help of AI and may contain errors.

ServiceNow Research has introduced EnterpriseOps-Gym, a new benchmark for evaluating autonomous AI agents in professional environments. This tool addresses the lack of standards for long-term planning and complex workflows.

EnterpriseOps-Gym: New Benchmark for AI Agents

EnterpriseOps-Gym is designed to assess agentic planning in realistic enterprise settings. It features a containerized Docker platform that simulates eight critical business domains, including customer service and HR.

The benchmark includes 164 relational databases and 512 functional tools, demonstrating that current AI models have a success rate below 40%. This tool can help identify and close performance gaps in AI technology for professional use.

Source: MarkTechPost

Read also: Mistral Forge Allows Companies to Create Their Own AI Models