What if your AI infrastructure unlocked the true potential of your enterprise data? With Cisco, NVIDIA, and VAST Data, you can accelerate retrieval-augmented generation (RAG) pipelines, streamline data movement, and scale agentic AI across your organization. Together, we provide a validated, enterprise-class AI data platform—so you can move beyond experimentation and deliver real business outcomes with AI.
Enterprises are shifting from experimenting with AI to deploying real, production-scale agentic AI systems. But success requires solving one of the hardest problems: making the right data available at the right time. Legacy storage architectures, siloed compute, and slow retrieval mechanisms limit the effectiveness of RAG and generative AI models.
Cisco, NVIDIA, and VAST Data have partnered to deliver a validated AI data platform designed specifically for enterprise-scale AI. Built on Cisco UCS® servers and Cisco AI PODs, integrated with VAST InsightEngine, powered by NVIDIA AI Enterprise, and based on the NVIDIA AI Data Platform reference design, this solution forms the foundation of the Cisco Secure AI Factory with NVIDIA. It is the first enterprise architecture that unifies compute, fabric, and storage into a single, validated platform to accelerate RAG, retrieval, and agentic AI workflows at scale.
VAST InsightEngine
NVIDIA AI Enterprise – AI stack for the AI data platform
● Accelerate RAG pipelines: VAST InsightEngine and NVIDIA AI Data Platform streamline data retrieval to feed large language models with enterprise knowledge.
● Unlock agentic AI: AI agents are able to operate autonomously with fast, accurate, and contextual data pipelines.
● Get a fully validated architecture: The platform is built on Cisco AI PODs, the foundation of Cisco Secure AI Factory with NVIDIA, ensuring reliability and scalability.
● Unify your infrastructure: Cisco UCS, Cisco Nexus® Hyperfabric networking, and VAST InsightEngine integrate into a seamless, enterprise-class AI fabric.
● Get enterprise-ready security and scale: The platform is built with Cisco’s Secure AI Factory principles to ensure governance, resilience, and compliance.
AI is moving from models to agents
The shift from LLM experimentation to agentic AI systems is underway. Gartner predicts that by 2026, over 30 percent of enterprises will deploy AI agents to drive automation and decision-making. But to succeed, enterprises must solve the data problem—how to make vast, unstructured, and distributed data immediately useful to AI systems.
Key challenges include:
● Data retrieval bottlenecks: Traditional storage cannot meet AI inference latency requirements.
● Fragmented architectures: Disconnected storage, compute, and fabric introduce inefficiency.
● Scalability risks: AI pipelines break down as workloads and data sizes expand.
Cisco, NVIDIA, and VAST Data together eliminate these barriers by integrating compute, storage, and AI frameworks into a validated AI data platform that accelerates enterprise adoption of agentic AI.
“Cisco, NVIDIA, and VAST Data deliver one of the first NVIDIA AI Data Platform validated designs that accelerates RAG and unlocks agentic AI for the enterprise.”
“With VAST InsightEngine and NVIDIA AI Data Platform running on Cisco AI PODs, enterprises can finally bring AI agents to life at scale.”
The AI data platform is part of Cisco Secure AI Factory with NVIDIA, which provides a framework for governance, lifecycle management, and automation.
Cisco AI PODs provide the AI servers and networking that serve as the validated compute and networking building blocks for NVIDIA AI Data Platform:
● RTX PRO Server from Cisco
(Cisco UCS C845A M8 Rack Servers, NVIDIA BlueField equipped)
● VAST InsightEngine
Provides a unified data engine optimized for AI retrieval and pipelines.
VAST Accelerated Core SW License:
◦ Includes VAST OS and VAST InsightEngine services
◦ Is fully supported and managed by VAST
◦ Includes VAST container orchestration (Rancher)
● Networking
Cisco Nexus 9000 Series Switches with NVIDIA Spectrum-X Ethernet Technology
● Storage and data platform
● VAST Data Platform on Cisco UCS (Cisco Ebox, Cisco UCS C225A M8 All-NVMe Node, NVIDIA BlueField equipped)
● VAST core SW license:
◦ Includes VAST InsightEngine and all VAST DataBase and VAST DataEngine functionalities
◦ Is fully supported and managed by VAST
● VAST capacity software license:
◦ Includes VAST InsightEngine and covers VAST DataStore capabilities (including NFS, S3, SMB, and NVMe/TCP)
● NVIDIA AI Enterprise for AI model training, inference, and data pipeline orchestration
Key capabilities
● Enterprise RAG acceleration: Seamlessly retrieve and enrich enterprise data to fuel generative AI with accurate knowledge.
● Agentic AI enablement: Provide AI agents with high-performance data retrieval and orchestration for autonomous workflows.
● Validated designs: NVIDIA AI Data Platform is the validated design that connects Cisco AI PODs and VAST InsightEngine.
● Cloud-managed fabric: Use Cisco Nexus Hyperfabric for integrated, secure, and automated networking.
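To make the retrieval step behind the RAG capability above concrete, here is a minimal sketch in Python. It is illustrative only: it does not use VAST InsightEngine, NVIDIA AI Enterprise, or any Cisco API. The corpus, the document IDs, and the function names (`retrieve`, `build_prompt`) are hypothetical, and a simple bag-of-words cosine similarity stands in for whatever embedding and vector search a production platform would use.

```python
import math
import re
from collections import Counter

# Hypothetical toy corpus standing in for enterprise documents.
DOCUMENTS = {
    "policy-001": "Employees may work remotely up to three days per week.",
    "it-042": "Reset your VPN password through the self-service portal.",
    "fin-007": "Expense reports must be submitted within 30 days of travel.",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, top_k=1):
    """Return the IDs of the top_k documents most similar to the query."""
    q = Counter(tokenize(query))
    scored = [
        (cosine(q, Counter(tokenize(text))), doc_id)
        for doc_id, text in documents.items()
    ]
    # Highest score first; ties broken by document ID for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for score, doc_id in scored[:top_k] if score > 0]


def build_prompt(query, documents, top_k=1):
    """Assemble the retrieved context and the question into one prompt."""
    ids = retrieve(query, documents, top_k)
    context = "\n".join(f"[{doc_id}] {documents[doc_id]}" for doc_id in ids)
    return f"Context:\n{context}\n\nQuestion: {query}"


if __name__ == "__main__":
    print(build_prompt("How do I reset my VPN password?", DOCUMENTS))
```

In a real deployment, the prompt produced by `build_prompt` would be sent to a language model. The sketch stops before that step because the model endpoint depends on the environment.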
Models and options
● AI data platform reference design: The first NVIDIA AI Data Platform reference design built on VAST InsightEngine and Cisco AI PODs.
● Turnkey AI PODs: Cisco Secure AI Factory PODs with full-stack integration of compute, networking, and storage.
Cisco AI POD architecture supporting NVIDIA AI Data Platform
Table 1. Use cases
| Industry | Use case |
| --- | --- |
| Financial services | Accelerate fraud detection with agentic AI models retrieving real-time transaction data. |
| Healthcare | Empower clinical assistants with AI agents accessing genomic and imaging datasets. |
| Retail and e-commerce | Use RAG pipelines to deliver hyper-personalized recommendations at scale. |
| Manufacturing | Drive predictive maintenance and AI-guided automation using agentic AI with sensor data. |
| Enterprise IT | Deploy enterprise-wide AI assistants that retrieve knowledge from documents, policies, and systems. |
Use Cisco services to operationalize your AI data platform
Cisco services can help enterprises plan, deploy, and manage VAST InsightEngine on Cisco AI PODs with NVIDIA AI Data Platform. From readiness assessments to validated design deployment, Cisco experts accelerate adoption while reducing complexity. Lifecycle services ensure that your AI infrastructure evolves with your business needs.
Cisco, NVIDIA, and VAST offer a validated solution designed to enable faster data extraction and retrieval for agentic AI workflows.
VAST InsightEngine is one of the first storage solutions to offer an NVIDIA AI Data Platform reference design built on Cisco AI PODs, the AI infrastructure building block of Cisco Secure AI Factory.
Cisco Secure AI Factory with NVIDIA provides a validated architecture to speed enterprise AI adoption, no matter the use case.
Flexible payment solutions to help you achieve your objectives
Cisco Capital makes it easier to get the right technology to achieve your objectives, enable business transformation, and help you stay competitive. We can help you reduce the total cost of ownership, conserve capital, and accelerate growth. In more than 100 countries, our flexible payment solutions can help you acquire hardware, software, services, and complementary third-party equipment in easy, predictable payments.
Cisco is uniquely positioned to operationalize agentic AI at scale. With Cisco Secure AI Factory with NVIDIA, enterprises gain validated AI infrastructure integrated with VAST InsightEngine and NVIDIA AI Data Platform. Only Cisco offers the end-to-end stack—from compute and fabric to storage and lifecycle services—validated with industry leaders to accelerate AI adoption securely and at scale.
Unlock the power of agentic AI with Cisco, NVIDIA, and VAST Data.
Learn more about Cisco Secure AI Factory with NVIDIA at https://www.cisco.com/site/us/en/solutions/artificial-intelligence/index.html