NVIDIA NeMo Agent Toolkit (NAT) is a framework-agnostic library for building AI agents. It uses a configuration-driven approach where you define agents, tools, and workflows in YAML files. NAT works alongside existing frameworks like LangChain and LlamaIndex, adding capabilities like memory and observability without modifying your agent code.
See NVIDIA’s official documentation: Auto Memory Wrapper
The Zep integration for NAT uses the automatic memory wrapper — a general-purpose wrapper that adds memory capabilities to any NAT agent. Rather than requiring agents to explicitly call memory tools, the wrapper intercepts agent invocations and handles memory operations transparently.
This approach guarantees that all conversations are captured and relevant context is retrieved, regardless of which agent type you use or how the agent is implemented.
Traditional tool-based memory depends on the agent choosing to invoke memory tools, so a turn can go unrecorded, or miss relevant context, whenever the agent skips the call. The auto memory wrapper provides:
Package information:
nvidia-nat-zep-cloud>=3.11, <3.13
Create a configuration file that defines the Zep memory backend and wraps your agent with automatic memory:
This configuration wraps a ReAct agent with automatic memory. Every user message and agent response is captured in Zep, and relevant context is retrieved before each agent call.
The auto memory wrapper intercepts agent invocations and handles memory operations in this sequence:
The wrapped agent is unaware of memory operations — it simply receives enriched context and produces responses.
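The interception pattern can be illustrated with a minimal Python sketch. This is not the NAT or Zep implementation: `InMemoryStore` and `wrap_with_memory` are hypothetical names, the store is a plain dictionary standing in for a memory backend, and the ordering of the retrieve and store steps is an assumption. It only shows how a wrapper can enrich the input and record both sides of a turn while the inner agent stays an ordinary function.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class InMemoryStore:
    """Hypothetical stand-in for a memory backend such as Zep."""

    messages: dict[str, list[tuple[str, str]]] = field(default_factory=dict)

    def save(self, user_id: str, role: str, content: str) -> None:
        # Append one message to this user's history.
        self.messages.setdefault(user_id, []).append((role, content))

    def retrieve(self, user_id: str) -> str:
        # A real backend would return relevant context; this sketch simply
        # joins every earlier turn for the user.
        history = self.messages.get(user_id, [])
        return "\n".join(f"{role}: {content}" for role, content in history)


def wrap_with_memory(
    agent: Callable[[str], str], store: InMemoryStore
) -> Callable[[str, str], str]:
    """Return a callable that adds memory around an unmodified agent."""

    def wrapped(user_id: str, user_message: str) -> str:
        context = store.retrieve(user_id)           # fetch earlier context (assumed order)
        store.save(user_id, "user", user_message)   # capture the user message
        prompt = f"{context}\n{user_message}" if context else user_message
        reply = agent(prompt)                       # the agent only sees enriched input
        store.save(user_id, "assistant", reply)     # capture the agent response
        return reply

    return wrapped
```

Calling `wrap_with_memory(my_agent, InMemoryStore())` returns a function that takes a user ID and a message. The inner `my_agent` receives a single string and never touches the store, which mirrors the point that the wrapped agent is unaware of memory operations.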
All flags default to true:
Configure memory retrieval and storage behavior:
Zep automatically isolates memory by user. User IDs are extracted in this priority:
1. user_manager.get_id(): production with custom auth middleware (recommended)
2. X-User-ID HTTP header: testing without middleware
3. "default_user": fallback for local development
For production deployments, implement a custom user_manager that extracts user IDs from your authentication system.
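The priority order can be sketched as a small resolver function. This is an illustration rather than the toolkit's actual code: `resolve_user_id` and the `UserManager` protocol are hypothetical, and falling through to the next source when `get_id()` returns an empty value is an assumption.

```python
from typing import Mapping, Optional, Protocol


class UserManager(Protocol):
    """Hypothetical interface: anything exposing get_id()."""

    def get_id(self) -> Optional[str]: ...


DEFAULT_USER_ID = "default_user"


def resolve_user_id(
    user_manager: Optional[UserManager], headers: Mapping[str, str]
) -> str:
    # 1. Prefer the ID supplied by custom auth middleware.
    if user_manager is not None:
        user_id = user_manager.get_id()
        if user_id:  # assumption: an empty result falls through
            return user_id
    # 2. Fall back to the X-User-ID header (HTTP header names are
    #    case-insensitive, so compare in lowercase).
    for name, value in headers.items():
        if name.lower() == "x-user-id" and value:
            return value
    # 3. Last resort for local development.
    return DEFAULT_USER_ID
```

With no user manager and no header, every request resolves to the same "default_user", so all conversations share one memory space. That is convenient locally but is the reason a real user_manager matters in production.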
The auto memory wrapper works with any NeMo agent type: