Nemotron-3 Nano Omni
NVIDIA's 30B-A3B open multimodal model designed as a perception and context sub-agent for enterprise agent systems. Accepts text, image, video, and audio inputs with built-in chain-of-thought reasoning.
🧮
How many r's are in the word 'strawberry'?
🤖
What makes a good perception sub-agent in an enterprise AI system?