New NVIDIA Nemotron 3 Super Delivers 5x Higher Throughput for Agentic AI: Nemotron 3 Super agentic AI model
NVIDIA’s latest open 120B-parameter model targets long-horizon, multi-agent workflows with a hybrid Mamba–transformer design, sparse MoE routing, multi-token prediction, and a 1M-token context window — aiming to cut inference costs while boosting accuracy and throughput.





