
NVIDIA Nemotron 3 Ultra Arrives on AWS SageMaker
AWS has made NVIDIA's Nemotron 3 Ultra model available on Amazon SageMaker JumpStart with one-click deployment. The 550-billion-parameter model uses a hybrid Transformer-Mamba architecture that activates only 55 billion parameters per forward pass, delivering 5x faster inference and up to 30% lower costs for agentic AI workloads. The model supports up to 1 million token context length and is optimized for NVFP4 precision format.
