Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech
https://developer.nvidia.com/blog/evaluate-clinical-asr-models-faster-with-agent-skills-and-nvidia-nemotron-speech/
NVIDIA Technical Blog
Training a speech AI model to correctly recognize or synthesize clinical terminology is surprisingly difficult. Drug names like Acetaminophen, Amlodipine, Cefazolin, and Biktarvy are not part of…
Accelerating Federated Learning Research with AI Agents and NVIDIA FLARE Auto-FL
https://developer.nvidia.com/blog/accelerating-federated-learning-research-with-ai-agents-and-nvidia-flare-auto-fl/
NVIDIA Technical Blog
Federated learning (FL) research often begins with a deceptively simple question: What should we try next? A new aggregation rule, a FedProx coefficient, a server optimizer setting, a SCAFFOLD variant…
Model Quantization: Turn FP8 Checkpoints into High-Performance Inference Engines with NVIDIA TensorRT
https://developer.nvidia.com/blog/model-quantization-turn-fp8-checkpoints-into-high-performance-inference-engines-with-nvidia-tensorrt/
NVIDIA Technical Blog
Converting a quantized checkpoint into an NVIDIA TensorRT engine bridges the gap between model optimization and production deployment, enabling faster inference, higher throughput…
Delivering Lifecycle Control for AI Infrastructure at Scale with NVIDIA DGX Spark Enterprise Manageability
https://developer.nvidia.com/blog/delivering-lifecycle-control-for-ai-infrastructure-at-scale-with-nvidia-dgx-spark-enterprise-manageability/
NVIDIA Technical Blog
As AI infrastructure scales, enterprise expectations for operational maturity are increasing. Organizations expect these systems to be provisionable, observable, secure, and manageable at scale—the…
NVIDIA Confidential Computing to Help Expand Apple’s Private Cloud Compute
https://blogs.nvidia.com/blog/nvidia-confidential-computing-apple-private-cloud-compute/
NVIDIA Blog
NVIDIA GPUs to support server-side inference for Apple Intelligence, announced at WWDC.
Designing Production-Ready Battery Energy Storage Systems for AI Factories
https://developer.nvidia.com/blog/designing-production-ready-battery-energy-storage-systems-for-ai-factories/
NVIDIA Technical Blog
AI factories are changing what data-center infrastructure must do. Unlike traditional data centers, AI factories are built to manufacture intelligence at scale. They run power-dense training and…
Run DiffusionGemma on NVIDIA for Developer-Ready, High-Throughput Text Generation
https://developer.nvidia.com/blog/run-diffusiongemma-on-nvidia-for-developer-ready-high-throughput-text-generation/
NVIDIA Technical Blog
Developers building real-time AI—such as chat assistants, copilots, and agentic workflows—are often constrained by token-by-token generation speed. This limits responsiveness, increases serving costs…
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
https://blogs.nvidia.com/blog/rtx-ai-garage-local-gemma-diffusion/
NVIDIA Blog
The new DiffusionGemma open model generates text in parallel — not one token at a time — and is optimized to run on the NVIDIA RTX PRO platform, NVIDIA DGX Spark systems and GeForce RTX GPUs.
For Robotaxis, Safety Must Be Built In, Not Bolted On
https://blogs.nvidia.com/blog/halos-os-robotaxi-safety/
NVIDIA Blog
NVIDIA Halos OS delivers safety-certified platform software, standardized interfaces, AI guardrails, and pre-deployment validation for L4 robotaxi deployments at scale.
Save Big and Play Bigger: GeForce NOW Summer Sale Brings Major Membership Savings
https://blogs.nvidia.com/blog/geforce-now-thursday-summer-sale-2026/
NVIDIA Blog
Level up with the best value in gaming, ‘Guild Wars’ and rewards, plus eight new games this week.
One-Click Multi-Tenant Security with NVIDIA Quantum InfiniBand
https://developer.nvidia.com/blog/one-click-multi-tenant-security-with-nvidia-quantum-infiniband/
NVIDIA Technical Blog
NVIDIA Quantum InfiniBand now offers intent-based security profiles in Unified Fabric Manager (UFM) that enable multi-tenant fabric security in a single click. NVIDIA Quantum InfiniBand supports three…
Deploy Long-Context Reasoning and Agentic Workflows with MiniMax M3 on NVIDIA Accelerated Infrastructure
https://developer.nvidia.com/blog/deploy-long-context-reasoning-and-agentic-workflows-with-minimax-m3-on-nvidia-accelerated-infrastructure/
NVIDIA Technical Blog
As enterprise AI adoption scales, developers are increasingly forced to stitch together fragmented pipelines—separate models for text, vision, and code—leading to added complexity, higher costs…
NVIDIA Blackwell Leads on First Agentic AI Infrastructure Benchmark
https://blogs.nvidia.com/blog/nvidia-blackwell-agentperf-artificial-analysis/
NVIDIA Blog
New AgentPerf results from Artificial Analysis show how accelerated computing systems handle real-world agentic workloads, with NVIDIA GB300 NVL72 running up to 20x more agents per megawatt than NVIDIA Hopper.
NVIDIA Achieves Leading Agentic Coding Performance on First Agentic AI Benchmark
https://developer.nvidia.com/blog/nvidia-achieves-leading-agentic-coding-performance-on-first-agentic-ai-benchmark/
NVIDIA Technical Blog
AI agents have fundamentally changed the complexity of inference workloads. Until now, the industry has struggled to define a standard for measuring how inference systems perform under these…
Pretrained to Imagine, Fine-Tuned to Act: The Rise of World-Action Models
https://developer.nvidia.com/blog/pretrained-to-imagine-fine-tuned-to-act-the-rise-of-world-action-models/
NVIDIA Technical Blog
Quick glossary for readers new to VLA/WAM terminology. VLA (Vision-Language-Action model): a robot policy that starts from a pretrained VLM backbone and adapts it to generate actions from visual…
Boosting MoE Training Throughput with Advanced Fusion Kernels
https://developer.nvidia.com/blog/boosting-moe-training-throughput-with-advanced-fusion-kernels/
NVIDIA Technical Blog
Mixture-of-experts (MoE) models have quickly become a foundational component of modern, large-scale AI systems. They are widely adopted because they enable substantially larger model capacity while…
Fine-Tuning Biological Foundation Models with LoRA Using NVIDIA BioNeMo Recipes
https://developer.nvidia.com/blog/fine-tuning-biological-foundation-models-with-lora-using-nvidia-bionemo-recipes/
NVIDIA Technical Blog
Foundation models are reshaping computational biology. Pretrained on massive corpora of protein or genomic sequences, models such as ESM2 (a protein language model) and Evo 2 (a DNA language model)…
NVIDIA Blackwell Tops MLPerf Training 6.0 with Industry-Leading Scale and Performance
https://developer.nvidia.com/blog/nvidia-blackwell-tops-mlperf-training-6-0-with-industry-leading-scale-and-performance/
NVIDIA Technical Blog
NVIDIA delivered a clean sweep in MLPerf Training v6.0, the latest edition of industry-standard AI training benchmarks developed by the MLCommons consortium. NVIDIA achieved the fastest time to train…
How to Optimize Transformer-Based Models for Low-Precision Training
https://developer.nvidia.com/blog/how-to-optimize-transformer-based-models-for-low-precision-training/
NVIDIA Technical Blog
Transformer architectures are the backbone of many modern large language and generative AI models. As these models grow in size, training runs consume more GPU hours and more engineering iteration…
Build On-Device AI Companions with the NVIDIA ACE Game Agent SDK and Unreal Engine 5 Plugins
https://developer.nvidia.com/blog/build-on-device-ai-companions-with-the-nvidia-ace-game-agent-sdk-and-unreal-engine-5-plugins/
NVIDIA Technical Blog
NVIDIA RTX technologies are deeply integrated into Unreal Engine 5 through the NVIDIA RTX Branch of Unreal Engine and the NVIDIA DLSS Unreal Engine plugin. This provides developers with direct access…
Build Your Own Transaction Foundation Model for Financial Intelligence
https://developer.nvidia.com/blog/build-your-own-transaction-foundation-model-for-financial-intelligence/
NVIDIA Technical Blog
Every swipe, transfer, and payment on a modern financial network encodes a pattern of human behavior. Transaction data is one of the richest signals an enterprise owns. Yet most production use cases…