Bits With Brains
Curated AI News for Decision-Makers
What Every Senior Decision-Maker Needs to Understand About AI and its Impact
Latest Nvidia AI Announcements: Computex 2024
6/8/24
Editorial team at Bits with Brains
Nvidia's announcements at Computex 2024 might just underscore the company's continued dominance in AI technology.
Nvidia's announcements at Computex 2024 underscore the company's continued dominance in AI technology. The announcements span consumer computing, accelerated computing, networking, enterprise computing, and industrial digitalization. Nvidia's pioneering work in AI began with the GeForce RTX in 2018, which introduced deep learning super sampling (DLSS) to enhance gaming performance. Today, Nvidia's AI applications extend to content creation, videoconferencing, live streaming, and productivity, with over 500 AI-powered apps and games accelerated by RTX GPUs.
Here's a summary of their announcements.
GeForce RTX AI Laptops and Hardware
At Computex 2024, Nvidia introduced new GeForce RTX AI laptops from manufacturers like ASUS and MSI. These laptops feature advanced AI hardware and Copilot features, supporting AI-enabled apps and games. Leveraging Nvidia's latest AI libraries and SDKs, these laptops are optimized for AI inferencing, training, gaming, 3D rendering, and video processing. They offer significant performance improvements, such as up to seven times faster text-to-image and up to ten times faster large language model inference compared to Macs. This marks a significant leap in performance and efficiency for AI applications on consumer hardware.
RTX AI Toolkit for Developers
The RTX AI Toolkit is a comprehensive suite of tools and SDKs designed to help developers customize, optimize, and deploy AI models on Windows applications. It includes tools for fine-tuning pretrained models, optimizing them for various hardware, and deploying them for local and cloud inference. The toolkit aims to bridge the gap between powerful AI models and the unique requirements of Windows app development, making it easier for developers to integrate AI capabilities into their applications. This toolkit is already being adopted by creative software providers like Adobe and Blackmagic Design to enhance AI performance in their apps.
Spectrum-X Ethernet Network for AI Factories
Nvidia's Spectrum-X is an Ethernet network explicitly designed for AI factories. Traditional Ethernet networks are insufficient for AI due to minimal server-to-server communication and high jitter tolerance. Spectrum-X optimizes GPU-to-GPU connectivity by leveraging network interface cards (NICs) and switches, significantly enhancing performance. It provides higher bandwidth, increased all-to-all bandwidth, better load balancing, and 1,000 times faster telemetry for real-time AI application optimization. This makes it a critical infrastructure component for AI-driven enterprises and cloud service providers.
Nvidia NIM Microservices for AI Inference
Nvidia NIM is a set of microservices designed to run generative AI models specifically for inference tasks. These microservices integrate with different tools and frameworks, making them accessible regardless of the development environment. NIM houses Nvidia's runtime libraries to maximize model execution speed, offering up to three times the throughput compared to off-the-shelf models. This improvement allows more efficient use of AI infrastructure, generating more output in less time and simplifying deployment by reducing setup and optimization time from weeks to minutes.
Omniverse Platform for Industrial Digitalization
Nvidia's Omniverse platform has reached a tipping point in industrial digitalization. It connects with key industrial technologies like Siemens and Rockwell, offering capabilities such as simulation, synthetic data generation for AI training, and high-fidelity visualization. The platform employs a three-computer solution in AI and robotics: an AI supercomputer for creating models, a runtime computer for real-time sensor processing in robots, and the Omniverse computer for digital twin simulation. This setup allows extensive virtual testing and optimization of AI models and robotic systems, driving efficiency and innovation in industrial applications.
Project G-Assist for Gaming
Project G-Assist is an AI assistant technology designed to enhance gamer experiences by providing context-aware responses to in-game queries, tracking system performance during gameplay, and optimizing system settings. This project aims to transform gaming by offering real-time assistance and optimization, making it easier for gamers to navigate complex game environments and improve their performance. The technology leverages Nvidia's AI capabilities to deliver personalized and efficient gaming support.
Collaboration with Microsoft on AI Capabilities
Nvidia and Microsoft are collaborating to enable developers to create generative AI-capable Windows native and web applications. This collaboration will allow API access to GPU-accelerated small language models (SLMs) and retrieval-augmented generation (RAG) capabilities that run on-device as part of Windows Copilot Runtime. These capabilities will support content summarization, content generation, task automation, and more, making it easier for developers to integrate advanced AI features into their applications.
Blackwell Platform for Accelerated Computing
The Blackwell platform delivers 1,000 times more performance than the Pascal platform released eight years ago. It features six transformative technologies for accelerated computing, including new Tensor Cores and TensorRT-LLM Compiler, which reduce LLM inference operating costs and energy consumption by up to 25 times. Blackwell's AI capabilities extend beyond chip-level advancements, with Nvidia continuing to innovate across all layers of the data center to improve AI factories. The platform is widely adopted by major cloud providers and AI companies, driving breakthroughs in data processing, engineering simulation, and generative AI.
GB200 NVL2 Platform for Generative AI in Data Centers
The GB200 NVL2 platform is designed to bring generative AI capabilities to every data center. It offers 40 petaflops of AI performance, 144 Arm Neoverse CPU cores, and 1.3 terabytes of fast memory in a single node. This platform significantly improves performance for large language models (LLMs) and data processing tasks compared to traditional CPU systems. It speeds up data processing by up to 18 times and provides up to nine times better performance for vector database search queries, making it a powerful tool for data-intensive AI applications.
Nvidia's Roadmap and Future Plans
Nvidia's roadmap includes continued innovation in AI technology, with plans to launch new products and platforms annually. The company aims to maintain its leadership in AI by developing advanced hardware, software, and networking solutions that cater to the evolving needs of AI-driven industries.
Nvidia's future plans involve expanding its AI capabilities across various sectors, including consumer computing, enterprise computing, and industrial digitalization, in an attempt to ensure that its technology remains at the forefront of the AI revolution.
Sources:
[1] https://siliconangle.com/2024/06/02/ai-ai-ai-deep-dive-nvidias-announcements-computex-2024/
[2] https://nvidianews.nvidia.com/news/nvidia-blackwell-platform-arrives-to-power-a-new-era-of-computing
[4] https://blogs.nvidia.com/blog/ai-innovations-ces-2024/
[5] https://www.nvidia.com/gtc/
[6] https://www.nvidia.com/en-us/geforce/news/gfecnt/20242/2024-geforce-rtx-laptops-available-now/
[7] https://blockchain.news/news/nvidia-rtx-powered-ai-hardware-software-computex-2024
[8] https://blogs.nvidia.com/blog/rtx-ai-pc-studio-computex/
[9] https://nvidianews.nvidia.com/news/nvidia-brings-ai-assistants-to-life-with-geforce-rtx-ai-pcs
[10] https://www.nvidia.com/en-us/geforce/news/computex-2024-new-rtx-ai-laptops/
[11] https://dataphoenix.info/nvidia-at-computex-2024-rtx-ai-toolkit-developer-preview/
[13] https://developer.nvidia.com/rtx/ai-toolkit
[15] https://www.reddit.com/r/LocalLLaMA/comments/1d81fnh/nvidia_announce_that_rtx_ai_toolkit_is_coming/
[16] https://www.nvidia.com/en-us/networking/spectrumx/
[19] https://nvidianews.nvidia.com/news/nvidia-supercharges-ethernet-networking-for-generative-ai
Sources