NVIDIA Unveils Upgraded NeMo Framework, Boosting LLM Training Efficiency

2023-12-06

NVIDIA has updated its NeMo framework and improved large language model (LLM) training performance on the H200 GPU. The releases target developers and researchers working on AI foundation models such as Llama 2 and Nemotron-3. The updated NeMo framework is now cloud-native, supports a broader range of model architectures, and employs advanced parallelism techniques for efficient training. On the hardware side, the H200 GPU delivers a significant jump in Llama 2 training performance over previous-generation GPUs. Announced on December 4th, these tools are now globally available and serve applications ranging from academic research to industry deployments.

The updates address the growing demand for higher training performance across complex and diverse large language models. They focus on accelerating the training process, improving efficiency, and expanding model capabilities, all of which matter for computationally intensive models. Specific enhancements include an improved mixed-precision implementation, optimized activation functions, and more efficient communication. With these optimizations, the H200 GPU reaches 836 TFLOPS per GPU, substantially increasing training throughput. The introduction of Fully Sharded Data Parallelism (FSDP) and a Mixture of Experts (MoE) architecture further optimizes training and expands model capacity, while TensorRT-LLM accelerates reinforcement learning from human feedback (RLHF), supporting larger models and improving performance. Illustrative sketches of the mixed-precision, FSDP, and MoE techniques follow below.

The NeMo framework is available as an open-source library and as containers on NGC, and it is included in NVIDIA AI Enterprise. NVIDIA also offers additional resources, such as the GTC conference, webinars, and SDKs, for working further with its AI tools.
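The announcement does not include code, but as a rough illustration of what a mixed-precision training step involves, here is a minimal sketch using PyTorch's generic torch.cuda.amp API. This is not NeMo's actual interface, and the layer sizes and learning rate are placeholders chosen only for the example.

```python
# Minimal, generic mixed-precision training step (illustrative only; not NeMo's API).
import torch
from torch import nn
from torch.cuda.amp import autocast, GradScaler

model = nn.Linear(4096, 4096).cuda()              # stand-in for a transformer block
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
scaler = GradScaler()                             # rescales gradients to avoid FP16 underflow

def train_step(batch, target):
    optimizer.zero_grad(set_to_none=True)
    with autocast(dtype=torch.float16):           # run matmuls in half precision
        loss = nn.functional.mse_loss(model(batch), target)
    scaler.scale(loss).backward()                 # backprop on the scaled loss
    scaler.step(optimizer)                        # unscale gradients, then optimizer step
    scaler.update()                               # adapt the loss scale for the next step
    return loss.item()
```

The point of the scaler is to keep small FP16 gradients from underflowing while most of the arithmetic runs in reduced precision, which is where the throughput gains come from.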
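Fully Sharded Data Parallelism can likewise be sketched with PyTorch's stock FSDP wrapper. NeMo's own integration is driven by its configuration files rather than a direct call like this, so treat the snippet below as a generic illustration of the technique, with a placeholder model and a launch via torchrun assumed.

```python
# Generic FSDP wrapping sketch (illustrative only; launched with torchrun, one process per GPU).
import torch
import torch.distributed as dist
from torch import nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

def build_sharded_model():
    dist.init_process_group("nccl")               # set up the per-GPU process group
    torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())
    model = nn.Sequential(                        # stand-in for a large transformer
        nn.Linear(4096, 16384), nn.GELU(), nn.Linear(16384, 4096)
    ).cuda()
    # Parameters, gradients, and optimizer state are sharded across ranks,
    # so per-GPU memory use shrinks as the number of GPUs grows.
    return FSDP(model)
```

Sharding optimizer state and parameters, instead of replicating them on every GPU as classic data parallelism does, is what lets FSDP fit larger models per device.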
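Finally, the Mixture of Experts idea can be shown with a toy routing layer. The sketch below is a simplified top-2 router in plain PyTorch; it omits the expert parallelism, load balancing, and capacity limits a production MoE implementation needs, and all dimensions and names are hypothetical.

```python
# Toy top-2 Mixture of Experts layer (illustrative only; no expert parallelism or load balancing).
import torch
from torch import nn

class TinyMoE(nn.Module):
    def __init__(self, d_model=512, d_ff=2048, num_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, num_experts)   # scores each token for each expert
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x):                               # x: (tokens, d_model)
        weights = torch.softmax(self.router(x), dim=-1)
        top_w, top_idx = weights.topk(self.top_k, dim=-1)
        top_w = top_w / top_w.sum(dim=-1, keepdim=True) # renormalize over the kept experts
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = top_idx[:, slot] == e            # tokens routed to expert e in this slot
                if mask.any():
                    out[mask] += top_w[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

# Usage: y = TinyMoE()(torch.randn(16, 512))
```

Because each token activates only its top-k experts, total parameter count (capacity) can grow with the number of experts while per-token compute stays roughly constant, which is the trade-off MoE architectures exploit.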