NVIDIA Further Boosts AI Performance By 3x For GeForce RTX GPUs, RTX PC & RTX Workstations With Latest Driver

NVIDIA has further boosted the AI performance of its GeForce RTX GPUs & RTX AI PC platforms with the latest R555 driver release.

NVIDIA's GeForce RTX GPUs & RTX PCs Offer The Fastest AI Performance Across All Segments, Now Boosted By 3X With Latest Drivers

During today's Microsoft Build, NVIDIA announced a range of new AI performance optimizations that are now available on the RTX platform, which includes GeForce RTX GPUs, workstations, and PCs.

The new optimizations are specifically targeted at a range of LLMs (Large Language Models) that power the latest Generative AI experiences. Using the latest R555 drivers, NVIDIA's RTX GPUs and AI PC platforms now offer up to 3x faster AI performance with ONNX Runtime (ORT) and DirectML. These two tools are used to run AI models locally on Windows PCs.
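As a rough illustration of how an application might run a model locally through ONNX Runtime with DirectML, the sketch below selects the DirectML execution provider when it is present and falls back to the CPU otherwise. It assumes the `onnxruntime-directml` Python package is installed; the helper names `pick_providers` and `open_session`, and whatever model path is passed in, are hypothetical and do not come from NVIDIA's or Microsoft's announcement.

```python
def pick_providers(available):
    """Return the preferred execution providers, in priority order, that are available."""
    # DirectML first (GPU-accelerated on Windows), CPU as the fallback.
    preferred = ["DmlExecutionProvider", "CPUExecutionProvider"]
    return [name for name in preferred if name in available]


def open_session(model_path):
    """Open an ONNX Runtime session, preferring DirectML when it is installed.

    model_path is a hypothetical path to an .onnx file supplied by the caller.
    """
    # Imported lazily so pick_providers() works without onnxruntime installed.
    import onnxruntime as ort

    providers = pick_providers(ort.get_available_providers())
    return ort.InferenceSession(model_path, providers=providers)
```

Listing the CPU provider last keeps the same code usable on machines without a DirectML-capable GPU, at the cost of slower inference.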

In addition, WebNN has been accelerated on RTX via DirectML. WebNN is an application programming interface that lets web developers deploy AI models. Microsoft is also working with NVIDIA to further accelerate RTX GPU performance while adding DirectML support to PyTorch. The following is the full list of capabilities that the new R555 drivers offer for GeForce RTX GPUs and RTX PCs:

  • Support for DQ-GEMM metacommand to handle INT4 weight-only quantization for LLMs
  • New RMSNorm normalization methods for Llama 2, Llama 3, Mistral and Phi-3 models
  • Group and multi-query attention mechanisms, and sliding window attention to support Mistral
  • In-place KV updates to improve attention performance
  • Support for GEMM of non-multiple-of-8 tensors to improve context phase performance
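To make a few of the terms in this list concrete, here is a minimal pure-Python sketch of three of them: RMSNorm, INT4 weight-only quantization, and a sliding-window attention mask. It shows the general techniques only and is not NVIDIA's driver implementation. The symmetric per-group quantization scheme, the `group_size` default, and all function names are assumptions made for illustration.

```python
import math


def rms_norm(x, weight, eps=1e-6):
    """RMSNorm: divide x by its root mean square, then apply a learned per-element gain."""
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]


def quantize_int4(weights, group_size=4):
    """Symmetric weight-only INT4 quantization; each group of weights shares one scale."""
    codes, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        max_abs = max(abs(w) for w in group)
        # Map the largest magnitude in the group to 7; avoid a zero scale.
        scale = max_abs / 7.0 if max_abs > 0 else 1.0
        scales.append(scale)
        # Signed 4-bit integers cover the range -8..7.
        codes.extend(max(-8, min(7, round(w / scale))) for w in group)
    return codes, scales


def dequantize_int4(codes, scales, group_size=4):
    """Recover approximate weights; activations stay in floating point (weight-only)."""
    return [c * scales[i // group_size] for i, c in enumerate(codes)]


def sliding_window_mask(seq_len, window):
    """mask[i][j] is True when query i may attend to key j: causal and within the window."""
    return [[(j <= i) and (i - j < window) for j in range(seq_len)]
            for i in range(seq_len)]
```

Storing weights as 4-bit codes plus one scale per group takes roughly a quarter of the memory of FP16 weights, which is why a fused dequantize-and-multiply path matters for running LLMs on consumer GPUs.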

In performance benchmarks of the ORT generative AI extension released by Microsoft, NVIDIA shows gains across the board in both INT4 and FP16 data types. The performance improvements are up to 3x thanks to the optimization techniques added within these

Read more on wccftech.com