Lädt...

🔧 Nvidia's 1000x Performance Boost Claim Verified


Nachrichtenbereich: 🔧 Programmierung
🔗 Quelle: dev.to

Nvidia's keynote at the recent Computex was full of bold marketing and messaging, bordering on complete BS.

CEO Math Lesson

The "CEO Math" lesson with the "The more you buy, the more you save" conclusion has reminded me of another bold claim (and play with the numbers) from earlier this year.

At Blackwell's intro, one of the slides stated there's a 1000x boost in the compute power of Nvidia GPUs. Though many noticed the comparison was not apples-to-apples: FP16 data type performance for older generations was compared against FP8 and FP4 smaller data types introduced in the newer hardware. Apparently, lower precision computation is faster. The graph would be much nicer if the FP16 line continued. Like that:

Blackwell FP16 performance

It is great that the new hardware has acceleration for smaller data types. It follows the trend of quantized language models - trading off slight LLM performance degradation for smaller size and faster inference. Though presenting the figures in the way they were presented:

  • not explaining the difference in datatypes,
  • hiding the baseline and breaking consistency
  • not highlighting the downside of decreased precision...

... that seems like a sketchy move worth of "How to Lie with Statistics" book.

How to Lie with Statistics

Anyways... To come up with the above numbers for the FP16 performance for Hopper and Blackwell I found the specs for the products that had 4000 TFLOPS FP8 and 20000 TFLOPS FP4.

They are:

  • H100 SXM FP8 3,958 teraFLOPS and FP16 1,979 teraFLOPS

H100 SXM

  • GB200 NVL2 dual GPU system with FP4 40 PFLOPS and FP16 10 PFLOPS (5000 FP16 teraFLOPS per GPU)

GB200 NVL2

The improvement in performance is still impressive, yet 1000x is way nicer than a mere 263x ;)

...

🔧 Nvidia's 1000x Performance Boost Claim Verified


📈 69.83 Punkte
🔧 Programmierung

📰 Sony stellt drei Kopfhörer mit Noise Cancelling vor: WF-1000X, WI-1000X und WH-1000XM2


📈 48.48 Punkte
📰 IT Nachrichten

🎥 NVIDIA’s AI: Superhuman Performance…1000x Faster!


📈 35.73 Punkte
🎥 Künstliche Intelligenz Videos

🔧 🚀 Boosting TPC-H Q2 Query Performance by 1000x times: PawSQL Optimization Techniques


📈 30.31 Punkte
🔧 Programmierung

🎥 NVIDIA's DexMimicGen: 1000X Faster Humanoid Robot AI Agent Learning (GEN AI NEWS)


📈 29.66 Punkte
🎥 IT Security Video

🎥 NVIDIA's DexMimicGen: 1000X Faster Humanoid Robot AI Agent Learning (GEN AI NEWS)


📈 29.66 Punkte
🎥 IT Security Video

🎥 Nvidia's Eureka: 1000X Faster OpenAI GPT4 Powered AI Robot Agents


📈 29.66 Punkte
🎥 Künstliche Intelligenz Videos

📰 Verified mess — Twitter's $8 blue tick rollout sees 'verified' fakes


📈 27.77 Punkte
📰 IT Security Nachrichten

🕵️ Internet Bug Bounty: JWT audience claim is not verified


📈 26.51 Punkte
🕵️ Sicherheitslücken

🍏 Rumor repeats claim watchOS to get visionOS design elements, makes wild AI claim


📈 25.26 Punkte
🍏 iOS / Mac OS

🕵️ +1000X DISNEY+ ACCOUNTS W/ CAPTURE


📈 24.24 Punkte
🕵️ Hacking

🕵️ +1000X DISNEY+ ACCOUNTS W/ CAPTURE


📈 24.24 Punkte
🕵️ Hacking

🎥 Make Apt-Get Update & Upgrade 1000X Faster | Kali Linux 2019.1


📈 24.24 Punkte
🎥 IT Security Video

📰 Kopfhörer Sony WH-1000X M3 im Test


📈 24.24 Punkte
📰 IT Nachrichten

📰 Sony WF-1000X im Test: Wireless-In-Ear-Kopfhörer mit Geräuschfilterung


📈 24.24 Punkte
📰 IT Nachrichten

📰 Sony WF-1000X im Techstage-Test: Angenehme Drahtlos-Kopfhörer mit gutem Klang


📈 24.24 Punkte
📰 IT Nachrichten

📰 Für Action-Cams: Lexar Professional 1000x Micro SD 256


📈 24.24 Punkte
📰 IT Nachrichten

📰 Professional 1000x microSD: Lexars UHS-II-Speicherkarte bietet jetzt 256 GB


📈 24.24 Punkte
📰 IT Nachrichten

🔧 LLM Training: Data Costs 1000x More Than You Think!


📈 24.24 Punkte
🔧 Programmierung

🔧 Pathology AI Breakthrough: Train SOTA Models With 1000x Less Data


📈 24.24 Punkte
🔧 Programmierung

🔧 Video Generation Breakthrough: New Method Creates AI Videos 1000x Faster with 2 Simple Steps


📈 24.24 Punkte
🔧 Programmierung

🔧 New Method Makes AI Training Data Valuation 1000x Faster Without Model Access


📈 24.24 Punkte
🔧 Programmierung

📰 Noise Cancelling der 1000X-Serie: Sony Gaming-Headset im Amazon-Deal


📈 24.24 Punkte
📰 IT Nachrichten

📰 JBL Reflect Aero TWS gegen WF-1000X M4: Duell der kleinen Noise-Cancelling-Kopfhörer


📈 24.24 Punkte
📰 IT Nachrichten

📰 Sony WF-1000X M4 im Test: In-Ear-Kopfhörer mit etlichen Spielereien


📈 24.24 Punkte
📰 IT Nachrichten

🔧 Boost Your Resume with Free, Verified Certifications


📈 21.46 Punkte
🔧 Programmierung

📰 Does Being a Liability to an Employer Boost My SSDI Claim?


📈 20.21 Punkte
📰 IT Security Nachrichten

🐧 Marvel's Spider-Man 2 now Steam Deck Verified with a fresh update out to improve performance


📈 19.95 Punkte
🐧 Linux Tipps

🐧 Assassin's Creed Shadows out now - Steam Deck Verified but has NVIDIA problems on Desktop Linux


📈 19.31 Punkte
🐧 Linux Tipps

📰 Verified Priority Access (US): Nvidia startet Losverfahren für RTX 5090 FE & RTX 5080 FE


📈 19.31 Punkte
📰 IT Nachrichten