💾 trunk/11b81e725fab8dddb5cdb2dfb715c24ae23854f8: Fix softmax decomposition for symbolic empty dims (#184454)
Nachrichtenbereich: 💾 Downloads
🔗 Quelle: github.com
Avoid emitting an amax over a possibly empty data-dependent softmax dimension by padding an identity sentinel when the reduction size is not statically known nonzero. This keeps softmax and... [Weiterlesen]
🔧 The Mind's Mirror
📈 1244.05 Punkte
🔧 Programmierung
🔧 什么是Online Softmax and Flash Attention?
📈 194.31 Punkte
🔧 Programmierung
🔧 Why Your AI Needs Both Intuition and Rules
📈 161.9 Punkte
🔧 Programmierung
🔧 The Curious Case of Terraform Workspaces
📈 160.48 Punkte
🔧 Programmierung
🔧 Flash Attention: what it does and why it matters
📈 160.02 Punkte
🔧 Programmierung
🔧 What an LLM Actually Does
📈 137.16 Punkte
🔧 Programmierung
🔧 Chapter 5: Linear Transformation and Softmax
📈 125.73 Punkte
🔧 Programmierung