Learn/Core Concept What is neural network pruning? Pruning removes unnecessary connections or neurons from trained neural networks to reduce model size whilst maintaining accuracy. It identifies which weights contribute least to the final output and eliminates them, creating sparse networks that run faster and use less memory. This technique is crucial for deploying models on edge devices where computational resources are limited. QuantisationCompression |