Meno: | Júlia |
---|---|
Priezvisko: | Lichmanová |
Názov: | Improving Structured Pruning of Deep Neural Networks |
Vedúci: | Mgr. Vladimír Boľa, PhD. |
Rok: | 2025 |
Kµúčové slová: | deep neural networks, structured pruning, Monarch decomposition, low-rank approximations, spectral clustering, ILP, permutation algorithms, model compression |
Abstrakt: | To make large neural networks more efficient, this thesis focuses on structured compression using Monarch decomposition, a method for approximating weight matrices. To further improve Monarch efficiency, we propose to permute rows and columns of the weight matrices. We introduce two permutation algorithms, one based on Integer Linear Programming (ILP) and another using Spectral KNN, both designed to improve the accuracy of the decomposition. Our results show that Spectral KNN permutation algorithm offers a great trade-off between speed and accuracy, while running in a fraction of the time against ILP permutation algorithm. We also show that incorporating input activation statistics into both methods improves reconstruction quality and overall model accuracy. All code is available at https://github.com/Lima239/master-thesis-pruning-dnn |
Súbory diplomovej práce:
master-thesis-pruning-dnn.zip |
master_thesis_lichmanova.pdf |
Súbory prezentácie na obhajobe: