Model Pruning
2 videos • 4 views • by Tunadorable
1
Mixture of Sparse Attention for Automatic LLM Compression
Tunadorable
Download
2
LASER: Improving LLMs with Layer-Selective Rank Reduction
Tunadorable
Download