Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability Information Posted by Data Mining Lab. 날짜: 8/01/2025