Pruning Heuchera - Search News

ICP: Immediate Compensation Pruning for Mid-to-high Sparsity

Abstract: The increasing adoption of large-scale models under 7 billion parameters in both language and vision domains enables inference tasks on a single consumer-grade GPU but makes fine-tuning ...

IEEE

GreenLLM: Towards Efficient Large Language Model via Energy-aware Pruning

Abstract: This paper proposes GreenLLM, a framework that effectively deploys generative Large Language Models (LLMs) on resource-limited edge devices to well meet the memory and timing constraints ...
