LLM Compressor, Model Quantization and Sparsification techniques and recipes

Existing lab resources

Potential Topics to Cover in the Lab

LLM-Compressor

  • Understanding why you SHOULD NOT quantize your own model (and the small number of use cases where you should)

  • Compressing a model using an existing recipe