Pruning
A method of optimizing a Checkpoint model to increase the speed of inference (prompt generation), file size, and VRAM cost.
A method of optimizing a Checkpoint model to increase the speed of inference (prompt generation), file size, and VRAM cost.