Skip to content

Add k-bit blockwise quantization (K=2-5) with warp-level CUDA kernels#1858

Open
TimDettmers wants to merge 12 commits into
mainfrom
feature/kbit-quantization
Open

Add k-bit blockwise quantization (K=2-5) with warp-level CUDA kernels#1858
TimDettmers wants to merge 12 commits into
mainfrom
feature/kbit-quantization

Apply suggestion from @matthewdouglas

ad7f194
Select commit
Loading
Failed to load commit list.
Sign in for the full log view

The logs for this run have expired and are no longer available.