Add k-bit blockwise quantization (K=2-5) with warp-level CUDA kernels #1858
Open
TimDettmers wants to merge 12 commits into