Microsoft Unveils Efficient 2-Billion Parameter Language Model
Microsoft Research has introduced BitNet b1.58 2B4T, a new language model with 2 billion parameters that uses only 1.58 bits of weight per layer instead of the usual 16 or 32. Despite its reduced size, it matches the performance of full-precision models and runs efficiently on both GPUs and CPUs. The model was trained on … Read more