The MAMBA Model transformer with a language modeling head on top (linear layer with weights tied to the input embeddings).
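To make the idea of tied weights concrete, here is a minimal pure-Python sketch. It is a toy illustration, not the library's implementation: the names `embedding`, `embed`, and `lm_head` and the tiny 3x2 matrix are assumptions made up for this example.

```python
# Toy illustration of weight tying; not the library's implementation.
# `embedding`, `embed`, and `lm_head` are hypothetical names for this sketch.

# One shared matrix: vocab_size (3) rows, hidden_size (2) columns.
embedding = [
    [1.0, 0.0],
    [0.0, 1.0],
    [1.0, 1.0],
]


def embed(token_id):
    """Input side: look up the row of the shared matrix for a token id."""
    return embedding[token_id]


def lm_head(hidden):
    """Output side: score every token using the same shared matrix."""
    return [sum(w * h for w, h in zip(row, hidden)) for row in embedding]


# Embed token 2, then project straight back to vocabulary scores.
logits = lm_head(embed(2))
```

Because both functions read from the same matrix, any update to it changes the input lookup and the output projection together, which is what tying the weights means here.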
If passed along, the model uses the previous state in all the blocks.
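The effect of passing a previous state along can be sketched with a toy recurrence. This is an assumption-laden illustration rather than the model's actual state update: the function `run`, its `decay` parameter, and the scalar state are all hypothetical.

```python
# Toy linear recurrence standing in for a stateful block.
# `run`, `decay`, and the scalar state are hypothetical, not the real model.

def run(inputs, state=0.0, decay=0.5):
    """Apply state = decay * state + x per input; return outputs and final state."""
    outputs = []
    for x in inputs:
        state = decay * state + x
        outputs.append(state)
    return outputs, state


# Process the whole sequence in one call.
full_out, _ = run([1.0, 2.0, 3.0, 4.0])

# Process a prefix, keep its final state, then continue from that state.
prefix_out, cached = run([1.0, 2.0])
suffix_out, _ = run([3.0, 4.0], state=cached)
```

In this sketch, continuing from the cached state yields the same outputs for the later inputs as processing the whole sequence at once, without re-reading the prefix.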