MoST: One Open-Source Model for Speech + Text
Meet MoST, a fully open-source AI that understands speech and text in one model. Instead of treating audio and words the same, MoST uses a Modality-Aware Mixture of Experts (MAMoE) to send each token to the right specialists. * Modality-specific experts learn the unique patterns of audio and text. * Shared experts