Recent advancements in training large multimodal models have been driven by efforts to eliminate modeling constraints and unify architectures across domains. Despite these strides, many existing ...
2402.11571 null 2024-02-14 UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models Ruchao Fan et al. ...
2401.18045 null 2024-02-08 Computation and Parameter Efficient Multi-Modal ...