Abstract: Mixture of experts (MoE) is a popular technique in deep learning that scales model capacity with conditionally activated parallel neural network modules (experts). However, serving MoE ...
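To make the "conditionally activated experts" idea concrete, below is a minimal, illustrative top-k gated MoE layer in PyTorch. The class name, layer sizes, and routing scheme are assumptions chosen for exposition only; they are not the serving system or model architecture described in either abstract.

```python
# Minimal sketch of a conditionally activated mixture-of-experts layer
# (illustrative only; hyperparameters and routing are assumed, not from the papers).
import torch
import torch.nn as nn
import torch.nn.functional as F


class TopKMoE(nn.Module):
    def __init__(self, d_model: int, d_hidden: int, num_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        # Parallel feed-forward experts; only top_k of them are activated per token.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(), nn.Linear(d_hidden, d_model))
            for _ in range(num_experts)
        )
        # Gating network scores each token against every expert.
        self.gate = nn.Linear(d_model, num_experts)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (num_tokens, d_model)
        scores = self.gate(x)                              # (tokens, experts)
        top_w, top_idx = scores.topk(self.top_k, dim=-1)   # route each token to its top_k experts
        top_w = F.softmax(top_w, dim=-1)                   # normalize the selected gate weights
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            idx = top_idx[:, slot]
            for e, expert in enumerate(self.experts):
                mask = idx == e
                if mask.any():
                    # Only tokens routed to expert e pass through it (conditional activation).
                    out[mask] += top_w[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out


if __name__ == "__main__":
    layer = TopKMoE(d_model=64, d_hidden=256)
    tokens = torch.randn(10, 64)
    print(layer(tokens).shape)  # torch.Size([10, 64])
```

Because each token activates only `top_k` experts, parameter count grows with the number of experts while per-token compute stays roughly constant, which is the capacity/compute trade-off the abstract alludes to.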
Abstract: Recent advancements in Multimodal Large Language Models (MLLMs) underscore the significance of scalable models and data to boost performance, yet this often incurs substantial computational ...