ElasticMM is an efficient and scalable serving system for large multimodal models (LMMs). It introduces Elastic Multimodal Parallelism (EMP), a new parallelization strategy that optimizes resource ...