Key innovations in Meta’s MobileLLM include prioritizing model depth over width, implementing embedding sharing and grouped-query attention and utilizing a novel immediate block-wise weight-sharing technique.
Meta AI develops compact language model for mobile devices
This was originally published on post