Google Gemma 4 12B, released June 3, is an open-weight multimodal model that processes text, images, audio, and video in a ...
With LLMs increasingly working multimodally, there are exciting developments for more performance and leaner sizes.