Interest in multimodal large language models (MLLMs) has exploded over the last year or so thanks to their versatile capabilities in tackling tasks across multiple varieties of data — text, images and videos, as well as time series and graph data.

Related Articles