Research Focus: Week of October 23, 2023

Welcome to Research Focus, a series of blog posts that highlights notable publications, events, code/datasets, new hires and other milestones from across the research community at Microsoft.

NEW RESEARCH

Kosmos-2.5: A Multimodal Literate Model

Current large language models (LLMs) primarily focus on textual information and cannot understand visual information. However, advancements in the field of multimodal large language models (MLLMs) aim to address this limitation. MLLMs combine visual and textual information within a single Transformer-based model, enabling

To finish reading, please visit source site

Research Focus: Week of October 23, 2023

NEW RESEARCH

Kosmos-2.5: A Multimodal Literate Model

Leave a Reply Cancel reply