AdvancedVocabulary#ai-llm#data-science-ml#frontend

Multi-Modal AI Vocabulary

Build fluency in the vocabulary of a single model reasoning jointly across image and text input.

0 / 5 completed
1 / 5
At standup, a dev mentions a single model that can accept an image and a piece of text together as input and reason jointly about both, rather than requiring a separate model for each input type. What is this kind of model called?