Apple AI research shows how MLLMs understand, generate, search for images

Apple's researchers continue to focus on multimodal LLMs, with studies exploring their use for image generation, image understanding, and multi-turn web searches with cropped images.

With iOS 18, Apple made it possible to generate images on an iPhone through local AI models. Image Playground lets you create cartoon-like images of just about anything, all without a Wi-Fi connection. Now, the company is continuing its image-related endeavors through research that explores how multimodal LLMs use, generate, and understand images.