Microsoft unveils an AI model that recognizes image content and solves visual puzzles

The new AI model, Kosmos-1, is a Multimodal Large Language Model (MLLM) that responds not only to linguistic cues but also to visual ones, and can therefore give better answers in question-and-answer sessions.

Multimodal artificial intelligence could be the key to the development of artificial general intelligence, a technology that could in the future replace humans in any intellectual task or work.

What is Kosmos-1

Kosmos-1 is a multimodal model developed by Microsoft researchers. Last Monday, it was unveiled as a model that can:

  • read the content of images,
  • solve visual puzzles,
  • recognize text in images,
  • score well on visual IQ tests,
  • understand instructions given in natural language.

The development of multimodal artificial intelligence is seen as a crucial step towards creating an artificial general intelligence (AGI) capable of performing general tasks at a human level.

Language Is Not All You Need: Aligning Perception with Language Models

“Being a basic part of intelligence, multimodal perception is a necessity to achieve artificial general intelligence, in terms of knowledge acquisition and grounding to the real world,” the researchers write in their academic paper, Language Is Not All You Need: Aligning Perception with Language Models.

The Kosmos-1 model can analyze images and answer questions about them, read text from an image, write captions for images, and score between 22 and 26 percent on a visual IQ test, as demonstrated in the visual examples in the Kosmos-1 study.

AGI for OpenAI

OpenAI, Microsoft's key business partner in artificial intelligence, has set AGI as its primary focus. Kosmos-1, however, appears to be a Microsoft-only project, developed without OpenAI's involvement.

BlogInnovazione.it
