Elon Musk-led xAI Corp. Launches First Multimodal Model Grok-1.5V

Published by: Insights Desk Released: Apr 15, 2024 Source: DemandTalk

Highlights:

The business claims that Grok-1.5V, which specializes in what it terms “multidisciplinary reasoning,” is more than capable of competing with current multimodal models across a range of fields.
According to benchmark data provided by xAI, Grok-1.5V performs better than industry competitors including GPT-4V, Claude, 3Sonnet, Claude 3 Opus, and Gemini Pro 1.5.

Elon Musk-led xAI Corp. launched its first multimodal model recently. The development adds to an AI arms race that never seems to get over.

Grok-1.5 Vision, also known as Grok-1.5V, is a considerably more advanced large language model than the original Grok-1 since it can comprehend both text and visuals, including displayed documents, images, screenshots, charts, diagrams, and more.

The business claims that Grok-1.5V, which specializes in what it terms “multidisciplinary reasoning,” is more than capable of competing with current multimodal models across a range of fields. It has intelligent spatiotemporal perception capabilities, or what’s called real-world spatial understanding in the AI community, which enable it to reason with complex text, analyze scientific images, and engage with visual content in a manner akin to that of a human.

The developer provided several real-world applications for the Grok-1.5V. For example, it can be used to convert drawings into kid-friendly stories, determine which object in a group is the largest, help drivers navigate obstacles by ensuring there is enough room, convert tables into CSV files, and determine whether a wooden deck needs to be replaced because it is decaying. Even the context of internet memes that the user is unfamiliar with will be explained.

According to benchmark data provided by xAI, Grok-1.5V performs better than industry competitors including GPT-4V, Claude, 3Sonnet, Claude 3 Opus, and Gemini Pro 1.5. Grok-1.5V outperformed its competitors by a significant margin in a new benchmark known as the RealWorldQA benchmark, which the company developed to assess real-world spatial comprehension.

Less than a month has passed since Musk’s team debuted the regular Grok-1.5 LLM, which defeated Grok-1 in terms of math and coding capabilities. Now, Grok is available in multimodal form. Additionally, Grok-1.5 demonstrated that it could handle far longer contexts than the original, allowing it to verify information from other sources and enhance answer accuracy.

The xAI claims that Grok-1.5V will soon be made accessible to early testers, beginning with those who have enrolled in X’s Premium service, which offers extra advantages to users of the social media platform formerly Twitter.

The startup, which debuted in July 2023, has advanced rapidly. Musk stated at the time that he was starting the business in response to AI developers like OpenAI and Google, who are very secretive about the inner workings of their AI models. According to Musk, the objective is to develop AI that is more accountable and transparent than the work of its competitors.

ai governance for the enterprise...

empower ai and real-time insights at the edge...

power ai and analytics workloads with performance,...

how to choose the right ai foundation model...

pros enterprise ai for the industrial industries (...

unlocking ai’s potential: challenges and opportu...

transforming procurement with ai: opportunities, c...

adobe acrobat ai assistant: reinventing productivi...

adobe acrobat ai assistant: reinventing productivi...

ai, automation, and the strategic cao...

an introduction to ai in customer service...

5 ways ai can transform your customer experience...

ciso guide to generative ai attacks...

10 reasons to hire a customer-led voice assistant...

10 reasons to hire a customer-led voice assistant...

the definitive buying guide for contact center her...

cfo's guide to ai...

discover the future of business innovation with ge...

preparing for the future of cx by harnessing the p...

tableau gpt: innovate for the future with generati...

profitable ai-powered data management solutions to...

business-centric cognitive architecture revolution...

ai use cases – innovations for business success...

the role of ai in software development...

ai in cybersecurity – your digital guardian...

how chatbot marketing supports today’s business ...

advanced adaptive ai bolsters business intelligenc...

the dynamic impact of ai in procurement...

ai in customer service – revealing common applic...

how to use dall-e for marketing success...

rpa vs ai: a comparative analysis for business aut...

maximizing business efficiency through ai integrat...

7 trendiest ai marketing campaigns igniting commer...

liquid neural network unveiling the fluid intellig...

the art of prompt engineering in general & marketi...

what is amazon bedrock?...

decode data like never before: chatgpt for data an...

workforce planning models –the power of ai skil...

black friday and the impact of ai in e-commerce...

how digital brain is a game changer for business s...

microsoft introduces bing generative search in lim...

cytoreason raises usd 80 m in the funding round in...

google unveils a suite of new features for ai apps...

kindo reels in usd 20.6 m and acquires whiterabbit...

microsoft’s spreadsheetllm enhances ai’s compr...

herculesai raises usd 26 m to develop and expand i...

intel capital leads usd 15 m investment in ai cons...

aws unveils app studio to accelerate app developme...

captions llc raises usd 60 m for generative video ...

enso technologies secures usd 6 m for smb-focused ...

hebbia raises usd 130 m to develop data search pla...

meta releases four open-source language models...

harvey is reportedly raising usd 100 m at usd 1.5 ...

cloudflare introduces a new no-code feature to pre...

redactive raises usd 7.5 m to expand headcount and...

rapid7 acquires noetic cyber to help businesses fi...

runway ai aims for usd 450 m amid ai startup inter...

gen ai coding assistant startup magic ai aims to r...

anthropic introduces new program to fund enhanced ...

meta to open-source meta llm compiler for code opt...

role of machine learning in networking...

Elon Musk-led xAI Corp. Launches First Multimodal Model Grok-1.5V

Highlights:

Insights Desk

Related posts

Microsoft Introduces Bing Generative Search in Lim...

CytoReason Raises USD 80 M in the Funding Round In...

Google Unveils a Suite of New Features for AI Apps...

Kindo Reels in USD 20.6 M and Acquires WhiteRabbit...

Microsoft’s SpreadsheetLLM Enhances AI’s Compr...

HerculesAI Raises USD 26 M to Develop and Expand i...

Intel Capital Leads USD 15 M Investment in AI Cons...

AWS Unveils App Studio to Accelerate App Developme...

Captions LLC Raises USD 60 M for Generative Video ...

Enso Technologies Secures USD 6 M for SMB-focused ...

Our Brands