MistralAI Launches Mixtral 8x22B, a Leading Open-source AI Model

Published by: Insights Desk Released: Apr 11, 2024 Source: DemandTalk

Highlights:

Mistral, established by AI professionals from Google LLC and Meta, stands out among startups aimed at providing open-source models for public use.
It is widely anticipated that Mixtral 8x22B will surpass the performance of Mistral AI’s earlier Mixtral 8x7B model.

Paris-based open-source generative AI startup Mistral AI has launched Mixtral 8x22B, a new large language model, as part of its strategy to remain competitive with the major players in the industry.

This recently introduced model is anticipated to surpass the performance of the company’s earlier version, the Mixtral 8x7B. Numerous specialists regard it as a highly formidable competitor to more prominent competitors, including OpenAI’s GPT-3.5 and Meta Platforms Inc.’s Llama 2.

The startup, having secured USD 415 million in funding this past December and now valued at over USD 2 billion, describes its latest model as the most potent to date. It features a 65,000-token context window, indicating the volume of text it can simultaneously process and refer to. Moreover, the Mixtral 8x22B model is equipped with up to 176 billion parameters, which represent the count of internal variables it leverages for decision-making and forecasting.

Mistral, established by AI experts formerly with Google LLC and Meta, stands among a group of AI startups dedicated to developing open-source models accessible to all. In a move that diverged from the norm, the company initially released its new model through a torrent link shared on the social media platform X. Subsequently, Mistral made the Mixtral 8x22B model accessible on the Hugging Face and Together AI platforms, allowing users to retrain further and adapt it for more specific applications.

Just days following the release of new models by its competitors, the startup unveiled the Mixtral 8x22B. On Tuesday, OpenAI introduced GPT-4 Turbo with Vision, the latest addition to its GPT-4 Turbo lineup, equipped with visual capabilities, allowing it to process photos, drawings, and other user-uploaded images. Later the same day, Google released Gemini Pro 1.5 LLM to the public, offering developers a free version with a limit of up to 50 requests daily.

In a bid to match its competitor, Meta also announced its plans to unveil Llama 3 later in this month.

The Mixtral 8x22B model is widely anticipated to surpass the performance of Mistral AI’s former Mixtral 8x7B model, which nearly outperformed GPT-3.5 and Llama 2 in several critical benchmarks.

The model utilizes a sophisticated, sparse “mixture-of-experts” (MoE) architecture designed for efficient computation and superior performance on a broad spectrum of tasks. This sparse MoE strategy seeks to offer users an amalgamation of distinct models, each tailored to excel in various types of functions, as a means to enhance performance while optimizing costs.

“At every layer, for every token, a router network chooses two of these groups (the ‘experts’) to process the token and combine their output additively. This technique increases the number of parameters of a model while controlling cost and latency, as the model only uses a fraction of the total set of parameters per token,” Mistral AI states on its website.

The distinctive architecture of Mixtral 8x22B, despite its vast size, necessitates only about 44 billion active parameters for each forward pass. This renders it quicker and more economical to operate compared to models of similar magnitude.

Therefore, the introduction of Mixtral 8x22B marks a significant achievement in the field of open-source generative AI, offering researchers, developers, and enthusiasts alike the chance to experiment with some of the most cutting-edge models without facing obstacles like restricted access and substantial expenses. It is accessible under the liberal Apache 2.0 license.

Feedback from the AI community on social media has been largely favorable, with enthusiasts expressing optimism that it will provide substantial benefits for tasks including customer service, drug discovery, and climate modeling.

While Mistral AI has received significant commendation for its commitment to open-source principles, it has not been without its detractors. The company’s models are classified as “frontier models,” implying they carry a risk of being misused. Additionally, given that the company’s AI models are freely downloadable and can be built upon by anyone, the startup is unable to control or prevent the use of its technology for nefarious purposes.

ai governance for the enterprise...

empower ai and real-time insights at the edge...

power ai and analytics workloads with performance,...

how to choose the right ai foundation model...

pros enterprise ai for the industrial industries (...

unlocking ai’s potential: challenges and opportu...

transforming procurement with ai: opportunities, c...

adobe acrobat ai assistant: reinventing productivi...

adobe acrobat ai assistant: reinventing productivi...

ai, automation, and the strategic cao...

an introduction to ai in customer service...

5 ways ai can transform your customer experience...

ciso guide to generative ai attacks...

10 reasons to hire a customer-led voice assistant...

10 reasons to hire a customer-led voice assistant...

the definitive buying guide for contact center her...

cfo's guide to ai...

discover the future of business innovation with ge...

preparing for the future of cx by harnessing the p...

tableau gpt: innovate for the future with generati...

profitable ai-powered data management solutions to...

business-centric cognitive architecture revolution...

ai use cases – innovations for business success...

the role of ai in software development...

ai in cybersecurity – your digital guardian...

how chatbot marketing supports today’s business ...

advanced adaptive ai bolsters business intelligenc...

the dynamic impact of ai in procurement...

ai in customer service – revealing common applic...

how to use dall-e for marketing success...

rpa vs ai: a comparative analysis for business aut...

maximizing business efficiency through ai integrat...

7 trendiest ai marketing campaigns igniting commer...

liquid neural network unveiling the fluid intellig...

the art of prompt engineering in general & marketi...

what is amazon bedrock?...

decode data like never before: chatgpt for data an...

workforce planning models –the power of ai skil...

black friday and the impact of ai in e-commerce...

how digital brain is a game changer for business s...

microsoft introduces bing generative search in lim...

cytoreason raises usd 80 m in the funding round in...

google unveils a suite of new features for ai apps...

kindo reels in usd 20.6 m and acquires whiterabbit...

microsoft’s spreadsheetllm enhances ai’s compr...

herculesai raises usd 26 m to develop and expand i...

intel capital leads usd 15 m investment in ai cons...

aws unveils app studio to accelerate app developme...

captions llc raises usd 60 m for generative video ...

enso technologies secures usd 6 m for smb-focused ...

hebbia raises usd 130 m to develop data search pla...

meta releases four open-source language models...

harvey is reportedly raising usd 100 m at usd 1.5 ...

cloudflare introduces a new no-code feature to pre...

redactive raises usd 7.5 m to expand headcount and...

rapid7 acquires noetic cyber to help businesses fi...

runway ai aims for usd 450 m amid ai startup inter...

gen ai coding assistant startup magic ai aims to r...

anthropic introduces new program to fund enhanced ...

meta to open-source meta llm compiler for code opt...

role of machine learning in networking...

MistralAI Launches Mixtral 8x22B, a Leading Open-source AI Model

Highlights:

Insights Desk

Related posts

Microsoft Introduces Bing Generative Search in Lim...

CytoReason Raises USD 80 M in the Funding Round In...

Google Unveils a Suite of New Features for AI Apps...

Kindo Reels in USD 20.6 M and Acquires WhiteRabbit...

Microsoft’s SpreadsheetLLM Enhances AI’s Compr...

HerculesAI Raises USD 26 M to Develop and Expand i...

Intel Capital Leads USD 15 M Investment in AI Cons...

AWS Unveils App Studio to Accelerate App Developme...

Captions LLC Raises USD 60 M for Generative Video ...

Enso Technologies Secures USD 6 M for SMB-focused ...

Our Brands