Highlights:

  • MAI-1’s reported 500 billion parameters suggest that it could be positioned as a middle ground between GPT-3 and GPT-4.
  • It has been reported that Microsoft may power MAI-1 using training data and other resources from Inflection AI.

Microsoft Corp. is developing a large language model with over 500 billion parameters.

It is anticipated that the LLM, internally referred to as MAI-1, will launch as early as this month.

In mid-2020, OpenAI unveiled GPT-3, revealing that its first iteration contained 175 billion parameters. OpenAI hasn’t released precise figures for GPT-4 but has stated that the model is larger. According to some reports, Google LLC’s Gemini Ultra, which performs similarly to GPT-4, has 1.6 trillion parameters, while OpenAI’s flagship LLM is said to have 1.76 trillion.

MAI-1’s reported 500 billion parameters suggest that it could be positioned as a middle ground between GPT-3 and GPT-4. Such a configuration could allow the model to deliver high response accuracy while consuming substantially less electricity than OpenAI’s flagship LLM, which would lower Microsoft’s inference costs.
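As a rough illustration of why parameter count drives inference cost (the rule of thumb and the GPT-4 figure below are assumptions, not details from the reports): a decoder-only transformer performs roughly two floating-point operations per parameter for each generated token, so per-token compute scales with parameter count.

```python
# Back-of-envelope inference cost comparison (illustrative assumptions,
# not figures from the reports). Rule of thumb for decoder-only
# transformers: roughly 2 * N FLOPs per generated token for N parameters.
MAI_1_PARAMS = 500e9    # reported parameter count
GPT_4_PARAMS = 1.76e12  # reported, never confirmed by OpenAI

def flops_per_token(n_params: float) -> float:
    return 2 * n_params

ratio = flops_per_token(GPT_4_PARAMS) / flops_per_token(MAI_1_PARAMS)
print(f"MAI-1: {flops_per_token(MAI_1_PARAMS):.2e} FLOPs per token")
print(f"GPT-4: {flops_per_token(GPT_4_PARAMS):.2e} FLOPs per token")
print(f"GPT-4 needs ~{ratio:.1f}x the compute per generated token")
```

By this estimate, serving the smaller model would cost roughly a third as much compute per token, which is the economic argument the reports attribute to Microsoft.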

Mustafa Suleyman, co-founder of LLM developer Inflection AI Inc., is reportedly supervising the development of MAI-1. Suleyman and most of the startup’s staff joined Microsoft in March in a deal reportedly worth USD 625 million. The executive previously co-founded DeepMind, the AI research group that Google LLC acquired.

It has been reported that Microsoft may power MAI-1 using training data and other resources from Inflection AI. The model’s training dataset allegedly includes web content and text generated by GPT-4. According to reports, Microsoft is carrying out the training on a “large cluster of servers” equipped with graphics cards from Nvidia Corp.

According to the reports, Microsoft hasn’t yet decided how it will use MAI-1. If the model really has 500 billion parameters, it is too large to run on consumer devices. It follows that Microsoft will probably deploy MAI-1 in its data centers, where the LLM could be integrated into Azure and Bing services.
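A back-of-envelope estimate (assumed precision and GPU capacity, not figures from the reports) shows why a model of this size stays in the data center: at 16-bit precision the weights alone occupy about a terabyte, far beyond the roughly 24 GB of memory on a high-end consumer graphics card.

```python
# Rough memory footprint of the weights alone (illustrative; ignores
# activations, the KV cache, and runtime overhead, which add more).
params = 500e9                 # reported MAI-1 parameter count
weights_gb = params * 2 / 1e9  # 2 bytes per parameter at 16-bit precision

consumer_gpu_gb = 24           # assumed high-end consumer graphics card
print(f"Weights: ~{weights_gb:,.0f} GB")
print(f"~{weights_gb / consumer_gpu_gb:.0f} consumer GPUs just to hold the weights")
```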

If the model demonstrates enough potential by May 16, Microsoft is expected to introduce MAI-1 at its Build developer conference. This suggests that the company anticipates having a working prototype within a few weeks, if it doesn’t have one already.

The news that Microsoft is working on MAI-1 comes less than two weeks after the company released a small language model named Phi-3 Mini for public use. Phi-3 Mini has 3.8 billion parameters and, according to the company, can outperform LLMs more than ten times its size. It is the smallest member of an AI family that also includes two larger, somewhat more capable neural networks.
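Applying the same weight-storage arithmetic (again an illustrative estimate, not a figure from Microsoft) shows the contrast with MAI-1: Phi-3 Mini’s weights fit comfortably on consumer hardware.

```python
# The same weight-storage estimate applied to Phi-3 Mini (illustrative only).
params = 3.8e9
print(f"fp16 : ~{params * 2 / 1e9:.1f} GB")    # ~7.6 GB
print(f"4-bit: ~{params * 0.5 / 1e9:.1f} GB")  # ~1.9 GB, laptop territory
```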