Highlights:

  • Cohere offers a larger version of Aya with 35 billion parameters for developers with more advanced requirements.
  • Aya 23 was trained by Cohere on a multilingual training dataset, also known as Aya, which was open-sourced earlier this year.

Cohere Inc. recently introduced Aya 23, a new family of open-source large language models capable of understanding 23 languages.

Toronto-based Cohere, an OpenAI competitor, is backed by more than USD 400 million in funding from Nvidia Corp., Oracle Corp., and other investors. It provides a set of large language models (LLMs) optimized for the enterprise market. Cohere also offers Embed, a neural network designed to convert data into mathematical structures that language models can more easily understand.

The Aya 23 series comprises two models at launch. The first features 8 billion parameters and is designed for use cases that require a balance between response quality and performance. Cohere also offers a larger version of Aya with 35 billion parameters for developers with more advanced requirements.

The latter edition, Aya-23-35B, is based on an LLM called Command R that Cohere introduced last March. Command R served as Cohere’s flagship AI model until this past April, when the company debuted a more advanced algorithm. It supports prompts of up to 128,000 tokens, features a built-in retrieval-augmented generation (RAG) capability, and can automatically carry out tasks in external applications.
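In general terms, retrieval-augmented generation means fetching relevant documents at query time and injecting them into the prompt so the model answers from that material rather than from memory alone. The sketch below illustrates that flow with toy stand-ins for the retriever and the generator; it is not Cohere’s SDK or Command R’s actual implementation.

```python
# Illustrative retrieval-augmented generation (RAG) flow with toy stand-ins
# for the retriever and the generator -- not Cohere's SDK or Command R itself.

DOCUMENTS = [
    "Aya 23 is a family of multilingual open-source LLMs from Cohere.",
    "Command R supports prompts of up to 128,000 tokens.",
    "Grouped query attention reduces the memory footprint of inference.",
]

def retrieve_documents(query: str, top_k: int = 2) -> list[str]:
    """Toy retriever: rank documents by word overlap with the query.
    A production system would use an embedding model and a vector store."""
    q_words = set(query.lower().split())
    ranked = sorted(DOCUMENTS, key=lambda d: -len(q_words & set(d.lower().split())))
    return ranked[:top_k]

def generate(prompt: str) -> str:
    """Toy generator: a real system would call the LLM here."""
    return f"[model answer grounded in]\n{prompt}"

def answer_with_rag(question: str) -> str:
    context = "\n".join(retrieve_documents(question))      # inject retrieved text into the prompt
    prompt = f"Context:\n{context}\n\nQuestion: {question}"
    return generate(prompt)

print(answer_with_rag("How long can Command R prompts be?"))
```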

Under the hood, Aya-23-35B relies on a widely used LLM design known as the decoder-only Transformer architecture. Models built on this architecture determine the meaning of each word in a user prompt by analyzing its context, specifically the text that precedes it. This approach can produce more accurate output than many earlier neural network designs.
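To make that concrete, the toy snippet below shows the causal-masking idea at the heart of decoder-only Transformers: each token can attend only to itself and the tokens before it. The names and shapes are illustrative and are not drawn from Cohere’s implementation.

```python
import numpy as np

def causal_attention(q, k, v):
    """Toy single-head attention with a causal mask: each position may
    only attend to itself and earlier positions, mirroring how a
    decoder-only model reads a prompt left to right."""
    seq_len, d = q.shape
    scores = q @ k.T / np.sqrt(d)                      # (seq_len, seq_len) similarity scores
    mask = np.triu(np.ones((seq_len, seq_len)), k=1)   # 1s above the diagonal mark future tokens
    scores = np.where(mask == 1, -np.inf, scores)      # block attention to future positions
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)     # row-wise softmax
    return weights @ v                                 # context-weighted mixture of values

# Example: 4 tokens with 8-dimensional toy embeddings
rng = np.random.default_rng(0)
q = k = v = rng.normal(size=(4, 8))
print(causal_attention(q, k, v).shape)  # (4, 8)
```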

According to Cohere, Aya-23-35B enhances several aspects of the standard decoder-only Transformer architecture. The company’s enhancements have made the model more proficient in understanding user prompts.

The mechanism that enables an LLM to discern the meaning of a word from its context isn’t typically built as a single software module. Instead, it comprises multiple modules, each interpreting the text in a slightly different way. Aya-23-35B implements these components using a technique known as grouped query attention, which reduces their RAM usage and thereby speeds up inference.
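The minimal sketch below illustrates the general grouped-query-attention technique: several query heads share each key/value head, so far fewer key and value tensors need to be cached during inference. The head counts and dimensions are arbitrary examples, not Aya-23-35B’s actual configuration.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def grouped_query_attention(q, k, v):
    """Toy grouped-query attention: many query heads share a small number
    of key/value heads, shrinking the KV cache held in memory at inference."""
    n_q_heads, seq_len, d_head = q.shape
    n_kv_heads = k.shape[0]
    group = n_q_heads // n_kv_heads                 # query heads per shared KV head
    outputs = []
    for h in range(n_q_heads):
        kv = h // group                             # pick the shared KV head for this query head
        scores = q[h] @ k[kv].T / np.sqrt(d_head)
        outputs.append(softmax(scores) @ v[kv])
    return np.stack(outputs)                        # (n_q_heads, seq_len, d_head)

rng = np.random.default_rng(0)
q = rng.normal(size=(8, 4, 16))   # 8 query heads, 4 tokens, 16-dim heads
k = rng.normal(size=(2, 4, 16))   # only 2 key heads need to be cached
v = rng.normal(size=(2, 4, 16))   # only 2 value heads need to be cached
print(grouped_query_attention(q, k, v).shape)  # (8, 4, 16)
```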

Aya-23-35B also incorporates a technology known as rotary positional embeddings. An LLM considers both the meaning of words and their position within a sentence when interpreting text. Rotary positional embeddings allow LLMs to process word-location information more effectively, improving the quality of their output.
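The snippet below sketches the general rotary-embedding idea: each pair of dimensions in a query or key vector is rotated by an angle proportional to the token’s position, so relative position is encoded directly in the attention dot products. It is a generic illustration, not Cohere’s implementation.

```python
import numpy as np

def rotary_embed(x, base=10000.0):
    """Toy rotary positional embedding: rotate each pair of dimensions in a
    query/key vector by an angle proportional to the token's position."""
    seq_len, dim = x.shape
    half = dim // 2
    freqs = base ** (-np.arange(half) / half)      # one frequency per dimension pair
    angles = np.outer(np.arange(seq_len), freqs)   # (seq_len, half): position * frequency
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, :half], x[:, half:]
    return np.concatenate([x1 * cos - x2 * sin,    # standard 2-D rotation applied pairwise
                           x1 * sin + x2 * cos], axis=-1)

rng = np.random.default_rng(0)
q = rng.normal(size=(4, 16))      # 4 tokens, 16-dim query vectors
print(rotary_embed(q).shape)      # (4, 16) -- same shape, position now baked in
```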

Cohere trained Aya 23 on a multilingual dataset, likewise named Aya, that the company open-sourced earlier this year. The dataset comprises 513 million prompts and corresponding answers for large language models across 114 languages. It was developed through an open-source initiative with contributions from approximately 3,000 collaborators.

Additionally, as part of the initiative, Cohere released Aya-101, a large language model capable of understanding 101 languages. The company says its new Aya-23-35B model outperformed that earlier algorithm in a series of internal evaluations and proved more capable than other open-source LLMs at multilingual text-processing tasks.