News | PaLM RLHF: An Open-source Alternative to ChatGPT?

PaLM RLHF: An Open-source Alternative to ChatGPT?

Published by: Insights Desk Released: Jan 03, 2023 Source: DemandTalk

Highlights:

ChatGPT and PaLM RLHF share a secret ingredient in Reinforcement Learning with Human Feedback, an approach designed to align better language models with what users want them to achieve.
The size of PaLM is 540 billion parameters, where “parameters” refers to the pieces of the language model learned from training data.

Recently, Philip Wang, the developer who reverse-engineered closed-source AI systems like Meta’s Make-A-Video, released PaLM RLHF, a text-generating model that works like ChatGPT. The system uses a large language model from Google called PaLM and a technique called Reinforcement Learning with Human Feedback, or RLHF, to make a system that can do almost everything ChatGPT can, like write emails and suggest computer code.

Like ChatGPT, PaLM RLHF is mainly a way to predict words using statistics. When PaLM RLHF is given a considerable amount of training data, like posts from Reddit, news articles, and e-books, it learns how likely words appear based on patterns like the meaning of the text around them.

ChatGPT and PaLM RLHF share a secret ingredient in Reinforcement Learning with Human Feedback, an approach designed to align better language models with what users want them to achieve. RLHF entails training a language model — in PaLM RLHF’s case, PaLM — and fine-tuning it using a dataset with prompts and what human volunteers expect the model to say.

The prompts are then put into the fine-tuned model. This creates several responses, ranked from best to worst by volunteers. Lastly, the rankings are used to train a “reward model” that ranks the original model’s responses in order of preference to find the best answers to a given prompt.

The size of PaLM is 540 billion parameters, where “parameters” refers to the pieces of the language model learned from training data. A 2020 study estimated that developing a text-generating model with 1.5 billion parameters might cost USD 1.6 million.

For instance, it took three months and 384 Nvidia A100 GPUs to train the 176 billion-parameter open-source model Bloom; where a single A100 GPU costs thousands of dollars.

In a LinkedIn post regarding PaLM RLHF, Sebastian Raschka, an AI researcher, notes that scaling up the required dev workflows could prove difficult. “Even if someone provides you with 500 GPUs to train this model, you still need to have to deal with infrastructure and have a software framework that can handle that. It’s obviously possible, but it’s a big effort now (of course, we are developing frameworks to make that simpler, but it’s still not trivial, yet)”, he said.

PaLM RLHF might not replace ChatGPT right now — unless a well-funded venture (or person) bothers to train and make it accessible to the public.

In other news, several additional projects to copy ChatGPT are developing quickly, including one run by the research team CarperAI. The first ChatGPT-like AI model that has been trained with human feedback will be made available by CarperAI in collaboration with the open AI research organisation EleutherAI, the firms Scale AI and Hugging Face.

ai governance for the enterprise...

empower ai and real-time insights at the edge...

power ai and analytics workloads with performance,...

how to choose the right ai foundation model...

pros enterprise ai for the industrial industries (...

unlocking ai’s potential: challenges and opportu...

transforming procurement with ai: opportunities, c...

adobe acrobat ai assistant: reinventing productivi...

adobe acrobat ai assistant: reinventing productivi...

ai, automation, and the strategic cao...

an introduction to ai in customer service...

5 ways ai can transform your customer experience...

ciso guide to generative ai attacks...

10 reasons to hire a customer-led voice assistant...

10 reasons to hire a customer-led voice assistant...

the definitive buying guide for contact center her...

cfo's guide to ai...

discover the future of business innovation with ge...

preparing for the future of cx by harnessing the p...

tableau gpt: innovate for the future with generati...

profitable ai-powered data management solutions to...

business-centric cognitive architecture revolution...

ai use cases – innovations for business success...

the role of ai in software development...

ai in cybersecurity – your digital guardian...

how chatbot marketing supports today’s business ...

advanced adaptive ai bolsters business intelligenc...

the dynamic impact of ai in procurement...

ai in customer service – revealing common applic...

how to use dall-e for marketing success...

rpa vs ai: a comparative analysis for business aut...

maximizing business efficiency through ai integrat...

7 trendiest ai marketing campaigns igniting commer...

liquid neural network unveiling the fluid intellig...

the art of prompt engineering in general & marketi...

what is amazon bedrock?...

decode data like never before: chatgpt for data an...

workforce planning models –the power of ai skil...

black friday and the impact of ai in e-commerce...

how digital brain is a game changer for business s...

microsoft introduces bing generative search in lim...

cytoreason raises usd 80 m in the funding round in...

google unveils a suite of new features for ai apps...

kindo reels in usd 20.6 m and acquires whiterabbit...

microsoft’s spreadsheetllm enhances ai’s compr...

herculesai raises usd 26 m to develop and expand i...

intel capital leads usd 15 m investment in ai cons...

aws unveils app studio to accelerate app developme...

captions llc raises usd 60 m for generative video ...

enso technologies secures usd 6 m for smb-focused ...

hebbia raises usd 130 m to develop data search pla...

meta releases four open-source language models...

harvey is reportedly raising usd 100 m at usd 1.5 ...

cloudflare introduces a new no-code feature to pre...

redactive raises usd 7.5 m to expand headcount and...

rapid7 acquires noetic cyber to help businesses fi...

runway ai aims for usd 450 m amid ai startup inter...

gen ai coding assistant startup magic ai aims to r...

anthropic introduces new program to fund enhanced ...

meta to open-source meta llm compiler for code opt...

role of machine learning in networking...

PaLM RLHF: An Open-source Alternative to ChatGPT?

Insights Desk

Related posts

Microsoft Introduces Bing Generative Search in Lim...

CytoReason Raises USD 80 M in the Funding Round In...

Google Unveils a Suite of New Features for AI Apps...

Kindo Reels in USD 20.6 M and Acquires WhiteRabbit...

Microsoft’s SpreadsheetLLM Enhances AI’s Compr...

HerculesAI Raises USD 26 M to Develop and Expand i...

Intel Capital Leads USD 15 M Investment in AI Cons...

AWS Unveils App Studio to Accelerate App Developme...

Captions LLC Raises USD 60 M for Generative Video ...

Enso Technologies Secures USD 6 M for SMB-focused ...

Our Brands