Highlights:

  • According to OpenAI’s researchers, GPT-2’s neuron-based architecture makes it possible to break the model down into parts that can be examined individually.
  • The researchers say the work may one day improve LLM performance by minimizing drawbacks such as bias and toxicity.

ChatGPT’s creator, OpenAI LP, is developing a tool that it says will eventually enable it to understand which components of a large language model are responsible for its behavior.

Although the tool is incomplete, the company has open-sourced the code and made it accessible on GitHub so that others can examine and improve it.

OpenAI explained in a blog post that LLMs are often compared to a “black box”: it is difficult to understand why a generative artificial intelligence model responds the way it does to particular inputs. Its “interpretability research” aims to better understand the factors that influence LLM behavior.

Researchers at OpenAI stated, “Language models have become more capable and more broadly deployed, but our understanding of how they work internally is still very limited. For example, it might be difficult to detect from their outputs whether they use biased heuristics or engage in deception.”

Ironically, OpenAI’s new tool depends on an LLM to attempt to determine how certain parts of other, less complex LLMs function. In their research, OpenAI tried to understand one of its predecessors, GPT-2, using GPT-4, its most recent and advanced LLM.

It’s essential to first understand how LLMs function. They loosely resemble the human brain in that they are composed of numerous “neurons,” each of which picks up on a particular pattern in text and influences how the model responds to a given input. For example, if a model is asked which superheroes have the best superpowers, a neuron attuned to Marvel superheroes may boost the likelihood that the LLM names characters from the Marvel comic and film universe.
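As a rough, hands-on illustration of what a “neuron” means here, the sketch below records the activations of a single MLP neuron in the public GPT-2 model using the Hugging Face transformers library. The layer and neuron indices are arbitrary placeholders, and this is not OpenAI’s own tooling; it only shows where such neurons live and how their activations can be read off.

```python
# Minimal sketch, assuming the Hugging Face implementation of GPT-2.
# Each transformer block has an MLP; its post-activation outputs are the
# "neurons" discussed in the article. LAYER and NEURON are arbitrary,
# chosen purely for illustration.
import torch
from transformers import GPT2Tokenizer, GPT2Model

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2")
model.eval()

LAYER, NEURON = 5, 123  # hypothetical indices, for illustration only
recorded = {}

def hook(module, inputs, output):
    # output has shape (batch, sequence_length, mlp_width);
    # keep the activation of one neuron at every token position.
    recorded["acts"] = output[0, :, NEURON].detach()

# The post-GELU activation module of one block's MLP.
handle = model.h[LAYER].mlp.act.register_forward_hook(hook)

text = "Spider-Man and Iron Man are Marvel superheroes."
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    model(**inputs)
handle.remove()

tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
for tok, act in zip(tokens, recorded["acts"].tolist()):
    print(f"{tok!r}: {act:.3f}")
```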

According to the researchers at OpenAI, this neuron-based architecture makes it possible to break GPT-2 down into parts that can be examined individually. The tool runs text sequences through the model and looks for cases where a particular neuron activates strongly and consistently. GPT-4 is then shown the text snippets that most excite that neuron and asked to generate a natural-language explanation of the pattern it responds to.
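A minimal sketch of that first stage might look like the following: scan a small corpus for the snippets that most strongly activate one neuron, then assemble a prompt asking a stronger model to explain the pattern. The corpus, indices, and prompt wording are illustrative assumptions rather than OpenAI’s actual pipeline or prompts.

```python
# Sketch of the "find top-activating text, then ask for an explanation"
# step. Uses the public Hugging Face GPT-2; not OpenAI's released tool.
import torch
from transformers import GPT2Tokenizer, GPT2Model

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2")
model.eval()

LAYER, NEURON = 5, 123  # hypothetical neuron, for illustration only

def token_activations(text):
    """Return (tokens, activations of one MLP neuron) for a text."""
    acts = {}
    def hook(module, inputs, output):
        acts["a"] = output[0, :, NEURON].detach()
    handle = model.h[LAYER].mlp.act.register_forward_hook(hook)
    enc = tokenizer(text, return_tensors="pt", truncation=True, max_length=64)
    with torch.no_grad():
        model(**enc)
    handle.remove()
    tokens = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])
    return tokens, acts["a"]

# Tiny illustrative corpus; a real run would scan far more text.
corpus = [
    "Spider-Man swung between the Manhattan rooftops.",
    "The recipe calls for two cups of flour and an egg.",
    "Thor and Loki argued about the throne of Asgard.",
]

# Rank snippets by their peak activation on this neuron.
scored = sorted(corpus, key=lambda t: token_activations(t)[1].max().item(),
                reverse=True)

# Assemble an explanation request (hypothetical wording, not OpenAI's prompt).
explanation_prompt = (
    "Here are text snippets that strongly excite one neuron.\n"
    + "\n".join(f"- {s}" for s in scored[:2])
    + "\nIn a short phrase, what pattern does this neuron respond to?"
)
print(explanation_prompt)  # this prompt would then be sent to GPT-4
```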

Specifically, the tool asks GPT-4 to predict how the neuron would behave on new text, using only the explanation. The accuracy of those predictions is then evaluated by comparing them with the neuron’s actual activations. According to OpenAI, this methodology lets it both explain each neuron’s behavior within the GPT-2 system and score each explanation by how well it matches what the neuron actually does when prompted.
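The scoring idea can be sketched in a few lines: compare the neuron’s real activations with the activations GPT-4 simulates from the explanation alone. A simple correlation is used below for illustration; the exact scoring procedure is defined in OpenAI’s released code and may differ.

```python
# Sketch of the scoring step: agreement between real activations and the
# activations a model simulates from the explanation. Toy numbers only.
import numpy as np

def explanation_score(actual, simulated):
    """Correlation between real and explanation-simulated activations."""
    actual = np.asarray(actual, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    if actual.std() == 0 or simulated.std() == 0:
        return 0.0
    return float(np.corrcoef(actual, simulated)[0, 1])

# Hypothetical per-token activations for one snippet.
actual_acts    = [0.1, 0.0, 2.3, 0.2, 1.9, 0.0]
simulated_acts = [0.0, 0.0, 2.0, 0.0, 2.0, 0.0]  # what GPT-4 predicted

print(f"explanation score: {explanation_score(actual_acts, simulated_acts):.2f}")
```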

The total number of neurons in GPT-2 is 307,200, and according to OpenAI’s researchers, they were able to generate an explanation for every single one of them. A database containing these explanations was then created and released as open source alongside the tool itself.
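For readers who download the released database, browsing it might look roughly like the sketch below. The file name and field names (“layer,” “neuron,” “explanation,” “score”) are hypothetical placeholders; the real schema is documented in OpenAI’s repository on GitHub.

```python
# Hypothetical sketch of browsing a local copy of the explanation database.
# File name and record fields are assumptions, not the actual schema.
import json

with open("gpt2_neuron_explanations.json") as f:  # hypothetical local file
    records = json.load(f)

# List the highest-scoring explanations first.
for rec in sorted(records, key=lambda r: r["score"], reverse=True)[:5]:
    print(f"layer {rec['layer']}, neuron {rec['neuron']}: "
          f"{rec['explanation']} (score {rec['score']:.2f})")
```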

According to OpenAI’s researchers, the research may one day contribute to improving LLM performance by minimizing drawbacks like “toxicity” or “bias.” The team behind it acknowledged it would take some time before the tool is actually useful for this purpose.

The results show, however, that the tool was able to explain with high confidence only roughly 1,000 of GPT-2’s neurons, about 0.3% of the total. Much work remains to better understand and predict the behavior of the remaining 306,000 neurons.

OpenAI stated that there is much room for advancement in its studies. For instance, even though the work concentrated on brief explanations in natural language, it acknowledged that some neurons may exhibit considerably more sophisticated behavior that is difficult to sum up in such a brief manner. The researchers mentioned, “For example, neurons could be highly polysemantic (representing many distinct concepts) or could represent single concepts that humans don’t understand or have words for.”

One of OpenAI’s stated objectives is to move beyond individual neurons to identify and understand entire neural circuits that carry out more sophisticated behaviors; these circuits include neurons and the “attention heads” that interact with them. The researchers would also like to explain the mechanisms by which each neuron produces a specific behavior.

The researchers wrote, “We explained the behavior of neurons without attempting to explain the mechanisms that produce that behavior. This means that even high-scoring explanations could do very poorly on out-of-distribution texts since they are simply describing a correlation.”

OpenAI expressed excitement about its progress in employing LLMs to generate, test, and iterate on hypotheses, much as a human interpretability researcher would. However, there is still much work to be done.