Google Researchers Reveal A ChatGPT-Style AI Model That Can Guide A Robot Without Requiring Any Special Training

Published by: Insights Desk | Released: Mar 09, 2023 | Source: DemandTalk

Highlights:

Researchers from Google LLC and the Technical University of Berlin this week unveiled a robot powered by PaLM-E, a multimodal embodied visual-language model with 562 billion parameters.
PaLM-E works by observing its immediate surroundings through the robot’s camera, without relying on any preprocessed scene representation.

Researchers from Google LLC and the Technical University of Berlin this week unveiled a robot powered by Artificial Intelligence (AI) and driven by a multimodal embodied visual-language model with 562 billion parameters.

Thanks to PaLM-E, a model that integrates AI-powered vision and language for autonomous robotic control, the robot can perform a variety of tasks from human voice commands without being retrained for each one. In other words, it is a robot that can comprehend a request and then carry it out right away.

For instance, if the robot is instructed to “bring the chips from the drawer,” PaLM-E will immediately devise a plan of action based on the instruction and its field of vision. The mobile robot platform then carries out that plan autonomously, using its robotic arm.

PaLM-E works by observing its immediate surroundings through the robot’s camera, without relying on any preprocessed scene representation. It simply looks, takes in what it sees, and determines what it must do, so no one has to annotate the visual data first.

PaLM-E can respond to changes in the environment as it performs a task, according to Google’s researchers. For instance, if the robot goes to fetch the chips and someone else takes them from it and puts them on a table in the room, the robot will notice what happened, look for them, grab them, and then deliver them to the person who initially asked for them.
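The behavior described above resembles a closed control loop: the robot looks again before each step instead of committing to a fixed plan. The following is a minimal sketch of that idea in Python, not Google's implementation. The `ToyWorld` class, the `plan_next_step` rule, and the step strings are all hypothetical and exist only for illustration, and the disturbance is simplified to someone moving the chips before the first grasp succeeds.

```python
from typing import Callable, Dict, List


class ToyWorld:
    """Hypothetical stand-in for the robot's environment (illustration only)."""

    def __init__(self) -> None:
        self.chips_at = "drawer"
        self.holding = False
        self.delivered = False
        self.disturbed = False

    def observe(self) -> Dict[str, object]:
        # A real system would return camera images; here it is a small dict.
        return {
            "chips_at": self.chips_at,
            "holding": self.holding,
            "delivered": self.delivered,
        }

    def execute(self, step: str) -> None:
        if step.startswith("pick"):
            if not self.disturbed:
                # Simulated disturbance: someone moves the chips to the table
                # before the first grasp succeeds.
                self.disturbed = True
                self.chips_at = "table"
            else:
                self.holding = True
        elif step == "deliver chips":
            self.holding = False
            self.delivered = True


def plan_next_step(observation: Dict[str, object]) -> str:
    """Hypothetical planner: choose the next step from the latest observation."""
    if observation["delivered"]:
        return "done"
    if observation["holding"]:
        return "deliver chips"
    return f"pick chips from {observation['chips_at']}"


def run_closed_loop(
    observe: Callable[[], Dict[str, object]],
    plan: Callable[[Dict[str, object]], str],
    execute: Callable[[str], None],
    max_steps: int = 10,
) -> List[str]:
    """Re-observe before every step so the plan can adapt to scene changes."""
    log: List[str] = []
    for _ in range(max_steps):
        step = plan(observe())
        if step == "done":
            break
        execute(step)
        log.append(step)
    return log


world = ToyWorld()
steps = run_closed_loop(world.observe, plan_next_step, world.execute)
print(steps)
```

Because the planner is consulted after every observation, the second step targets the table rather than the drawer, which is the kind of recovery the researchers describe.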

PaLM-E is based on the existing PaLM large language model, to which it adds sensory data and robotic control, and is therefore called an “embodied visual-language model.” It operates by making ongoing observations of its surroundings and encoding this data into a series of vectors, much as it encodes words as “language tokens.” This enables it to comprehend sensory data in the same way it understands vocal commands.
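The encoding step can be illustrated with a toy sketch: sensor features are mapped into the same vector space as word embeddings, and both are placed in one input sequence. This is an assumption-laden illustration in plain Python, not the actual PaLM-E architecture. The four-dimensional embeddings, the tiny vocabulary, the linear projection weights, and every function name below are hypothetical.

```python
from typing import Dict, List

Vector = List[float]

# Hypothetical toy vocabulary with 4-dimensional embeddings (illustration only).
EMBED_DIM = 4
TOKEN_EMBEDDINGS: Dict[str, Vector] = {
    "bring": [0.1, 0.0, 0.2, 0.0],
    "the": [0.0, 0.1, 0.0, 0.0],
    "chips": [0.3, 0.2, 0.0, 0.1],
}


def embed_text(words: List[str]) -> List[Vector]:
    """Look up one embedding vector per word."""
    return [TOKEN_EMBEDDINGS[word] for word in words]


def project_observation(features: Vector, weights: List[Vector]) -> Vector:
    """Linear map from sensor-feature space into the language embedding space."""
    return [sum(w * f for w, f in zip(row, features)) for row in weights]


def build_input_sequence(
    words: List[str],
    observation_features: List[Vector],
    weights: List[Vector],
) -> List[Vector]:
    """Put projected observation vectors and word vectors in one sequence."""
    observation_vectors = [
        project_observation(features, weights) for features in observation_features
    ]
    return observation_vectors + embed_text(words)


# Assumed 2-dimensional camera features and a 4x2 projection matrix.
projection = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
sequence = build_input_sequence(["bring", "the", "chips"], [[0.5, 0.25]], projection)
print(len(sequence), sequence[0])
```

Every element of the resulting sequence has the same dimensionality, so a language model downstream could, in principle, treat observation vectors and word vectors uniformly.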

According to the researchers, PaLM-E exhibits “positive transfer”: knowledge and skills learned on one task carry over to others, so it outperforms single-task robot models. It also exhibits “multimodal chain-of-thought reasoning,” meaning it can reason over a series of inputs that mixes language and visual information, and “multi-image inference,” meaning it can use multiple images to make an inference or prediction.

Overall, PaLM-E represents a significant advance in autonomous robotics. Google stated that its next steps would be to investigate other applications in practical contexts like home automation and industrial robotics. The researchers also hoped their work would stimulate additional investigation into embodied AI and multimodal reasoning.
