Google Enhances Bard AI Chatbot with Gemini Pro

Published by: Insights Desk | Released: Feb 02, 2024 | Source: DemandTalk

Highlights:

The Gemini model is available in three sizes: the Pro, a smaller Nano tailored for Pixel phones and mobile devices, and an exceptionally potent Ultra, specifically crafted for enterprise services.
MusicFX harnesses Google’s MusicLM AI model, adept at generating high-fidelity musical tracks from user text prompts or interpreting a melody that the user hums.

Google LLC recently announced a full upgrade of its Bard artificial intelligence chatbot, integrating Gemini Pro, its most potent large language model (LLM). Additionally, Bard now offers image generation powered by Google's Imagen 2 model.

The company has also unveiled ImageFX, a novel image generation tool, alongside an enhancement to MusicFX, an experimental AI model that converts text into music.

Gemini Pro has been accessible in Bard since December, albeit limited to a select subset of English-speaking users. This update will see the global rollout of Gemini Pro, extending its availability to users in over 40 languages across more than 230 countries and territories.

Gemini is Google’s most robust LLM, offering advanced text generation, question answering, document summarization, conversational reasoning, and coding capabilities. The Gemini model is available in three sizes: the Pro, a smaller Nano tailored for Pixel phones and mobile devices, and an exceptionally potent Ultra, specifically crafted for enterprise services.

In addition to the upgrade, Google Bard will gain the ability to generate images from text prompts, courtesy of the Imagen 2 text-to-image model. Imagen 2 is the second iteration of the Imagen model, which Google introduced in May 2022.

Incorporating image-generating capabilities into Bard enables it to produce vivid, imaginative, and photorealistic images based on user text descriptions. This enhancement aligns Bard with Microsoft Corp.’s Bing Chat, which utilizes OpenAI’s DALL-E 3 to generate pictures from user conversations.

“Just type in a description — like ‘create an image of a dog riding a surfboard’ — and Bard will generate custom, wide-ranging visuals to help bring your idea to life,” Bard’s Product Lead, Jack Krawczyk, stated in the release.

To facilitate the secure sharing of artwork generated by Bard, all graphics will be watermarked using SynthID, a tool devised by Google DeepMind researchers for identifying AI-generated images. SynthID watermarks are invisible to the human eye but can be easily detected by computer-assisted tools.

Google’s latest standalone ImageFX tool, fueled by Imagen 2, has been integrated into the company’s AI Test Kitchen. This platform grants public access to experimental AI tools developed by Google. Google has also refreshed MusicFX, an AI model that transforms text into music, enabling users to create songs.

ImageFX functions similarly to other generative AI artwork creation tools, enabling users to input simple text prompts to generate images. Users can then continue to modify these images by providing additional prompts.

Kristin Yim, Product Manager at Google Labs, stated, “People often discover new ideas through testing a range of prompts and concepts as they iterate. To spur further creativity, ImageFX includes a prompt interface featuring ‘expressive chips’ that let you quickly experiment with adjacent dimensions of your creation and ideas.”

MusicFX harnesses Google’s MusicLM AI model, adept at generating high-fidelity musical tracks from user text prompts or interpreting a melody that the user hums. Google debuted the text-to-music experiment last year, and since its introduction, users have generated over 10 million tracks. The text-to-music feature has been enhanced to enable the creation of 70-second music loops. Moreover, users can now utilize “expressive chips” for exploratory prompts, facilitating the iteration of generated music.

Yim said, “With feedback and improvements to our underlying MusicLM model, we’re enabling new capabilities like higher-quality audio and faster music generation.”

Google stated that both ImageFX and MusicFX use SynthID to watermark their outputs. This ensures that artwork and songs generated by these tools can be identified as AI-generated.
