Highlights:

  • Microsoft says VALL-E can simulate a voice from a sample only a few seconds long, sounding more natural than any previous AI model.
  • The concern is that as AI improves, so do audio deepfakes, and with them the potential for harm.

On January 10, Microsoft Corp. previewed a text-to-speech artificial intelligence (AI) tool called VALL-E. It can simulate a voice after listening to an audio sample of just three seconds.

The company stated that the tool can preserve the speaker’s emotional tone throughout the rest of the message while reproducing the acoustics of the room in which the original sample was recorded. According to Microsoft, no other AI model can produce speech this natural from such a short sample.

Voice simulation itself is not new. Tools that mimic human voices have existed for years, and they have not always been used for the best of reasons. The concern is that as AI improves, so do audio deepfakes, and with them the potential for harm.

For now there are no independent reviews of the tool, since Microsoft hasn’t released it to the public, though it has shared samples of its output. If those samples are representative, a tool that needs only three seconds of mimicry to let the copied voice speak for any length of time would be remarkable.

If VALL-E is as good as Microsoft says and can sound fully human, emotions and all, it is easy to see why the company wants to invest in ChatGPT, the hugely popular AI that has taken the world by storm. If VALL-E and ChatGPT were combined, people calling a call center might not be able to tell a human from a robot. Such a pairing could even produce something like a podcast, but without a real guest.

Any threats?

Yes. In the wrong hands, a tool this powerful could be used to spread misinformation by mimicking the voices of politicians, journalists, and celebrities.

Experts’ Talk

Microsoft said in its paper, “Since VALL-E could synthesize speech that maintains speaker identity, it may carry potential risks in misuse of the model, such as spoofing voice identification or impersonating a specific speaker. To mitigate such risks, it is possible to build a detection model to discriminate whether an audio clip was synthesized by VALL-E. We will also put Microsoft AI Principles into practice when further developing the models.”
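To make that mitigation concrete: a detection model of the kind the paper mentions is, at its core, a binary classifier trained to tell genuine recordings from synthesized ones. The sketch below is a minimal, hypothetical illustration, not Microsoft’s actual detector; it assumes labeled real and synthesized clips are available (toy random arrays stand in for them here) and uses simple band-averaged spectral features with logistic regression.

```python
# Hypothetical sketch of a synthesized-speech detector. This is NOT
# Microsoft's detection model; it only illustrates the general idea of
# a binary real-vs-synthetic audio classifier.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

def spectral_features(clip: np.ndarray, n_bands: int = 32) -> np.ndarray:
    """Average log-magnitude of the spectrum in n_bands frequency bands."""
    spectrum = np.abs(np.fft.rfft(clip))
    bands = np.array_split(spectrum, n_bands)
    return np.log1p(np.array([b.mean() for b in bands]))

# Toy stand-in data: in practice, load genuine recordings (label 0)
# and synthesized clips such as VALL-E outputs (label 1).
rng = np.random.default_rng(0)
real = [rng.normal(size=16000) for _ in range(200)]
fake = [rng.normal(size=16000) * 0.8 + 0.1 for _ in range(200)]

X = np.stack([spectral_features(c) for c in real + fake])
y = np.array([0] * len(real) + [1] * len(fake))

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(f"held-out accuracy: {clf.score(X_test, y_test):.2f}")
```

A production detector would be trained on actual VALL-E outputs and far richer features, but the overall shape is the same: audio features in, a real-or-synthetic label out.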