- NeMo Retriever is a new microservice within Nvidia's suite of frameworks and tools for developing, customizing, and deploying generative AI models.
- NeMo Retriever lets enterprises feed current data into a large language model (LLM) from sources such as databases, HTML, PDFs, images, and videos.
Nvidia Corp. has announced a new generative artificial intelligence microservice that lets enterprises connect custom chatbots, copilots, and AI summarization tools to real-time proprietary company data for more accurate results.
The newly introduced NeMo Retriever, a component of Nvidia's NeMo cloud-native framework and toolset, supports the development, customization, and deployment of generative AI models. The service is designed to let enterprise organizations add retrieval-augmented generation capabilities to their generative AI applications.
Retrieval-augmented generation (RAG) is a technique that enhances the precision and reliability of generative AI models. It achieves this by supplementing the inherent "knowledge" gaps in large language models with facts and data retrieved from external sources. Initially, a large language model undergoes comprehensive training to acquire general task knowledge and capabilities, encompassing understanding conversational prompts, summarization, and question-and-answer functionalities. Given the expensive and time-consuming nature of training, it is typically performed only once, or infrequently, to prepare the model for deployment.
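The retrieve-then-generate loop described above can be sketched in a few lines. This is an illustrative toy, not the NeMo Retriever API: retrieval here is plain word-overlap scoring, whereas production systems use embedding models and vector databases, and the document snippets are invented examples.

```python
import re

def tokenize(text: str) -> set[str]:
    """Lowercase a string and split it into a set of words."""
    return set(re.findall(r"\w+", text.lower()))

def retrieve(query: str, documents: list[str], k: int = 1) -> list[str]:
    """Return the k documents sharing the most words with the query."""
    q = tokenize(query)
    ranked = sorted(documents, key=lambda d: len(q & tokenize(d)), reverse=True)
    return ranked[:k]

def build_prompt(query: str, documents: list[str]) -> str:
    """Prepend retrieved facts so the model answers from current data."""
    context = "\n".join(retrieve(query, documents))
    return f"Context:\n{context}\n\nQuestion: {query}"

# Hypothetical proprietary snippets an enterprise might index.
docs = [
    "Q3 revenue grew 12% year over year.",
    "The office relocates to Austin in January.",
]
print(build_prompt("What was revenue growth in Q3?", docs))
```

Because the retrieved context is assembled at query time, the facts handed to the model can change without retraining it, which is the core appeal of RAG.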
Nevertheless, once in operation, the model lacks real-time information and the latest domain-specific knowledge, potentially resulting in inaccuracies and "hallucinations" — instances in which a large language model responds confidently but incorrectly to a question.
With NeMo Retriever, current data can be integrated into an LLM from various sources, such as databases, HTML, PDFs, images, videos, and other formats. Consequently, the model gains a comprehensive collection of facts sourced from the enterprise customer’s proprietary data, ensuring updates as new information emerges. This data can be stored anywhere, including cloud environments, data centers, or on-premises, and accessed securely.
Ian Buck, vice president of hyperscale and high-performance computing at Nvidia, said, "This is the holy grail for chatbots across the enterprise because the vast majority of useful data is the proprietary data that is not the publicly available data embedded inside of these models but what is available inside companies. So, combining AI with a customer's database makes it more productive, more accurate, more useful and lets customers optimize models' capabilities."
Integrating proprietary data minimizes inaccurate answers because the LLM gains better contextual information for generating results. Much as research papers cite their sources, Retriever's RAG capability supplements the model with expert information drawn from a company's internal domain-specific knowledge, equipping the LLM to give more precise and accurate responses to posed questions.
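The research-paper analogy can be made concrete with a toy sketch: each retrieved snippet is tagged with its source document so an answer can point back to the company's internal material. The file names and policy text below are invented examples, not part of any Nvidia product.

```python
def cite_context(snippets: dict[str, str]) -> str:
    """Format source-tagged snippets, citation style."""
    return "\n".join(f"[{name}] {text}" for name, text in snippets.items())

# Hypothetical internal documents with their retrieved excerpts.
retrieved = {
    "hr-policy.pdf": "Employees accrue 20 vacation days per year.",
    "benefits-faq.html": "Up to 5 unused days roll over each year.",
}
prompt = (
    "Answer using only the cited sources below.\n"
    + cite_context(retrieved)
    + "\n\nQuestion: How many unused vacation days roll over?"
)
print(prompt)
```

Tagging context this way also lets the application surface which internal document an answer came from, which helps users verify the model's claims.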
In contrast to community-driven open-source RAG toolkits, Nvidia emphasizes that Retriever is specifically crafted to support commercial, production-ready generative AI models. These models are pre-optimized for RAG capabilities and come with enterprise support and managed security patches.
Nvidia is already collaborating with enterprise clients like Dropbox Inc., SAP SE, ServiceNow Inc., electronics systems designer Cadence Design Systems Inc., and others to leverage the new feature to integrate RAG into their custom generative AI tools, apps, and services.
According to Anirudh Devgan, President and CEO of Cadence, the company’s researchers are collaborating with Nvidia to use Retriever to improve accuracy and help make higher-quality electronics. Devgan said, “Generative AI introduces innovative approaches to address customer needs, such as tools to uncover potential flaws early in the design process.”
According to Buck, Retriever enables customers to obtain more accurate results from generative AI models with less training time. This streamlines the process for enterprise customers, allowing them to deploy off-the-shelf models and use internal data without the extensive time, cost, and effort traditionally required to keep a model current.
NeMo Retriever will bring these RAG capabilities to Nvidia AI Enterprise, the company's end-to-end cloud-native software platform for simplifying AI application development. Developers can now register for early access to NeMo Retriever.