Patronus AI Secures USD 17M for Its AI Reliability Testing Platform

Published by: Insights Desk Released: May 23, 2024 Source: DemandTalk

Highlights:

Patronus AI’s software employs AI to automatically create adversarial prompts, which test an LLM’s reliability by attempting to elicit unwanted output.
One of the focuses of Patronus AI’s platform is enhancing the reliability of LLMs equipped with RAG, or retrieval-augmented generation, capabilities.

Recently, a startup specializing in assisting companies with identifying and resolving reliability issues in their large language models, Patronus AI Inc., has secured a USD 17 million investment.

Notable Capital spearheaded the Series A funding round, with participation from publicly traded observability provider Datadog Inc., Lightspeed Venture Partners, Factorial Capital, and several angel investors from the tech industry. This injection of funds raises Patronus AI’s total external financing to USD 20 million.

Inaccurate information in prompt responses is just one of the risks companies need to address before deploying a large language model (LLM) to production. User prompts can sometimes lead the model to generate copyrighted material or reveal sensitive business data. More subdued problems also need to be addressed, like when an LLM output doesn’t follow a company’s text style guidelines.

San Francisco-based Patronus AI has created a platform designed to assist developers in tackling these challenges. The company claims that its software utilizes AI to automatically create adversarial prompts, which are prompts that test an LLM’s reliability by attempting to deceive it into producing undesired outputs.

The platform also offers prepackaged reliability evaluations developed by the company. Additionally, Patronus AI has integrated a dashboard that visualizes the results of these reliability tests using charts. For instance, if an evaluation includes 100 prompts intended to test the accuracy of an LLM’s responses, the dashboard can show how many of those prompts were handled incorrectly.

“Model hallucinations and safety risks are here to stay. What enterprises need is transparency into model performance and accuracy in order to circumvent risks. For the first time, we’re giving companies a way to truly understand what they are working with so they can deploy LLMs with confidence,” said Chief Executive Anand Kannappan.

One of the use cases Patronus AI aims to address with its platform is enhancing the reliability of LLMs equipped with RAG (retrieval-augmented generation) features. Standard language models generate responses solely based on information from their training datasets. In contrast, an RAG-enabled LLM can enhance its knowledge by accessing external data sources, thereby improving response quality.

The process of incorporating data from external sources into an LLM’s prompts involves multiple steps. According to Patronus AI, developers can use its platform to ensure these steps are executed correctly. The company claims its software delivers at least 20% better “evaluation performance” compared to competing methods.

Developers can also utilize Patronus AI to identify the most suitable LLM for a specific software project. By using the platform, an application team can test multiple models with the same set of prompts to determine which one produces the most accurate responses. The company states that its platform supports both off-the-shelf and customized LLMs.

Sometimes, a language model that performs well initially in production may become less accurate over time. This issue arises when the types of prompts users input into the LLM evolve. Patronus’ platform addresses this by providing an application programming interface that enables developers to continuously monitor and evaluate a deployed LLM for gradual declines in accuracy.

The company intends to utilize the funds from its recently disclosed financing round to bolster product development efforts. This includes expanding its AI research and engineering teams and enhancing its go-to-market efforts. The hiring drive is expected to double the company’s headcount to approximately 24 employees by the end of the year.

ai governance for the enterprise...

empower ai and real-time insights at the edge...

power ai and analytics workloads with performance,...

how to choose the right ai foundation model...

pros enterprise ai for the industrial industries (...

unlocking ai’s potential: challenges and opportu...

transforming procurement with ai: opportunities, c...

adobe acrobat ai assistant: reinventing productivi...

adobe acrobat ai assistant: reinventing productivi...

ai, automation, and the strategic cao...

an introduction to ai in customer service...

5 ways ai can transform your customer experience...

ciso guide to generative ai attacks...

10 reasons to hire a customer-led voice assistant...

10 reasons to hire a customer-led voice assistant...

the definitive buying guide for contact center her...

cfo's guide to ai...

discover the future of business innovation with ge...

preparing for the future of cx by harnessing the p...

tableau gpt: innovate for the future with generati...

profitable ai-powered data management solutions to...

business-centric cognitive architecture revolution...

ai use cases – innovations for business success...

the role of ai in software development...

ai in cybersecurity – your digital guardian...

how chatbot marketing supports today’s business ...

advanced adaptive ai bolsters business intelligenc...

the dynamic impact of ai in procurement...

ai in customer service – revealing common applic...

how to use dall-e for marketing success...

rpa vs ai: a comparative analysis for business aut...

maximizing business efficiency through ai integrat...

7 trendiest ai marketing campaigns igniting commer...

liquid neural network unveiling the fluid intellig...

the art of prompt engineering in general & marketi...

what is amazon bedrock?...

decode data like never before: chatgpt for data an...

workforce planning models –the power of ai skil...

black friday and the impact of ai in e-commerce...

how digital brain is a game changer for business s...

microsoft introduces bing generative search in lim...

cytoreason raises usd 80 m in the funding round in...

google unveils a suite of new features for ai apps...

kindo reels in usd 20.6 m and acquires whiterabbit...

microsoft’s spreadsheetllm enhances ai’s compr...

herculesai raises usd 26 m to develop and expand i...

intel capital leads usd 15 m investment in ai cons...

aws unveils app studio to accelerate app developme...

captions llc raises usd 60 m for generative video ...

enso technologies secures usd 6 m for smb-focused ...

hebbia raises usd 130 m to develop data search pla...

meta releases four open-source language models...

harvey is reportedly raising usd 100 m at usd 1.5 ...

cloudflare introduces a new no-code feature to pre...

redactive raises usd 7.5 m to expand headcount and...

rapid7 acquires noetic cyber to help businesses fi...

runway ai aims for usd 450 m amid ai startup inter...

gen ai coding assistant startup magic ai aims to r...

anthropic introduces new program to fund enhanced ...

meta to open-source meta llm compiler for code opt...

role of machine learning in networking...

Patronus AI Secures USD 17M for Its AI Reliability Testing Platform

Highlights:

Insights Desk

Related posts

Microsoft Introduces Bing Generative Search in Lim...

CytoReason Raises USD 80 M in the Funding Round In...

Google Unveils a Suite of New Features for AI Apps...

Kindo Reels in USD 20.6 M and Acquires WhiteRabbit...

Microsoft’s SpreadsheetLLM Enhances AI’s Compr...

HerculesAI Raises USD 26 M to Develop and Expand i...

Intel Capital Leads USD 15 M Investment in AI Cons...

AWS Unveils App Studio to Accelerate App Developme...

Captions LLC Raises USD 60 M for Generative Video ...

Enso Technologies Secures USD 6 M for SMB-focused ...

Our Brands