Highlights:

  • Baseten offers a dashboard that allows developers to monitor an AI model’s infrastructure usage and relevant metrics like processing times for user requests.
  • Companies can access Baseten’s AI inference platform as a managed cloud service or deploy it within their Amazon Web Services and Google Cloud environments.

Baseten Labs Inc., a startup that simplifies the deployment of artificial intelligence models for developers, recently announced the close of a USD 40 million funding round.

IVP and Spark Capital led the investment in Baseten’s AI inference platform, with numerous existing backers also participating. As reported by Forbes, the Series B round values Baseten at over USD 200 million.

Executing inference, which involves running AI models in production, often demands considerable time and resources. Developers must set up infrastructure that can absorb sudden surges in traffic to the model. They must also ensure that the AI responds to user prompts promptly, avoid cloud cost overruns, and complete dozens of other technical tasks.

Baseten, headquartered in San Francisco, is dedicated to simplifying this process. The platform provided by Baseten automates numerous tasks associated with managing production AI environments, beginning with the deployment of the initial model.

Once developers train a new neural network, they must package it into a format compatible with their organization’s cloud infrastructure before it can be deployed. To expedite this process, Baseten has created an open-source tool named Truss. The startup says that, with the tool’s help, AI models can be deployed on its platform in just a few lines of code.
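To make the packaging step concrete, here is a minimal sketch of the kind of model wrapper Truss works with, based on its documented load/predict pattern. The toy keyword-scoring “model” is purely illustrative; a real deployment would load actual weights in `load()`.

```python
# Sketch of a Truss-style model wrapper (conventionally model/model.py).
# Truss wraps a model in a class exposing load() and predict() hooks;
# the keyword-weight "model" below is a stand-in for real weights.

class Model:
    def __init__(self, **kwargs):
        # Truss passes configuration via keyword arguments; unused here.
        self._model = None

    def load(self):
        # Called once at startup: load weights here.
        # Stand-in for e.g. loading a checkpoint or an HF pipeline.
        self._model = {"great": 1.0, "terrible": -1.0}

    def predict(self, model_input: dict) -> dict:
        # Called per request with the deserialized request body.
        text = model_input.get("text", "").lower()
        score = sum(w for k, w in self._model.items() if k in text)
        return {"sentiment": score}
```

With a wrapper like this in place, deployment in recent Truss releases comes down to a single CLI command such as `truss push`, which matches the “few lines of code” claim above.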

After a neural network has been deployed in production, Baseten’s autoscaling engine works to keep user requests processed promptly. An AI model can be flooded with requests abruptly, causing response times to balloon. When the engine detects a spike in utilization, it spins up replicas of the AI model to absorb the extra traffic.

It also automatically removes replicas once user activity returns to normal. The company says its platform lets developers implement a “scale to zero” approach, in which an AI workload shuts down entirely when inactive, helping to avoid unnecessary cloud costs.
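The scaling behavior described above can be sketched as a simple replica-count policy. The function name, thresholds, and parameters here are hypothetical, not Baseten’s actual engine; they only illustrate how replicas track load and how an idle workload scales to zero.

```python
# Illustrative autoscaling policy: replicas scale with in-flight
# requests, capped at a maximum, and drop to zero when idle.
# All names and thresholds are hypothetical.

import math


def desired_replicas(in_flight_requests: int,
                     target_per_replica: int = 8,
                     max_replicas: int = 20) -> int:
    """Return how many model replicas should be running."""
    if in_flight_requests == 0:
        # "Scale to zero": shut the workload down entirely when inactive.
        return 0
    # Add replicas as traffic grows, up to a hard ceiling.
    needed = math.ceil(in_flight_requests / target_per_replica)
    return min(needed, max_replicas)
```

A real engine would add hysteresis (cooldown windows, smoothed utilization) so replicas are not thrashed by brief spikes, but the core trade-off, responsiveness versus idle cost, is captured by this shape.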

Baseten offers a dashboard that allows developers to monitor an AI model’s infrastructure usage and relevant metrics like processing times for user requests. A complementary observability tool simplifies the process of troubleshooting technical issues.
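Dashboards like the one described typically summarize per-request processing times as percentiles. As a hypothetical sketch of that kind of metric, here is a nearest-rank percentile over recorded latencies; it is not Baseten’s implementation.

```python
# Hypothetical latency summary of the sort a monitoring dashboard
# might display: nearest-rank percentiles over request durations.

import math


def latency_percentile(samples: list, p: float) -> float:
    """Nearest-rank p-th percentile of request latencies (seconds)."""
    if not samples:
        raise ValueError("no latency samples recorded")
    ordered = sorted(samples)
    # Nearest-rank method: rank = ceil(p/100 * n), 1-indexed.
    rank = max(1, math.ceil(p / 100 * len(ordered)))
    return ordered[rank - 1]
```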

In a blog post, Chief Executive Officer Tuhin Srivastava wrote, “Our native workflows serve large models in production, so users don’t need to think about version management, roll-out, and observability. In 2023, we scaled inference loads hundreds of times over without a minute of downtime.”

Businesses can deploy Baseten’s platform in their own Amazon Web Services and Google Cloud environments or use it as a managed cloud service. Per Forbes, nearly twenty large organizations and tens of thousands of developers use the software maker’s platform. The company reportedly generates annual revenue in the “mid-single-digit millions.”

To extend its customer reach, the company aims to nearly double its 25-person sales and marketing team by the end of the year. The newly closed round will also fund product development: Baseten plans to add support for additional cloud platforms, build tools that optimize the performance of customers’ AI models, and simplify the training process.