How Poppins empowered product teams to ship high quality AI features with Basalt

Poppins used Basalt to empower product teams to handle prompting, analyse logs, and iterate 2x faster on their product. Using the prompt prototyping and AI powered evals, they were able to go from draft to production in a few days.

5000

Evaluations

180+

Test in workbench

15

Prompts in prod

149

Prompt versions

Basalt customer story

Published on

11 Jan 2022

How Poppins uses AI

LLM features are becoming core in their product, allowing users to seamlessly create a new listing and empowering content moderators to root out frauds. With Basalt, they iterate on prompt quality, monitor performance, and test models to optimize both outcomes and efficiency.

‍

How Basalt helped them accelerate

Before adopting Basalt, the team struggled with uncertainty in their prompt engineering process

"Some prompts generated good results, some didn't... but we had no idea why, or how to systematically improve them," explains Loick, cofounder and CTO.

This lack of visibility made it challenging to maintain consistent quality across their AI features. Now, armed with detailed analytics and testing capabilities, they iterate continuously with concrete, data-driven feedback. A prime example of their improved process can be seen in their content moderation feature. After carefully optimizing the prompt structure and parameters with Basalt's testing framework, they've achieved remarkable results: zero customer complaints about moderation decisions. Nada. 🎉

With Basalt, they shifted AI ownership towards product teams and prompts are directly managed by PMs.

AI in our app is made of product rules. So it should be owned by product

What’s next on the AI roadmap

☑️ Test multiple models per prompt to reduce both $ cost and carbon footprint.

☑️ Evaluate prompt quality even outside the app (e.g. in tools like Cursor).

☑️ Integrate Basalt into more product workflows, continuously.

‍

Stats

5000

Evaluations

Extensive use of LLM-as-a-judge evaluators

180+

Test in workbench

Multiple batch runs accros hundreds of test cases

15

Prompts in prod

Multiple AI prompts in production in less than a month

149

Prompt versions

Heavy iteration on prompts to reach high quality standards

Go beyond in evaluating your prompts with Basalt

The platform to integrate AI in your product in secondsGo from idea to ready-to-deploy prompts

Start for free

Craft and prototype

Generate prompt with AI Copilot and prototype AI features, effortlessly

Test and evaluate

Create test cases, run evaluations, and score your prompts with confidence

Deploy

Effortlessly deploy your features to production and keep track of them using the Basalt SDK

Monitor

Keep track of your feature's performance and detect issues with detailed logs and traces

Go beyond in evaluating your prompts with Basalt

Craft and prototype

Test and evaluate

Deploy

Monitor

Go beyond in evaluating your prompts with Basalt