Thursday May 28, 2026 14:45 - 15:15 CEST
Coding Agentic Systems is fun — it feels like magic!
Vibe-coding Agentic Systems is even more fun: double magic, hah!?

But quality evaluation of LLM generations? That part is usually… boring.
And if you’re not in the Python ecosystem, good luck finding a framework that actually works for you.
So eval tasks quietly sit in our backlogs, waiting for better days, the v2 release, or some future “we’ll fix it later.”

Drawing from real experience building production RAGs, agentic pipelines, and LLM-powered features across different stacks, I'll share the lessons learned the hard way — what broke, what worked, and what we wished we'd measured from day one.
The goal of this talk is simple: make evals understandable, and the creation process easy — with Coding Agents doing the heavy lifting alongside you. Practical tips, tricks, and a fresh perspective on making evaluation a natural part of developing LLM-powered applications.
Speakers
Nail Khusainov

Staff Engineer - ML/AI, ADEO Services
12 years cooking ML systems — occasionally burning them, but hey, sometimes they turn out great!
🤖 DATA/AI ARENA 135 Rue Sadi Carnot, Ronchin, France
