Loading…

Wednesday May 27, 2026 09:00 - 09:45 CEST
Limited Capacity full
Adding this to your schedule will put you on the waitlist.
For years, the industry's motto has been simple: collect and store as much data as possible, believing that immense value will emerge from this accumulation. This vision, driven by tech giants like **GAFAM** that collect petabytes of data daily, isn't always practical for every company. At the same time, single computing instances on the cloud have become exponentially more powerful, allowing us to handle large amounts of data on a single machine.

This new way of thinking is captured by the **Small Data Manifesto**, which states that more data doesn't always equal better results, modern hardware is often underused, and data topics should be developed locally first. In the data department of Decathlon, we discovered this is more than just a theory. More than 90% of the data tables in our datalake (Decathlon's centralized data repository) was less than 50 GiB. This finding inspired us to define a new approach to data transformation.

After setting the context and introducing the Small Data Manifesto, we will present **Light Computing**, which brings these principles to life. Instead of spinning up expensive clusters to transform data, we use powerful single-node tools and only scale up when the data truly requires it. The leading tools of this field are **Polars** and **DuckDB**. We'll cover two main aspects: what technically defines these tools and what they bring in terms of concrete cost and architectural benefits.

The session will conclude with a live demonstration of Polars, showing how we can easily compute 50 GB of data on a small computing instance on Databricks.
Speakers
avatar for Arnaud Vennin

Arnaud Vennin

Data Engineer, Decathlon
Raised on the eastern plateaus of Rouen in the northwest of France, I studied primarily in my hometown, where I completed my undergraduate degree in Environmental Sciences. I spent one semester abroad in Nijmegen, a city in the Netherlands. Afterward, I obtained a master's degree... Read More →
Wednesday May 27, 2026 09:00 - 09:45 CEST
🤖 DATA/AI ARENA 135 Rue Sadi Carnot, Ronchin, France

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link