Agenda

Day 1

8:00

Registration, Ground Floor

Coffee served in Foyer

8:50

9:00

Talk by Prof. Hannes Mühleisen

Co-Creator | DuckDB

10:00

Building a Distributed Protocol

Distributed protocols are the foundation of scalable and reliable systems — yet we often get lost in implementation details instead of grounding our designs in systems thinking. This talk offers a different path: we’ll explore how a small set of simple, well-crafted abstractions gives rise to complex, distributed systems. We’ll walk through how we move from foundational ideas to working systems, how we reason across layers to build reliable systems from unreliable components, and how we ensure correctness through formal modeling and deterministic simulation testing. A talk for system thinkers and system builders who want to move beyond ad hoc solutions — toward understandable distributed protocols that power scalable and reliable distributed systems.

11:00

Coffee Break

11:30

Simplicity Is the New Black: Where Some Chase Scale for Scale’s Sake, Simplicity Is Your Competitive Edge

Lately, we’ve seen a new wave of interest in distributing in-process, in-memory systems – projects like DeepSeek’s Smallpond (distributed DuckDB) and DataFusion for Ray are getting a lot of buzz. But let’s be honest: this isn’t a new trend. For decades, “going distributed” (i.e., horizontal scale-out via partitioning) has been the go-to move when things get big, or might get big someday. But here’s the thing: just because something can be distributed doesn’t mean it should be. In this talk, we challenge the idea that “distributed” is the right default. We’ll unpack what really happens when you scale out and show how those tradeoffs can crush performance and developer sanity if you’re not careful. Instead, we’ll explore how to make big problems small and only layer on distributed strategies when it’s clearly the right solution. This isn’t a talk against distributed systems – it’s a talk about earning them. You’ll walk away with a systems-thinking mindset that helps you scale with purpose, not panic. Because sometimes, the smartest way to go big… is to stay small… until you absolutely can’t.

12:30

Lunch (60min)

1:30

Big Data and AI at the CERN LHC: Navigating the Edge of Scale and Speed for Physics Discovery

The CERN Large Hadron Collider (LHC) generates an unprecedented O(10,000) exabytes of raw data annually from high-energy proton collisions. Managing this vast data volume while adhering to computational and storage constraints requires real-time event filtering systems capable of processing millions of collisions per second. These systems, leveraging a multi-tiered architecture of FPGAs, CPUs, and GPUs, must rapidly reconstruct and analyze collision events, discarding over 98% of the data within microseconds. As the LHC transitions to its high-luminosity era (HL-LHC), these data-processing systems — operating in radiation-shielded caverns 100 meters underground — must contend with data rates comparable to 5% of global internet traffic, alongside unprecedented event complexity. Ensuring data integrity for physics discovery demands efficient machine learning (ML) algorithms optimized for real-time inference, achieving extreme throughput and ultra-low latency.

2:30

Coffee Break

3:00

Hello Systems

Sometimes you hear about the amazing escapades of systems programmers who delve into the depths of a niche subject and save the day by fixing impossible bugs and increasing performance by orders of magnitude. These are all great adventures, but usually those are not stories where we feel we could be the protagonist, because most of us do not consider ourselves "systems programmers". In this talk I will tell you a different story about systems thinking at the application level, one that could very well have anybody in this room as its protagonist. Most importantly, I will tell you a story about software written by developers for developers. In other words, a story where we are both the protagonist and, at times, even the dastardly villain.

4:00

Building Software, Simply

One of the meta values of TigerBeetle is simplicity. Simplicity is hard, but it gets you all the nice things — performance, correctness, maintainability. In this talk, we'll uncover fundamental simplicity in how software is built, tested, documented, and released — seemingly "boring" aspects, which nonetheless are a foundation for everything else.

5:00

Wrap Up

The Eye Filmmuseum remains open until midnight; attendees can stay on

7:00

Opening Night Rooftop Event

Exclusively for Premium Ticket Holders, TigerBeetle Team and Speakers

Day 2

8:00

Registration, Ground Floor

Coffee served in Foyer

8:50

Running Start

9:00

Don't Forget To Flush

Every programming language environment has an interface for streaming data. Using real-world examples of compression, files, sockets, and network protocols, this talk critically examines a variety of strategies for designing an input/output abstraction. We'll watch the data closely as it flows through the pipeline and examine how error conditions are handled. Finally, I'll draw a non-obvious connection between I/O and multithreading, and explain my evil/genius/ridiculous (take your pick) plan to put ALL I/O behind one massive interface, making a lot of people happy/angry (take your pick).

10:00

New Shared-Log Abstractions for Modern Applications

Shared logs are at the heart of many modern applications. Every cloud provider today offers a shared-log service (e.g., AWS Kinesis, Google PubSub); open-source systems like Kafka, RedPanda, and others offer shared-log functionality; many hyperscalers use shared-log services for metadata. Perhaps surprisingly, despite years of research and the ubiquity of shared logs, all existing shared logs today suffer from high latencies. Our research group at the University of Illinois has been building new abstractions and designs to address the latency challenges in shared-log services. The first abstraction, LazyLog (SOSP 24), is a new shared log better suited for applications like message queues and event-driven databases that demand low-latency ingestion. The second one, SpecLog (OSDI 25), is a new shared-log design that reduces end-to-end latencies for critical applications like high-frequency trading, intrusion detection, and fraud monitoring. In this talk, I will describe the motivation, design, and benefits of these new shared logs.

11:00

Talk by Hillel Wayne

12:00

Lunch (60min)

1:00

Lightning Talks

1:45

A Systems View to AI

Our intuition about systems thinking and system architectures has been built and refined over many decades of engineering excellence. Stacked abstractions, where each layer exposes more determinism and a higher plane of reasoning, are a prime example of this. Modern AI challenges some of these ideas. By framing AI as a system, this talk explores a few of these points of tension between traditional system design and what's needed (or even possible) in AI-native systems, with the hope of sparking some discussion in the systems community about expanding our definition of what a system is, what APIs are, and what kinds of challenges future system designers might have to deal with.

2:45

Coffee Break

3:15

Jepsen 18: Wait, Are Databases Good Now?

We trust our databases, queues, and other systems to store acknowledged writes, to serve them up later, and to isolate transactions from one another. But can we really trust them? Jepsen combines concurrent, generative tests with fault injection to measure distributed systems safety. We'll learn about Datomic, Bufstream, and TigerBeetle, and show how three unconventional systems ensure--or violate--key safety properties.

4:15

1000x: The Power of an Interface for Performance
