Data Council | Austin 2023

What's New

More Days

Data Council now spans 3 full days, with more time for amazing talks, speaker office hours and the best hallway track you’ve ever seen.

Cutting-Edge AI Tech

Hear from the top companies & startups pushing the boundaries of AI who are building and harnessing models like GPT-3, Stable Diffusion and DALL-E to solve real-world problems.

Expanded Content

Learn about the topics and themes in data infrastructure that matter. We span topics across data engineering, ML tooling, analytics, and AI that are applicable to any size company striving to be data-driven.

Inspiring Community

Ok, that’s not new, but Data Council consistently attracts the best and brightest minds of data. Network with startups, find your dream job, discover new OSS, chat with investors and be inspired by our amazing, geeky attendees.

Watch 2023 Talks

Catch up on the amazing technical talks you may have missed.

Data Contracts: Accountable Data Quality

Data Warehouses are Gilded Cages. What Comes Next?

Tino Tereshko

Co-Founder & VP, Product

MotherDuck

Co-Founder & VP, Product, MotherDuck

Writing Unit Tests for Data Science Code

Dr. Nile Wilson

Senior Data Scientist

Microsoft

Senior Data Scientist, Microsoft

Founder & CEO, MotherDuck

Publishing Jupyter Notebooks with Quarto

Hot or Not? Trends & Buzzwords in Data

How to Build a Streaming Database in Three Challenging Steps

Frank McSherry

Chief Scientist

Materialize

Chief Scientist, Materialize

Building a Control Plane for Data

Shirshanka Das

Co-Founder and CEO

Acryl Data

Co-Founder and CEO, Acryl Data

Hot Takes and Tragic Mistakes: How (not) to Integrate Data People in Your App Dev Team Workflows

Noelle Saldana

Director of Product Management, Data Science & Analytics

Director of Product Management, Data Science & Analytics,

Conversation Simulator: A Real Life Case Leveraging OpenAI's API

Maddie Schults

General Manager

Crisis Text Line

General Manager, Crisis Text Line

Designing and Building Metric Trees

Abhi Sivasailam

Founder & CEO

Levers Labs

Founder & CEO, Levers Labs

Scalable and Sustainable Feature Engineering with Hamilton

Elijah Ben Izzy

Co-founder & CTO

DAGWorks Inc.

Co-founder & CTO, DAGWorks Inc.

Designing for Intelligence at GitHub Next: Patterns and Practices for Making AI-powered Products

Idan Gazit

Senior Director of Research, GitHub Next

GitHub

Senior Director of Research, GitHub Next, GitHub

AI - The Future is Now

Idan Gazit

Senior Director of Research, GitHub Next

GitHub

Senior Director of Research, GitHub Next, GitHub

Creating Self Service, High Velocity Data Cultures

Generative AI and the Natural Language Interface for Data

Malloy - An Experimental Language for Data

Lloyd Tabb

Founder/Former CTO - Looker & Co-creator of Malloy

Meta

Founder/Former CTO - Looker & Co-creator of Malloy, Meta

Creating our own Kubernetes and Docker to run our data infrastructure

Building an Open Data Lake House Using Trino and Apache Iceberg

Matt Fuller

Co-Founder & VP, Product

Starburst

Co-Founder & VP, Product , Starburst

Behind the Curtain: What it Takes to Support the World's Most Popular Open Source Communities

Katrina Riehl

President of Board of Directors

NumFOCUS

President of Board of Directors, NumFOCUS

Generative AI for Product Builders

Tristan Zajonc

CEO & Co-Founder

Continual

CEO & Co-Founder, Continual

Creating the Right Developer Community for Your Company

Wesley Faulkner

Sr Community Manager

AWS

Sr Community Manager, AWS

Change Data Streaming Patterns With Debezium & Apache Flink

Gunnar Morling

Senior Staff Software Engineer

Decodable

Senior Staff Software Engineer, Decodable

ETL with Change Data Capture in 30 Minutes

Gunnar Morling

Senior Staff Software Engineer

Decodable

Senior Staff Software Engineer, Decodable

The Road to Exceptional Data Correctness

Emma Tang

prev Engineering Manager at Stripe

prev Engineering Manager at Stripe,

Data Products Aren't Just for Data Teams!

Katie Hindson

Head of Product & Data

Lightdash

Head of Product & Data, Lightdash

Extreme Self-Service: Turning Data Consumers into Data Constructors

Alice Leach

Data Engineer

Whatnot

Data Engineer, Whatnot

Evolving AI Laws and the Imperative to Build Safe, Compliant, and Risk-proof AI

A New Era of Applied AI: How to Accelerate Enterprise Adoption of AI for Business Impact

Gaurav Rao

EVP & GM Machine Learning and AI

AtScale

EVP & GM Machine Learning and AI, AtScale

Scaling Uber Metric System from Elasticsearch to Pinot

Yupeng Fu

Principal Software Engineer

Uber

Principal Software Engineer, Uber

The state of cross-company data exchange

How Investors Think About Data

Leigh Marie Braswell

Principal

Founders Fund

Principal, Founders Fund

Building a Business Review Program from Scratch

Katie Bauer

Head of Data

GlossGenius

Head of Data, GlossGenius

Building a Business Review Program from Scratch

Greg Johnson

Head of Business Analytics

GlossGenius

Head of Business Analytics, GlossGenius

This App Ends Tantrums: How ML, NLP, and Five Minutes of Playtime Help Parents, Caregivers, and Children Enjoy Life Together

Mady Mantha

Co-Founder & CTO

Happypillar

Co-Founder & CTO, Happypillar

Extinguishing the Garbage Fire of ML Testing

Emily Curtin

Staff MLOps Engineer

Intuit Mailchimp

Staff MLOps Engineer, Intuit Mailchimp

A deep dive into the dbt manifest

Aaron Richter, PhD

Lead Data Engineer

formerly Squarespace

Lead Data Engineer, formerly Squarespace

How to Ensure Your Model Does Not Drift? From Human-In-The-Loop Concept to Building Fully Adaptive Ml Models Using Crowdsourcing

Generative AI for Search

D. Sivakumar

Co-Founder & CEO

Tonita

Co-Founder & CEO, Tonita

HuggingFace + Ray AIR Integration: A Python Developer’s Guide to Scaling Transformers

Jules Damji

Lead Developer Advocate

Anyscale

Lead Developer Advocate, Anyscale

How Vercel Builds Dozens of Metrics from One Heterogenous Table

Thomas Mickley-Doyle

Lead, Analytics and Data Science

Vercel

Lead, Analytics and Data Science, Vercel

CDC Stream Processing with Apache Flink

Timo Walther

Principal Software Engineer @ Confluent, PMC @ Apache Flink

Confluent

Principal Software Engineer @ Confluent, PMC @ Apache Flink, Confluent

How to Be a 10x Analyst

Robert Yi

Chief Product Officer

Hyperquery

Chief Product Officer, Hyperquery

The Data Infra Behind Zillow's 3x Growth in Experiment Volume

Aaron Wroblewski

Senior Manager, Machine Learning Engineering & Science

Zillow

Senior Manager, Machine Learning Engineering & Science, Zillow

How Freewheel Processes Billions of Ad-tech Events in Real-time

Margi Dubal

Director, Data Engineering

Freewheel

Director, Data Engineering, Freewheel

Automatically Fix Data Issues & Label Errors in Most ML Datasets

Train, Deploy, and Run a ML model using Python, Snowpark and Streamlit

Ahmad Khan

Head of AI/ML Strategy

Snowflake

Head of AI/ML Strategy, Snowflake

The Story of DevRel at Snowflake: How We Got Here

Daniel Myers

Developer Relations

Snowflake

Developer Relations, Snowflake

Why People Started Testing Their Models and Data in CI / CD Pipelines

Shir Chorev

Co-Founder & CTO

Deepchecks

Co-Founder & CTO, Deepchecks

MLOps for League of Legends - Heimerdinger Toolbelt

Ian Schweer

Software Engineer

Riot Games

Software Engineer, Riot Games

Scaling Uber Metric System from Elasticsearch to Pinot

The Story of DevRel at Snowflake: How We Got Here

Felipe Hoffa

Data Cloud Advocate

Snowflake

Data Cloud Advocate, Snowflake

The Fun-Sized MLOps Stack from Scratch

Mikiko Bazeley

Head of MLOps

Featureform

Head of MLOps, Featureform

Ten years of building open source standards: From Parquet to Arrow to OpenLineage

Julien Le Dem

Principal Engineer

Datadog

Principal Engineer , Datadog

Experiencing Data as Code with Dremio Arctic and Apache Iceberg

Alex Merced

Developer Advocate

Dremio

Developer Advocate, Dremio

How Investors Think About Data

AI - The Future is Now

George Mathew

Managing Director

Insight Partners

Managing Director , Insight Partners

What I Don’t Want To Exist In The Data World In 5 Years

Ben Rogojan

Principal Consultant And Owner Of Seattle Data Guy

Seattle Data Guy

Principal Consultant And Owner Of Seattle Data Guy, Seattle Data Guy

Real-time Schema Discovery

Daniel Selans

Co-founder & CTO

Streamdal

Co-founder & CTO, Streamdal

Building an ML Experimentation Platform for Easy Reproducibility

Vino Duraisamy

Data Engineer, lakeFS Advocate

Treeverse

Data Engineer, lakeFS Advocate, Treeverse

Cubing and Metrics in SQL, oh my!

Julian Hyde

Senior Staff Engineer

Google

Senior Staff Engineer, Google

Hot or Not? Trends & Buzzwords in Data

Pedram Navid

Head of Data Engineering & DevRel

Dagster Labs

Head of Data Engineering & DevRel, Dagster Labs

The Missing Manual: Everything you need to know about Snowflake optimization

Innovating on Software Development

Hamel Husain

Machine Learning Consultant

Parlance Labs

Machine Learning Consultant, Parlance Labs

Everything I Know About Data Science I Learned From Model Railroading

Peter Lenz

Vice President of Data Science

Near

Vice President of Data Science, Near

Hierarchical Forecasting in Python

Max Mergenthaler

CEO & Co-Founder

Nixtla

CEO & Co-Founder, Nixtla

The Missing Manual: Everything you need to know about Snowflake optimization

How to Interpret & Explain Your Black-Box Models

Sophia Yang

Senior Data Scientist

Anaconda

Senior Data Scientist, Anaconda

DataOps for Business Intelligence: How "Dashboards as Code" Can Help You Develop and Validate Your Analytics

Dan Eisenberg

VP of Technology

Hashboard (formerly Glean)

VP of Technology, Hashboard (formerly Glean)

Feed The Alligators With the Lights On: How Data Engineers Can See Who Really Uses Data

Mark Grover

CEO & Co-Founder

Stemma

CEO & Co-Founder, Stemma

Building a better world with AI, one architectural drawing at a time

Using Spatial Indexes to perform geospatial analytics at massive scale

Matt Forrest

VP of Spatial Data Science

Carto

VP of Spatial Data Science, Carto

ML in Production – What Does “Production” Even Mean?

Dean Pleban

Co-Founder & CEO

Dagshub

Co-Founder & CEO, Dagshub

How to Make Marketing Fall In Love with Data Modeling

Erik Edelmann

Data + Advocacy

Hightouch

Data + Advocacy, Hightouch

URGENT! Help these Pets Find Homes: Working Across Teams in DataHub

Paul Logan

Developer Relations Lead

Acryl Data

Developer Relations Lead, Acryl Data

Building an Open Data Lake House Using Trino and Apache Iceberg

Tom Nats

Director Customer Solutions

Starburst

Director Customer Solutions , Starburst

The First Data Hire’s Guide to the Modern Data Stack

Prateek Chawla

Principal Engineer & Founding Engineer

Monte Carlo

Principal Engineer & Founding Engineer , Monte Carlo

Data Contracts in the Modern Data Stack

Zachary Klein

Software Engineer, Machine Learning & Data Platforms

Whatnot

Software Engineer, Machine Learning & Data Platforms, Whatnot

URGENT! Help these Pets Find Homes: Working Across Teams in DataHub

Maggie Hays

Community Product Manager

Acryl Data

Community Product Manager, Acryl Data

Teamwork Makes the (Open Source) Dream Work: the Power of Cross-community Collaboration

Maggie Hays

Community Product Manager

Acryl Data

Community Product Manager, Acryl Data

From 1 to IPO: Growing the Data Team and Data Culture at GitLab

Taylor Murphy

Head of Product and Data

Meltano

Head of Product and Data, Meltano

Conversation Simulator: A Real Life Case Leveraging OpenAI's API

Mateo García, PhD

Lead Data Scientist

Crisis Text Line

Lead Data Scientist, Crisis Text Line

How to Optimize Apache Spark Cloud Clusters for Cost and Runtime Goals with Sync Computing

Jeffrey Chou

Co-Founder & CEO

Sync Computing

Co-Founder & CEO, Sync Computing

How to End the Long-tail of Most Data Requests?

Ahmed Elsamadisi

Founder & CEO

Narrator

Founder & CEO, Narrator

Modern Data Management - How to Set Your Data Team Up for Success

Alec Bialosky

Business Operations

Select Star

Business Operations, Select Star

Continuous Data Pipeline for Real-time Benchmarking & Data Set Augmentation

Ivan Aguilar

Senior Data Scientist

Teleskope

Senior Data Scientist, Teleskope

How to Make Marketing Fall In Love with Data Modeling

Meredith Adler

Data Advocate / Data Engineer

Hightouch

Data Advocate / Data Engineer, Hightouch

Incident Management for Data People

Kyle Kirwan

Co-Founder & CEO

Bigeye

Co-Founder & CEO, Bigeye

Teamwork Makes the (Open Source) Dream Work: the Power of Cross-community Collaboration

Kyle Eaton

Growth Lead

Great Expectations

Growth Lead, Great Expectations

Making Moves with Arrow Data: Introducing Arrow Database Connectivity (ADBC)

Matthew Topol

Staff Software Engineer

Voltron Data

Staff Software Engineer, Voltron Data

Building a better world with AI, one architectural drawing at a time

HuggingFace + Ray AIR Integration: A Python Developer’s Guide to Scaling Transformers

Antoni Baum

Software Engineer

Anyscale Inc

Software Engineer, Anyscale Inc

Govern Your Data Clients - the Right Way to Scale

Yaniv Ben Hemo

Co-Founder & CEO

Memphis

Co-Founder & CEO, Memphis

Achieve Better Data Quality with Data Warehouse Observability

Eric Jones

Data Solution Architect

Databand

Data Solution Architect, Databand

How to Optimize Apache Spark Cloud Clusters for Cost and Runtime Goals with Sync Computing

Kartik Nagappa

Staff Product Manager

Sync Computing

Staff Product Manager, Sync Computing

Getting Real(-Time): When to move from Batch to Streaming (and how to do it without hiring an entirely new team)

Data Product Success: Aligning with Data's Core Purpose - A Framework for Data Product Management for Increasing Adoption & User Love

The End of History? Convergence of Batch and Realtime Data Technologies

Matt Housley

Co-Founder

Ternary Data

Co-Founder, Ternary Data

An Open Semantic Layer with Cube

Pavel Tiunov

CTO & Co-Founder

Cube Dev

CTO & Co-Founder , Cube Dev

Like Legos, but better: An interactive workshop on the building blocks of user segmentation with reverse ETL

Kurt Steckel

Senior Customer Data Architect

Census

Senior Customer Data Architect, Census

LLM’s & Semantic Layer: Self-serve has Entered the Chat

Paul Blankley

Co-founder & CTO

Zenlytic

Co-founder & CTO, Zenlytic

Streaming Analytics with dbt: The Fun Parts

Dennis Hume

Senior DevEx Engineer

Materialize

Senior DevEx Engineer, Materialize

Scaling Experimentation to 20 Billion Users

Timothy Chan

Head of Data

Statsig

Head of Data, Statsig

The things I wish I knew -- What I've gotten right and wrong from startups to the White House, and the world ahead

DJ Patil

Former U.S Chief Data Scientist

Former U.S Chief Data Scientist,

Latency is the Mind Killer: it’s Time to Reimagine Data Interactions

David Wilson

Co-Founder & CEO

Hunch Tools

Co-Founder & CEO, Hunch Tools

AI - The Future is Now

Gregory Larson

VP Engineering

Jasper

VP Engineering, Jasper

Hot or Not? Trends & Buzzwords in Data

AI - The Future is Now

Barry McCardel

Co-Founder & CEO

Hex

Co-Founder & CEO, Hex

Hot or Not? Trends & Buzzwords in Data

Barry McCardel

Co-Founder & CEO

Hex

Co-Founder & CEO, Hex

AI - The Future is Now

Sean Owen

Principal Specialist for Data Science and ML

Databricks

Principal Specialist for Data Science and ML, Databricks

more speakers to be announced soon ...

View all

About Our Tracks

🗓️ 2023 Event Schedule

01 Data Eng & Infrastructure

Data Engineering & Infrastructure focuses on modern data engineering workflows, storage systems, design patterns and more. Themes in this track are centered around the most important pieces of the data pipeline workflow: data-ops, data quality, ingest & ETL/ELT, monitoring, metadata and other issues pertaining to the modern data stack.

02 Data Sci & Algos

The Data Science & Algorithms track is focused on helping data science professionals be more effective in their role. We discuss helpful algorithms, modern research and data science frameworks & methodologies that can be useful in data science functions across the enterprise.

03 ML OPs & Platforms

This track focuses on the engineering behind existing and novel machine learning systems, frameworks and tooling. You’ll learn about topics such as data preparation, feature engineering, model quality & monitoring, ml-ops, and best practices in generalizing ML workflows.

04 Analytics

The Analytics track focuses on tools and techniques for data analysts, covering topics such as Business Intelligence (BI), customer analytics, A/B testing and data visualization. You’ll learn about how top teams are solving their analytics challenges and discover the best new tools in the process.

05 Data Streaming

This track looks at various tooling used to build real time applications and the underlying data infrastructure that supports the movement of data to computation. Examples of talks include case studies on how applications use messaging systems, how applications maintain consistency and correctness while ingesting large amounts of data, and computational frameworks for processing streaming data with low latency.

06 Applied & Generative AI

The Applied AI track demonstrates the intersection of applied deep learning methods in product form. It covers topics such as Large Language/Transformer Models, generative AI, product-based implementations of new research methods and exciting new features powered by machine learning inside products.

07 Building Data Products

In the modern data era, teaching companies how to harness the power of data inside their products presents new challenges for product managers. Come to this track to better understand how top teams are executing full-stack product development of data-oriented products. You’ll learn how to tool your team to work effectively with your own data engineering and science teams as you build the data-oriented products of the future.

08 Data Culture & Community

A brand-new track at Data Council this year, Data Culture & Community is a place for Data Leaders and Community Managers to share stories and insights on how they have built vibrant, cross-functional, and collaborative spaces for data practitioners, developers, and beyond. Whether you are building a data team from scratch, establishing a strong data culture within your organization, or rallying a global network of developers around your software, you’ll have plenty to learn from our speakers who have done it all.

09 Lightning Talks

The Lightning Talks track is composed of 15min, bite-size sessions from various startups sharing lessons learned & best practices of their fast-growing companies. You can learn about new tools & approaches that startups use, cutting-edge open source projects, and data teams' lessons learned from supporting their company growth.

10 Workshops

This two-day track is dedicated to peer-led workshops, provided for free. Participants try out the latest products alongside their creators, get hands on experience with innovative tools, and meet some of the most talented builders in our industry.

Platinum

Gold

Silver

Bronze

Community Partner

Founder & Chair

Pete Soderling is the founder of Data Council and the Data Community Fund. As a former software engineer, repeat founder and investor in more than 70 data-oriented startups, Pete’s lifetime goal is to help 10,000 engineers start successful companies. Most importantly, Pete is a community builder — from his earliest days of working with the data engineering community starting in 2013, he has witnessed the unique power of specialized networks to bring inspiration, knowledge and support to technical professionals.

Data Council 2023

What's New

Follow / Join Us

Contact Us

Menu