CONFERENCE DETAILS

 

Engaging Talks

Our talks are backed by our 100% No Bullshit Guarantee* and delivered by leading data scientists and engineers from top organizations like Facebook, Netflix, Lyft, Databricks, LinkedIn, Stanford University & many more.

 

6 Unique Tracks

This year, we offered 6 unique and exciting tracks, including: Data Engineering & Beyond, Data Science & Analytics, ML Infrastructure, AI Products, Lightning Talks, and a series of peer-led Workshops.
 

Community-Powered

We are a diverse group of geeks, coders, scientists, analysts, and more. We care deeply about local data communities, no big $$$ sponsors here, just data nerds, like you & I.            

 

Born To Be Open Source

We believe in Open Source; code, content and mentality. We featured top open source contributors and tools at our talks. Afterwards, we published all content (talks and slides) for the community, for free. See below! 

 

Participate in Workshops

This was a two-day track dedicated fully to peer-led Workshops, provided for free. Participants tried out the latest products alongside their creators, got hands on experience with innovative tools, and met some of the most talented builders in our industry. 

 

Networking That Works

Our popular "Speaker Office Hours" format allows attendees to interact directly with each speaker after their talk. Office hours, Workshops, and our Data Community Party allow you to meet data influencers who will positively impact your career.

Austin 2022 Track Hosts

Stay tuned, we'll publish the schedule soon.

Austin 2022 Speakers

Stay tuned, we'll publish the schedule soon!

Conference Committee

Pete Soderling

Pete Soderling - Chair

Pete Soderling is the founder of Data Council and the Data Community Fund. As a former software engineer, repeat founder and investor in more than 40 data-oriented startups, Pete’s lifetime goal is to help 1,000 engineers start successful companies. Most importantly, Pete is a community builder — from his earliest days of working with the data engineering community starting in 2013, he has witnessed the unique power of specialized networks to bring inspiration, knowledge and support to technical professionals. 

 

Previously, Pete founded Hakka Labs (a social network for software engineers), Stratus Security (an early cloud-based API platform) and mechanikal (a software development agency in NYC). He is a graduate of New York University, has spoken at events such as TEDx, O’Reilly Strata and QCon, and has worked actively in supporting technical founders around the globe.

Oana Olteanu

Oana Olteanu - Co-Chair

Oana is an ML engineer with operating experience in enterprise software and investment experience at the Seed, Series A and Series B stages. She is currently a Principal at SignalFire strengthening their technology portfolio with a focus on seachange technologies across the software stack. 

Prior to VC, Oana worked at SAP where she contributed to the development of the backend systems used by SAP for its solution portfolio. She also developed assessment frameworks and evaluated requests for funding from more than 70 product teams, across different stages of the life cycle. 

Oana received a BS in Computer Science with a specialization in Machine Learning from Jacobs University and an MS in Computer Science and Business with specialization in Machine-to-Machine Platforms from Mannheim University.

Austin 2022 Conference Workshops

Day One

Databricks: Building Lakehouse with Delta Lake

Apache Spark™ is the dominant processing framework for big data. Delta Lake adds reliability to Spark so your analytics and machine learning initiatives have ready access to quality, reliable data. By attending this workshop, you'll learn about the use of Delta Lake to enhance data reliability for Spark environments. Learn about the role of Apache Spark in big data processing, the use of data lakes as an important part of the data architecture, data lake reliability challenges, how Delta Lake helps provide reliable data for Spark processing, specific improvements that Delta Lake adds, and the ease of adopting Delta Lake for powering your data lake.

Speaker: Vini Jaiswal, Senior Developer Advocate

Rasgo: Scalable Feature Engineering with Python

Feature engineering is more than simply missing value imputation, handling outlier and categorical variables and scaling numerical variables. It is an opportunity to allow a data scientist's creativity to shine and as Andrew Ng’s stated, “Applied machine learning is basically feature engineering.” In this workshop, we will work through the data preparation process for a sales forecasting use case. We will start with exploration of the data to identify the key tables, before moving on to engineering new features to support the forecast and join all of the data together. We will finish by making the data available both for modeling and production.

Speaker: Patrick Dougherty, Co-Founder & CEO 

Starburst: Turning Distributed Data into Data Products

By treating data as a product, organizations manage data in a way that maximizes its value and leads to faster & better decisions. Starburst enables organizations to unlock the value of distributed and complex data by making it fast and easy to access, no matter where it lives, all without the need to move or copy data. In this lab, we’ll focus on how to turn distributed data into data products that can easily be built & shared across your organization. This hands-on lab will teach participants to use the Starburst Enterprise interface to create Data Products, providing business context around datasets so that data consumers can explore, discover, and analyze data with ease. Participants will learn: What is a Data Product? How does Starburst enable modern data management paradigms such as Data Mesh? How do you publish Data Products with Starburst? How does one query and combine Data Products to gain valuable insights? We’ll showcase analyzing data from data products in Tableau.

Speaker: Vishal Singh, Head of Data Products

 

Soda: Data Reliability Engineering

Join Tom and Vijay for a one hour hands-on coding session, where you will use Soda’s Open Source tools to see how you can build reliability and observability into your data systems at scale. You’ll be given a hackathon-type challenge so that you can experience how Soda works across the data product lifecycle, including testing and validating data, version control, managing data expectations in Git, and how to publish the results to easily collaborate with the business. You’ll see first-hand how to apply “as-code” principles to the testing domain, to ensure it is highly automatable and that the ongoing management of your data systems is more efficient. We’ll also be previewing Soda’s new Domain Specific Language (DSL) for data and analytics engineers. Our new language introduces:

- Human readable data quality checks that everyone can understand
- Easy and powerful domain-specific language to create coverage fast
- Leverage out-of-the-box templates and smart suggestions

You will walk away with practical experience on how to design systems in such a way that they become more reliable, and when problems happen, you have the observability to resolve them and maximize data availability. And yes, there’ll be prizes and free swag for everyone that participates!

Speakers: Tom Baeyens, Co-Founder & CTO and Vijay Kiran, Lead, Developer Tools

TITLE: Lorem Ipsum
This 2-day track is dedicated to peer-led Workshops that attendees can experience throughout the event, for free. Try out the latest products alongside their creators, get hands-on experience with innovative tools and meet some of the most talented builders in our industry. 

ABSTRACT:
Featuring: Census (operational analytics), Rudderstack (OSS CDP), Starburst (MPP database), Soda (data monitoring), Rasgo (OSS feature engineering), Carto (geospatial database) and Y42 (low-code data platform).

Day Two 

RudderStack: Pipelines for the Modern Data Stack

In this workshop, we will walk through a practical, step-by-step example of performing lead enrichment for marketing data. We will show how we can pull data from the marketing platform, load it into the data lake, process and enrich the records with third-party and behavioral data, then push the enriched records back to the marketing platform and other platforms.

Speaker: Ryan McCrary, Solutions Engineer

Carto: How to Develop Fluency in Spatial Analysis 

Did you know that only a third of Data Scientists know their stuff when it comes to Spatial Data Science? With more and more analytics professionals turning to location data to predict and optimize their business processes, it’s clear that new Big Data sets and data warehouses have revolutionized the way in which we create and enrich spatial models.
In this workshop, Gabriel Hidalgo, Data Scientist at CARTO, walks us through how to perform spatial analytics directly in the cloud data warehouse of your choice - whether that’s Google BigQuery, Snowflake, Amazon Redshift or other solutions such as Databricks. We’ll see how those 1 in 3 who know their spatial methods have become a coveted resource for their companies.
What will you learn?
Innovative types of location data available natively in the cloud

  • How to carry out cloud native spatial analysis, including a real life example exploring how Airbnb data enriched with sociodemographics can reveal main drivers that attract tourism in a certain area and increase listing and service success.

Speaker: Gabriel Hidalgo, Data Scientist

Census: High-Quality Data, Self-Served

Do you spend day in and day out answering ad hoc data questions from your business teams? Do you wish there was an easy way for those teams to use high-quality data you’ve spent hours modeling without more tasks ending up on your desk or in your Slack inbox? Donny Flynn (and Census) is here to help. In this workshop, Donny--a former head of data and current customer data architect at Census--will run you through how reverse ETL makes self-serve data a reality and answer some burning questions, such as: Why is self-serve data so hard for data and business teams today? How does reverse ETL solve this pain? What can reverse ETL add to your data stack? And who in your organization will feel the benefits? What things should be considered when selecting a reverse ETL tool? How do you get started with reverse ETL? Bonus: How quickly can you spin up Census to sync from source tables? (It’s under 10 minutes, seriously).

Speaker: Donny Flynn, Customer Data Architect

 

Y42: End-to-End Data Pipeline-as-Code

  • Imagine you could set up an entire end-to-end data platform using code coupled with a powerful user interface that allows anyone to become a data analyst. Sounds too good to be true? Come to our workshop and you will get hands-on experience in implementing the next generation of the modern data stack – from ingestion, to transformation, and visualization, all within one tool and deployed with just code. This is a hands-on workshop, so please bring your laptop for the full experience.
  •  
  • Speakers: Ba Thien Tran, Solutions Engineer and Rebecca Zwilling, Customer Data Architect

TITLE: Lorem Ipsum
This 2-day track is dedicated to peer-led Workshops that attendees can experience throughout the event, for free. Try out the latest products alongside their creators, get hands-on experience with innovative tools and meet some of the most talented builders in our industry. 

ABSTRACT:
Featuring: Census (operational analytics), Rudderstack (OSS CDP), Starburst (MPP database), Soda (data monitoring), Rasgo (OSS feature engineering), Carto (geospatial database) and Y42 (low-code data platform).

Austin 2022 Schedule

Day One - Mar 23

Day Two - Mar 24

VIEW THE SCHEDULE

Stay tuned, we'll publish the schedule soon!

View Venue Location

FEATURE SLIDER

  • Slide One Header

    Slide One Subheader

    Sed auctor neque eu tellus rhoncus ut eleifend nibh porttitor. Ut in nulla enim. Phasellus molestie magna non est bibendum non venenatis nisl tempor. Suspendisse dictum feugiat nisl ut dapi proin quis tortor orcitiam at risus et justo praesent id metus massa, ut blandit odio.

  • Slide Three Header

    Slide Two Subheader

    Vivamus hendrerit arcu sed erat molestie vehicula. Sed auctor neque eu tellus rhoncus ut eleifend nibh porttitor. Ut in nulla enim. Phasellus molestie magna non est bibendum non venenatis nisl tempor. Suspendisse dictum feugiat nisl ut dapibus. Mauris iaculis porttito.

  • Slide Three Header

    Slide Two Subheader

    At vero eos et accusamus et iusto odio dignissimos ducimus qui blanditiis praesentium voluptatum deleniti atque corrupti quos dolores et quas molestias excepturi sint occaecati cupiditate non provident, similique sunt in culpa qui officia deserunt mollitia animi.

View All Events

What Our Attendees Say

It's no secret that Data Council has been one of my favorite conferences, as it is for many other data professionals. Data Council is unique because it's organized by technical experts, for technical experts. While there are great conferences to discuss academic research, there are few where you can hear from the next generation of data experts in the industry, and at Data Council, the speakers are almost exclusively the founders of some of the most disruptive startups in the space. 

Jennifer Prendki

Founder & CEO
Alectio

Jennifer Prendki
Just wanted to reach out to thank you for the opportunity to speak at the conference. As a conference it was very informative and the talks inspired some of my software design approaches. I'm sharing my experiences with my team to help spread the knowledge.

Neelesh Salian

Senior Software Engineer
Stitch Fix

Neelesh Salian
Thanks for organizing Data Council. I really enjoyed the conference! I think you managed to build something with the right balance of tech and business content. Definitely one of the best conferences I attended this year. Not only that the talks were valuable but I enjoyed speaking to other participants. I found that many of them face similar challenges to the ones we deal with on a daily basis. Looking forward for the next one.

Gideon Mendels

Co-founder & CEO, Comet.ml

Gideon Mendels

Austin 2022 Partners

PLATINUM

Mparticle

GOLD

Census
Rudder
Star Burst

SILVER

Carto
Soda BW
Mode Analytics
Rasoo
y2
astronomer-logo-RGB-reverse-1200px
Delta Lake Logo_01 CMYK

BRONZE

Ahana
Bodo
Databand
Iggy
LI (1)-1
Materialize
Privacy Dynamics
Semitext
Big Eye
Arize
Gest
High Touch - HQ
Meltano
Select Star
Open Teams
SNOWPLOW
FIREBOLT
SNOWFLAKE
Elementl

COMMUNIT Y PARTNERS

RT Insights
Num Focus
Anaconda
Dew
Partner With Us