Data Council

Data Council Talks

Macrobase: A Search Engine for Fast Data Streams
Sahaana Suri | Stanford University

TimescaleDB: Rearchitecting a SQL database for time-series data
Mike Freedman | TimescaleDB

Production Analytics With a Distributed Column Store
Sam Stokes | Honeycomb

Don’t optimize my queries, optimize my data!
Julian Hyde | Google

Worse Case Scenario in the Database
Marianne Bellotti | United States Digital Service

The Challenges of Distributing Postgres: A Citus Story
Ozgun Erdogan | Citus Data

The Statistics of Dirty Data
Sanjay Krishnan | UC Berkeley

Easy, Scalable, Fault-tolerant Stream Processing with Structured Streaming in Apache Spark
Burak Yavuz | Databricks

Using Apache Arrow, Calcite and Parquet to build a Relational Cache
Jacques Nadeau | Dremio

How Spotify Distills Terabytes of Raw Data into Meaningful Music Recommendations
Gandalf Hernandez | Spotify

Building a Recursive BigQuery Mapper
Darren McCleary | The New York Times

Building ETL Infrastructure that Analysts Love
Christian Romming | ETLeap

Using Apache Spark for processing trillions of records each day at Datadog
Vadim Semenov | Datadog

Productizing structural models
James Savage | Schmidt Futures

Lessons in hiring data scientists
Genevieve Smith | Insight

Using Causal Forests for Subpopulation Identification in Randomized Clinical Trials
James Faghmous | Icahn School of Medicine at Mt. Sinai

You Won't Believe How We Optimize our Headlines
Lucy Wang | BuzzFeed

Deploying Data Science for Distribution of The New York Times
Anne Bauer | The New York Times

Zip codes and other lies your map told you
Peter Lenz | Near

Automating machine learning
Andreas Mueller | Data Science Institute

Building automated support at Kickstarter
Jeffrey Doker | Kickstarter

Privacy Techniques for Data Science
Jim Klucar | Immuta

The Future of Data Science in the Media
Haile Owusu | Mashable

Composable Interfaces for Parallel Processing in Apache Spark & Weld
Matei Zaharia | Databricks

Data Science @ Pinterest
Mohammad Shahangian | Pinterest

Data Science in the Enterprise
Sean Anderson | Vectara

Practical Solutions for Annoying Machine Learning Problems
Alyssa Frazee | Stripe

Beyond 50,000 Partitions: How Heroku Pushes the Limits of Kafka at Scale
Jeff Chao | Heroku

Scaling Up Spark at Facebook – a 60TB Production Use Case
Shuojie Wang | Facebook

How Superset and Druid Power Real-Time Analytics at Airbnb
Max Beauchemin | Preset

Hoodie: An Open Source Incremental Processing Framework From Uber
Vinoth Chandar | Onehouse

Data for the 99%
Benn Stancil | ThoughtSpot

Payment Fraud in Digital Currency
Soups Ranjan | Revolut

The Limitations of Big Data in Predictive Analytics
Jennifer Prendki | Alectio

Practical Lessons for Building Machine Learning Models in Production
Sharath Rao | Instacart

An Introduction to Big Data's Unsung Hero: The Log
Liz Bennett | Stitch Fix

Why, When, How: Lessons Learned in Applying Deep Learning to Real-World Problems
Daniel Galron | eBay

Anomaly Detection for Data Quality and Metric Shifts at Netflix
Laura Pruitt | Netflix

Cloud-Native Stream Processing
Sid Anand | Apache Software Foundation

Parsing of Diverse Healthcare Data
Chris Hartfield | Clover Health

Interactive Exploratory Analytics with Druid
Fangjin Yang | Imply

InfluxDB Storage Engine Internals
Paul Dix | InfluxData

A Nation of Immigrants: The Data Sciences
Kenneth Sanford | Dataiku

Twitter Heron: The Path Towards Elastic Streaming
Ashvin Agrawal | Microsoft

Simulation-based Inference: Advantages Over A/B Testing in Real Estate
Nelson Ray | Opendoor

Format Wars: from VHS and Beta to Avro and Parquet
Silvia Oliveros Torres | Silicon Valley Data Science

Real-time System Computing Engines
Steffen Peter | Levyx

The Right Stuff: Lessons Learned from a Decade of Data Engineering
Ben Hamner | Kaggle

How Engineer Angels Evaluate Data-Oriented Companies
Jocelyn Goldfein | Zetta

The Future of Column-Oriented Data Processing with Arrow and Parquet
Julien Le Dem | Datadog

Computational Social Science: Exciting Progress & Future Challenges
Duncan Watts | Microsoft Research

Causal Inference in Data Science From Prediction to Causation
Amit Sharma | Microsoft Research

Reinforcement Learning for Data Scientists
Brian Farris | Bloomberg

How to Change a City with Data Science
Ben Wellington | Two Sigma

Python Data Wrangling: Preparing for the Future
Wes McKinney | Posit

Lessons Learned Optimizing NoSQL for Apache Spark
John Musser | Ford Motor Company

Unified Pipeline Architecture: The Evolution of Data Processing at Spotify
Erin Palmer | Spotify

SystemML & Spark: a Framework for Scalable Data Science Algorithm Development
Jerome Nilmeier | IBM

Stop Obsessing about Data Infrastructure
Yair Weinberger | Alooma

Apache Kafka and the Rise of Stream Processing
Guozhang Wang | Confluent

Genomic Data Analysis with Spark & Hadoop
Ryan Williams | Icahn School of Medicine at Mount Sinai

Anomaly Detection for Real-World Systems
Manojit Nandi | STEALTHbits

Building a Cloud-Native SQL Database
Alex Robinson | Cockroach Labs

To Get the Value, Ditch the Hype
Nick Ursa | The New York Times

Statistical and Computational Challenges of Real-Time News Clustering
Jeiran Jahani | Chartbeat

Predicting Chaotic Systems with Sparse Data: Lessons from Numerical Weather Prediction
David Kelly | New York University

The Trials and Tribulations of Scaling Data Science and Engineering
Ashley Miller | Datadog

Peloton: The Self-Driving Database Management System
Andy Pavlo | Carnegie Mellon University

Elastic Big Data Platform at Datadog
Doug Daniels | Datadog

Processing Geographic Data at Internet Scale
Peter Lenz | Near

Bias, Variance and Adaptive Products
George Davis | Frame.ai

Career Panel - Leveling Up in Your Career as a Data Scientist/Engineer
Nick Chamandy | Lyft

VC Panel - The Present Future of Data-Oriented Startups | DataEngConf NY '16
Matt Hartman | Betaworks

Scalability! But at What COST?
Frank McSherry | Materialize

Reducing Student Loans with Bot-Powered Humans
William Falcon | Facebook / NYU

Building Bots, Building Blocks: How Forbes Experiments, Evaluates, and Kills Data-driven Bots
Luis Capelo | Forbes