Over the last decade I have been lucky enough to contribute a few successful open source projects to the data ecosystem. In this talk I’ll share the story of how these projects came to be and what made their success possible. I’ll describe the ideation process and early growth of the Apache Parquet columnar format and show how this led to the creation of its in-memory alter ego, Apache Arrow. I’ll end by showing how this experience enabled the success of OpenLineage, an LFAI & Data project that brings observability to the data ecosystem. Along the way, I’ll talk about the key elements that catalyzed their growth, from project focus to governance to community.
Julien Le Dem is the Chief Architect of Astronomer and Co-Founder of Datakin. He co-created Apache Parquet and is involved in several open source projects, including OpenLineage, Marquez (LFAI & Data), Apache Arrow, and Apache Iceberg. Previously, he was a senior principal at WeWork; principal architect at Dremio; tech lead for Twitter’s data processing tools, where he also obtained a two-character Twitter handle (@J_); and a principal engineer working on content platforms at Yahoo, where he received his Hadoop initiation. His French accent makes his talks particularly attractive.