Skip to main content
Contributor

Contributor

By Eric Anderson
The origin story behind the best open source projects and communities.
Where to listen
Apple Podcasts Logo

Apple Podcasts

Castbox Logo

Castbox

Google Podcasts Logo

Google Podcasts

Overcast Logo

Overcast

Pocket Casts Logo

Pocket Casts

RadioPublic Logo

RadioPublic

Spotify Logo

Spotify

Stitcher Logo

Stitcher

Currently playing episode

Envoy Proxy with Matt Klein

Contributor

1x
Temporal (Part 2) with Maxim Fateev and Dominik Tornow
Eric Anderson (@ericmander) returns to Temporal with co-founder Maxim Fateev (@mfateev) and principal engineer Dominik Tornow (@DominikTornow). When Maxim joined us in September of 2020, the company called their project a “workflow orchestrator.” Today, Temporal has grown in popularity and usability, but the terminology around that abstraction has changed. Tune in to track the evolution of what Maxim calls a genuinely “new category of software.” In this episode we discuss: New features and developments in the last 2 years The proper way to pronounce “Temporal” How Temporal guarantees that workflow execution actually runs to execution Describing Temporal as a new pair of glasses Replay, Temporal’s first developer conference on August 25-26, in Seattle Links: Temporal Cadence Apache Cassandra Replay People mentioned: Samar Abbas (@samarabbas77) Other episodes: Temporal with Maxim Fateev Apache Cassandra with Patrick McFadin
40:44
June 22, 2022
Scarf with Avi Press
Eric Anderson (@ericmander) interviews Avi Press (@avi_press) about Scarf, the distribution platform for open-source software that facilitates analytics and commercialization. Scarf offers a set of tools that allows founders and maintainers to understand adoption of their products, including Scarf Gateway, which provides a central access point to containers and packages. From there, open-source developers can connect with the people that rely on their work. In this episode we discuss: Why you can’t rely on Github as a source of comprehensive data about open-source software Tracing a user’s journey interacting with a project across multiple platforms How better observability allows maintainers to make better software Inspiring indie maintainers to commercialize their projects The privilege of being able to work in open-source, and how Scarf can enable a more inclusive developer community Links: Scarf Tidelift Gitcoin OpenTeams Aviyel
28:21
June 08, 2022
Rasgo with Patrick Dougherty
Eric Anderson (@ericmander) and Patrick Dougherty (@cpdough) talk about Rasgo, the data transformation platform for MLOps that makes generating SQL easy. The team at Rasgo recently open-sourced a package called RasgoQL, that allows users to execute SQL queries against a data warehouse using Python syntax. Tune in to find out how Rasgo aims to bridge an important gap in the Modern Data Stack. In this episode we discuss: The advantages of offering both a low-code/no-code UI and a Python interface "How can a data scientist, without needing full-time resources from data engineering, be somewhat self-sufficient in data prep and able to deliver those insights without a massive human capital investment needed?" Where Rasgo fits into the world of feature stores Why one Rasgo user took a trip to a wind farm in Texas Eric’s predictions for the future of data prep and transformation Links: Rasgo RasgoQL DuckDB Delta Lake People mentioned: Jared Parker (@jaredtparker_)
28:49
May 25, 2022
Feast with Willem Pienaar
Eric Anderson (@ericmander) and Willem Pienaar (@willpienaar) talk about Feast, the open-source feature store for machine learning. Feature stores act as a bridge between models and data, and allow data scientists to ship features into production without the need for engineers. Willem co-created Feast at Gojek, and later teamed up with the folks at Tecton to back the project. In this episode we discuss: The value of feature stores in MLOps What happens when you open-source too early Why most open-source code has nothing to hide Bringing an open-source project to an existing company Good and bad use cases for a feature store Links: Feast Tecton Turing Merlin Kubeflow apply() Conference People mentioned: Mike Del Balso Kevin Stumpf (@kevinmstumpf) Ajey Gore (@AjeyGore) Demetrios Brinkmann (@Dpbrinkm) Wes McKinney (@wesmckinn) Other episodes: Flyte with Ketan Umare Great Expectations with Abe Gong and Kyle Eaton
32:29
May 11, 2022
Flyte with Ketan Umare
Eric Anderson (@ericmander) and Ketan Umare (@ketanumare) discuss Flyte, the open-source workflow automation platform for large-scale machine learning and data use cases. Ketan is a former engineer at Lyft, where he created Flyte to help models in Pricing, Locations, ETA, and more. Today, the project allows machine learning developers everywhere to bring their ideas from conception to production. In this episode we discuss: How Flyte combines compute with parts of a workflow engine in a way that is best for the user The importance of reliable fares and ETA predictions at a ride-sharing app A progenitor to Flyte called “Better Airflow” Ketan’s innovative approach to bringing typing to machine learning workloads Why Flyte landed at the Linux Foundation Links: Flyte Union.ai Apache Airflow Kubeflow Luigi MLTwist Other episodes: Great Expectations with Abe Gong and Kyle Eaton Envoy Proxy with Matt Klein
36:29
April 27, 2022
Activeloop with Davit Buniatyan
Eric Anderson (@ericmander) meets with Davit Buniatyan (@DBuniatyan) of Activeloop, the database for AI. Davit was inspired to found Activeloop while working on large datasets in a neuroscience research lab at Princeton. Powering the technology at Activeloop is Hub, the open-source dataset format for AI applications. Join us to learn how Hub promises to enhance and expand various verticals in deep learning. In this episode we discuss: Reconfiguring traditional ML tooling for the cloud Connectomics - working with thin slices of a mouse brain with neuroscientist Sebastian Seung Choosing between university, a start-up, and open-source Davit’s original product, that ran computation on crypto mining GPUs on a distributed scale Focusing on different data modalities for computer vision Links: Activeloop Activeloop Hub Apache Parquet Apache Spark TensorFlow Snowflake Databricks Timescale People mentioned: Sebastian Seung (@SebastianSeung) Other episodes: TensorFlow with Rajat Monga
30:30
April 13, 2022
Unikraft with Alexander Jung and Simon Kuenzer
Eric Anderson (@ericmander), Alexander Jung (@nderjung) and Simon Kuenzer (Github: @skuenzer) get technical on Unikraft, the open-source unikernel development kit. Unikernels are specialized, high performing OS images that have the potential to revolutionize virtualization. Unikraft makes unikernels easy to use by prioritizing modularity, security, and POSIX-compatibility. In this episode we discuss: How Unikraft seeks wider adoption of unikernels in real-world applications Unikraft’s background in research and academia Bottom-up as well as top-down specialization Building a community with a large proportion of students Links: Unikraft Unikraft: Fast, Specialized Unikernels the Easy Way Xen Project MirageOS HermitCore Firecracker
27:13
March 30, 2022
EdgeDB with Yury Selivanov
Eric Anderson (@ericmander) has a conversation with Yury Selivanov (@1st1), the co-founder of EdgeDB. EdgeDB is the world’s first “graph-relational database.” It’s a term coined specifically for this new type of database, designed to ease the pain of dealing with the usual relational and NoSQL models. And no, EdgeDB is NOT a graph database! In this episode we discuss: A glitch at EdgeDB’s Matrix-inspired launch event Origin of the term and design philosophy, “graph-relational” What to know about becoming a Python core developer How EdgeDB’s next-gen query language compares to GraphQL and SQL Links: EdgeDB magicstack uvloop People mentioned: Elvis Pranskevichus (@elprans) Colin McDonnell (@colinhacks) Victor Petrovykh (Github: @vpetrovykh) Dan Abramov (@dan_abramov) Brett Cannon (@brettsky) Daniel Levine (@daniel_levine) Other episodes: Hasura with Tanmai Gopal Dgraph with Manish Jain
45:27
March 16, 2022
Deephaven with Pete Goddard
Eric Anderson (@ericmander) sits down with Pete Goddard (@pete_paco) to talk about Deephaven, the open-core query engine built for real-time streams and batch data. Pete is the CEO of Deephaven Data Labs, and comes to the data world from a background in capital markets trading. Deephaven originally addressed a need for real-time data infrastructure in the finance world, but the team realized how useful their technology could be in a wider variety of verticals. Join us for Pete’s unique perspective on reaching out into alternate industries and use cases through community development. In this episode we discuss: How Pete transitioned from Wall Street to open-source software Selling investors on open-source Two questions people always ask Pete The luxury of Deephaven’s incremental update model Barrage, Deephaven’s API for streaming tables that extends Apache Arrow Flight Links: Deephaven Barrage Apache Kafka Apache Arrow Flight Eclipse Jetty Other episodes: TensorFlow with Rajat Monga
41:18
March 02, 2022
Meltano with Douwe Maan
Eric Anderson (@ericmander) and Douwe Maan (@DouweM) chat about Meltano, the open-source DataOps operating system. Meltano provides the connective tissue that allows teams to treat their data stack as a single software development project. Tune in to learn how Meltano is trying to bring software development best practices into the data world. In this episode we discuss: Meltano’s origins as a side project at GitLab How Meltano glues together open-source technologies like Singer, dbt and Airflow Douwe’s experience wearing many different hats in the early days of Meltano Meltano’s shift from an ELT solution to an operating system The Love-Tap Fest community event, starting right after this episode’s release! Links: Meltano Love-Tap Fest - February 17-24th, 2022 GitLab Singer dbt Apache Airflow Apache Superset Terraform People mentioned: Taylor Murphy (@tayloramurphy) AJ Steers (@aaronsteers) Other episodes: Great Expectations with Abe Gong and Kyle Eaton Dagster with Nick Schrock Prefect with Jeremiah Lowin
39:16
February 16, 2022
Penpot & Taiga with Pablo Ruiz-Múzquiz
Eric Anderson (@ericmander) and Pablo Ruiz-Múzquiz (@diacritica) examine the intersection of open-source, agile development, and UI/UX design at the heart of two applications, Penpot and Taiga. Penpot is a design and prototyping platform intended for cross-domain teams, while Taiga is a popular agile project management software. These products comprise the heart of Pablo’s innovative company, Kaleidos Open Source, which was founded in Spain more than a decade ago. Listen to today’s episode for one of the industry’s most unique perspectives on open-source code and design. In this episode we discuss: An internal crisis and a major pivot for Kaleidos How Penpot was born from Kaleidos’ signature personal innovation week Designing a design tool that can be used to design itself Bringing design, code and people closer together Why Pablo asserts that designers care about open-source Links: Penpot Taiga Kaleidos Open Source Other episodes: Blender with Dalai Felinto
39:03
February 02, 2022
DockerSlim with Kyle Quest
Eric Anderson (@ericmander) and Kyle Quest (@kcqon) discuss DockerSlim, the open-source optimization and security tool for Docker container images. Kyle initially created DockerSlim as a humble hackathon project, and now supports it with his company, Slim.AI. Tune in to learn how DockerSlim is redefining DevOps with application intelligence and a backwards compatible vision of the future. In this episode we discuss: Bridging the gap between application and infrastructure Emerging from the cloud native stone age Application intelligence rather than artificial intelligence in Slim.AI DockerSlim integrated into CI/CD pipelines, embedded systems, and robots How Slim.AI aims to become ‘Google for containers’ Links: DockerSlim Slim.AI Terraform Serverless Sigstore
38:32
December 22, 2021
Unleash with Egil and Ivar Østhus
Eric Anderson (@ericmander) is joined by Egil (@EgilCo) and Ivar Østhus (@ivarconr), brothers and co-creators of the open-source feature management platform, Unleash. It’s a real family business, with Egil acting as CEO and Ivar the CTO of the company. Over beers and burgers, the two decided to bring their strengths together for a feature toggle tool that transforms DevOps and continuous deployment pipelines. In this episode we discuss: Ivar as a pioneer in the trunk-based development space Word of mouth and old colleagues bringing Unleash to new companies Assessing a contributor’s personality and mindset Resolving a deadlock scenario on a feature launch day without impacting customers Why feature flagging is fundamental to true DevOps Links: Unleash
29:29
December 08, 2021
Kubescape with Shauli Rozen
Eric Anderson (@ericmander) invites Shauli Rozen (@shaulir) to share about his work on Kubescape, the first open-source Kubernetes security testing tool that is compliant with NSA & CISA hardening guidelines. Despite the project’s recency, Kubescape has seen explosive growth on Github and recognition from the Kubernetes community. Tune in to learn how the team at ARMO built a successful open-source security tool for DevOps. In this episode we discuss: Why Kubescape uses guidance from the NSA & CISA Correcting the misconception that developers don’t care about security Providing value in the first five minutes of using the tool ARMO’s detailed approach to community feedback Shauli’s thoughts on security roles of the future Links: ARMO Kubescape Terraform
26:49
November 24, 2021
Blender with Dalai Felinto
Eric Anderson (@ericmander) connects with Dalai Felinto (@dfelinto), development coordinator at Blender. Blender is a free and open-source 3D graphics toolset with a unique story spanning nearly 30 years. The project is used professionally for animation, video games, scientific visualization, and much more. Join us for a very special episode of Contributor as we take a deep dive into one of the most dedicated, robust communities in open-source history. In this episode we discuss: When the dotcom crash landed Blender in the hands of community members Taking open-source beyond the toolset with open movie projects Dalai’s transition from burgeoning architect to Blender developer How you can use Blender’s new Geometry Nodes for AI training Solving organizational challenges with full-time staff and contributors Links: Blender Blender Studio Big Buck Bunny Elephants Dream Sprite Fright People mentioned: Ton Roosendaal (@tonroosendaal)
36:15
November 10, 2021
Great Expectations with Abe Gong and Kyle Eaton
Eric Anderson (@ericmander) interviews Abe Gong (@AbeGong) and Kyle Eaton (@SuperCoKyle) about Great Expectations, the open-source framework that aims to create a shared standard for data quality. Abe is a core contributor to the project, and the CEO and co-founder of Superconductive, the team backing Great Expectations. Kyle is Growth Lead at Superconductive, and Community Manager of Great Expectations. The team at Superconductive have just launched the new Expectation Gallery to connect contributors and carve out vertical spaces in this ecosystem. Tune in to find out why Great Expectations is the leading open-source project for eliminating pipeline debt. In this episode we discuss: How the Expectation Gallery enables new modes of community engagement Superconductive’s pivot from healthcare data consulting to open-source data validation Collaborative conversations with other data companies Abe’s advice to future open-source founders on segmenting value The vision of Great Expectations as a protocol-level open standard Links: Great Expectations Superconductive Down with Pipeline debt Cascade Data Labs Flyte Dagster Databricks pandas People mentioned: James Campbell (@jpcampbell42) Other episodes: Dagster with Nick Schrock
32:18
October 27, 2021
Bolster with Abhishek Dubey
Eric Anderson (@ericmander) sits down with Abhishek Dubey (@abhishekdubey), co-creator of Bolster, the fraud prevention platform powered by deep-learning. Bolster is used by clients like LinkedIn, Uber and Dropbox for its cutting-edge detection and takedown technology. Abhishek and his co-founder built Bolster around the real-time URL-scanning tool CheckPhish, which analyzes phishing sites for free. On today’s episode, learn how Abhishek and the team at Bolster have found success by focusing on building their business out of passion, and giving back to the community. In this episode we discuss: A second mortgage and a startup garage Discovering 100 Fortune 500 companies were using CheckPhish How Bolster snagged LinkedIn without a proof of concept Bolster’s secret sauce, that sets them apart from other security companies Comparing the community focus of Bolster to a traditional open-source model Links: Bolster CheckPhish Hacker Dojo Twilio People mentioned: Shashi Prakash (@skiddzo)
34:22
July 28, 2021
Sanity with Magnus Hillestad and Even Westvang
Eric Anderson (@ericmander) sits down with Magnus Hillestad (@MHillestad) and Even Westvang (@even), co-founders of the unified content platform Sanity. The team at Sanity helps businesses organize their structured content as data, allowing distribution from a single source of truth. Tune in today’s episode to learn how Sanity aims to change the way people think about content. In this episode we discuss: The open-source editing environment and CMS, Sanity Studio From content as data, to coffee table books How Sanity differs from a traditional CMS Why the Sanity team turned down a contract with the United Nations Building a team that can scale to a vision of ubiquity Links: Sanity Sanity Studio Figma Brex Netlify People mentioned: Simen Svale Skogsrud (@svale) Øyvind Rostad (@rostad)
40:46
July 14, 2021
Teleport with Ev Kontsevoy
Eric Anderson (@ericmander) and Ev Kontsevoy (@kontsevoy) talk about Teleport, the open-source tool for instant access to cloud resources. These include SSH servers, Kubernetes clusters, databases and more. Teleport was inspired by the growing complexity of cloud environments, and aims to make engineers feel like all their cloud applications are in the same room together. In this episode we discuss: How Teleport grew from a side project to Gravity, the open-source toolkit for packaging and running applications autonomously Unifying and consolidating modern access methods and industry best practices Bringing identity to a protocol-level An early community use case for Teleport in the cattle industry Engaging with outside contributions while balancing security constraints Links: Teleport Gravity Mailgun
33:37
June 30, 2021
Rook with Travis Nielsen
Eric Anderson (@ericmander) and Travis Nielsen (@STravisNielsen) talk about Rook, the open-source storage orchestrator for Kubernetes. Travis is a Senior Principal Software Engineer at Red Hat, and maintainer of Rook. Join us to dive deep into the story of Rook, from Microsoft, to Quantum, to Red Hat. In this episode we discuss: Ceph + Kubernetes = Rook The difficulty and importance of a stable storage solution for stateless applications How Rook leverages Kubernetes CRDs Why the Rook team decided to work with the CNCF Red Hat’s philosophy and approach to open-source Links: Rook Red Hat Upbound Quantum CNCF People mentioned: Bassam Tabbara (@bassamtabbara) Jared Watts (@jbw976)
32:30
June 16, 2021
Apache Cassandra with Patrick McFadin
Eric Anderson (@ericmander) and Patrick McFadin (@PatrickMcFadin) delve into the history of Apache Cassandra, the open-source NoSQL database born and bred around cloud over a decade ago. Patrick is the VP of Developer Relations at DataStax, and a member of the Cassandra Project Management Committee. On today’s episode, Patrick shares his philosophy on developer advocacy and experience in open-source. In this episode we discuss: Behind the NoSQL explosion that made Cassandra the darling of the valley Comparing different eras of commercializing open-source, then and now How Patrick became a pioneer in evangelizing and community-building The two kinds of people to recruit for developer relations Why Patrick says open-source is going to “start eating clouds” Links: Apache Cassandra Datastax Datastax Astra People mentioned: Avinash Lakshman (@HedvigEng) Prashant Malik (@pmalik) Adrian Cawcroft (@adrianco) Kelsey Hightower (@kelseyhightower) Other episodes: Chef with Adam Jacob
34:23
June 02, 2021
Dagster with Nick Schrock
Eric Anderson (@ericmander) interviews Nick Schrock (@schrockn) about Dagster, the open-source data orchestrator for machine learning, analytics, and ETL. Nick is the founder and CEO of Elementl, and is well-known for creating the Project Infrastructure group at Facebook, which spawned GraphQL and React. On today’s episode of Contributor, Nick explains how he set out to fix an inefficiency he identified amongst the complexity of the data infrastructure domain. In this episode we discuss: Dagster’s place in the industry shift towards thinking of data as a software engineering discipline Why Nick believes it’s time for the term “data cleaning” to be retired The empowerment of Dagster’s instantaneous spin-up process and local development experience How a partner integrated Dagster into workflow for ops workers on the warehouse floor One user’s testimony that, “what dbt did for our SQL, Dagster did for our Python” Links: Dagster Elementl GraphQL React dbt Snowflake Apache Airflow People mentioned: Lee Byron (@leeb) Dan Schafer (@dlschafer) Abe Gong (@AbeGong)
34:59
May 19, 2021
Hasura with Tanmai Gopal
Eric Anderson (@ericmander) and Tanmai Gopal (@tanmaigo) dive into the open-source Hasura GraphQL Engine and the wider Hasura community. Hasura provides real-time GraphQL APIs for databases, so developers can focus on building applications without worrying about infrastructure. Tune in to hear the full story about how Tanmai and his team are helping engineers unlock the dream of self-serve data access. In this episode we discuss: How the early Hasura team created their own version of GraphQL in parallel Developing community with ease of onboarding and radical transparency Transitioning community events into the COVID world, and looking to a future beyond travel Hasura’s secret sauce: the authorization framework Links: Hasura Hasura Con’21 DigitalOcean People mentioned: Rajoshi Ghosh (@rajoshighosh)
36:12
May 05, 2021
MindsDB with Jorge Torres and Adam Carrigan
Eric Anderson (@ericmander) is joined by the co-founders of MindsDB, Jorge Torres (@JorgeTorresAI) and Adam Carrigan (@AdamMCarrigan). MindsDB is an open-source AI layer that integrates with existing databases, from MySQL to Clickhouse. Tune in to learn how these two former college roommates are working to bring machine learning into the mainstream. In this episode we discuss: Why it makes sense to run machine learning models in the database Partnering with Kafka, Looker, and more MindsDB’s initial adoption by students at Berkeley Different applications for MindsDB and machine learning in ecommerce, finance, and more The moment Jorge knew he had to get into business with Adam Links: MindsDB RedisConf 2021 Looker Apache Kafka ClickHouse Other episodes ClickHouse with Alexey Milovidov and Ivan Blinkov
29:40
April 21, 2021
Anaconda with Peter Wang
Eric Anderson (@ericmander) welcomes Peter Wang (@pwang) for a conversation about the Python ecosystem and the open-source communities that have built it. Peter is the creator of Anaconda, the near-essential Python distribution for scientific computing that makes managing packages a lot more manageable. In today’s episode, Peter offers a unique and powerful perspective on how to make the economics of open-source work for everyone. In this episode we discuss: The paradox of the PVM and Python’s packaging difficulties How Guido van Rossum implied permission for Anaconda and the open-source Python movement Python as the lingua franca of a new professional class Looking to Roblox for inspiration for a scientific computing creator community Giving back to open-source communities through the NumFOCUS Foundation Links: Anaconda NumFOCUS NumPy SciPy Enthought  Jupyter TensorFlow MicroPython scikit-learn pandas Quansight Red Hat Roblox People mentioned: Travis Oliphant (@teoliphant) Fernando Pérez (@fperez_org) Brian Granger (@ellisonbg) Min Ragan-Kelley (@minrk) Guido van Rossum (@gvanrossum) James Currier (@JamesCurrier) Other episodes: NumPy & SciPy with Travis Oliphant TensorFlow with Rajat Monga
34:40
April 07, 2021
Redpanda with Alexander Gallego
Eric Anderson (@ericmander) is joined by Alexander Gallego (@emaxerrno) for an examination of Redpanda, the source available event streaming platform designed as a drop-in replacement for Kafka. Redpanda’s storage engine is attractive to developers for its performance and simplicity, removing the complexity of running Kafka to scale and deploying with a single binary. Listen to today’s episode to learn more about how Alexander and the team at Vectorized are looking to advance the conversation around streaming into the future. In this episode we discuss: What Alexander means when he says that hardware is the platform for data streaming The 3 things that turn a data stream into a data product Comparing Redpanda to Kafka and Pulsar A difference in product philosophy between selling to data teams vs app developers How Alexander approached the challenge of monetizing data infrastructure Links: Redpanda Vectorized Apache Kafka Apache Pulsar Apache Spark Apache Beam Apache Storm Apache Flink Elastic CockroachDB Other episodes: TensorFlow with Rajat Monga Scylla with Dor Laor
33:41
March 24, 2021
Storybook with Zoltan Olah
Eric Anderson (@ericmander) and Zoltan Olah (@zqzoltan) discuss Storybook, the open-source UI component development tool. Storybook supports all the most popular frontend frameworks and libraries such as React, Vue and Angular, but allows users to test and develop components in isolation. In today’s episode, learn more about the early days of the component-driven development methodology and how Storybook was saved by a passionate community of engineers. In this episode we discuss: Storybook as an integral part of UI design workflow How Zoltan and his team inherited Storybook and saved it from being “left out to dry” Solving a pain point for front-end engineers with Chromatic’s UI regression testing, built on top of Storybook Why Zoltan compares components to APIs, and Storybook to a service mesh What’s happening today in the world of open-source design systems Links: Storybook Chromatic Meteor GraphQL React Tailwind Selenium Cypress Material-UI Figma Learn Storybook People mentioned: Dominic Nguyen (@domyen) Tom Coleman (@tmeasday) Arunoda Susiripala (@arunoda) Norbert de Langen (@NorbertdeLangen) Michael Shilman (@mshilman)
27:36
March 10, 2021
SkyWalking with Sheng Wu
Eric Anderson (@ericmander) and Sheng Wu (@wusheng1108) discuss Apache SkyWalking, an open-source APM tool focusing on cloud-native and distributed systems. SkyWalking was originally developed in 2012 as a training tool for developers new to distributed systems architecture, but it became Sheng’s pet project for several years until he brought it to the Apache Incubator program. Listen to today’s episode for the inside scoop of how this “hidden gem” fits into the Apache network of open-source software projects. In this episode we discuss: Why open-source APMs are not very common SkyWalking’s focus on attracting more contributors rather than users How a conflict of interest at Huawei led to a “bake-off” between Apache and CNCF The impact of Elastic changing their license on the open-source community The name “Skywalking,” its sources of inspiration, and an easter egg Links: Apache SkyWalking Kubernetes The Apache Incubator CNCF Tetrate Apache ShardingSphere Apache APISIX Envoy Proxy Apache Airflow Apache Beam Dynatrace New Relic Elastic Helm Zipkin Other episodes: Envoy Proxy with Matt Klein
32:03
February 24, 2021
Snowpack with Fred K. Schott
Eric Anderson (@ericmander) and Fred K. Schott (@FredKSchott) dive into the world of Snowpack, an open-source, frontend build tool for web developers. Snowpack is special because it uses Javascript’s ES module system to instantly write file changes to the browser. Fred created Snowpack and the Skypack CDN to fulfill his vision of the future of the web, which he first recognized while trying to advance the Javascript ecosystem with an earlier project called Pika. On today’s episode, find out how Fred rejected the pain of modern web development, and came up with a better solution. In this episode we discuss: Reconfiguring old ideas for today’s web development landscape How Snowpack and Skypack lighten the load when it comes to Node modules and storage space Questioning what it means to build a modern application that works for developers and users alike Skypack and the future of shared dependencies across different sites Why Snowpack is using an open governance framework Links: Snowpack Skypack OCTO Speaker Series - Fred K. Schott Svelte React Ripple Microsite Deno Next.js esbuild webpack People mentioned: Rich Harris (@Rich_Harris) Nate Moore (@n_moore)
31:35
February 10, 2021
NumPy & SciPy with Travis Oliphant
Eric Anderson (@ericmander) and Travis Oliphant (@teoliphant) take a far-reaching tour through the history of the Python data community. Travis has had a hand in the creation of many open-source projects, most notably the influential libraries, NumPy and SciPy, which helped cement Python as the standard for scientific computing. Join us for the story of a fledgling community from a time “before open-source was cool,” and their lessons for today’s open-source landscape. In this episode we discuss: How biomedical engineering, MRIs, and an unhappy tenure committee led to NumPy and SciPy Overcoming early challenges of distribution with Python What Travis would have done differently when he wrote NumPy Successfully solving the “two-option split” by adding a third option Community-driven open-source interacting with company-backed open-source Links: NumPy SciPy Anaconda Quansight Conda Matplotlib Enthought TensorFlow PyTorch MXNet PyPi Jupyter pandas People mentioned: Guido van Rossum (@gvanrossum) Robert Kern (Github: @rkern) Pearu Peterson (Github: @pearu) Wes McKinney (@wesmckinn) Charles Harris (Github: @charris) Francesc Alted (@francescalted) Fernando Perez (@fperez_org) Brian Granger (@ellisonbg) Other episodes: TensorFlow with Rajat Monga
49:31
January 27, 2021
Scylla with Dor Laor
Eric Anderson (@ericmander) and Dor Laor (@DorLaor) go under the hood of Scylla, the open-source NoSQL database designed for low latency and high throughput in big data applications. Dor and his team have reimplemented Apache Cassandra in C++ from scratch, with additional compatibility for DynamoDB. In today’s episode, Dor shares details on the exciting work coming out of ScyllaDB, including Seastar, their open-source C++ framework. Also, check out Scylla Summit 2021 to learn what’s next for Scylla. In this episode we discuss: Enabling Scylla to “gain control” by implementing Apache Cassandra in C++ How Dor and his co-founder were ahead of the curve with their vision for virtualization Scylla’s unique shard-per-core architecture Working with distributed teams, even before the COVID-19 pandemic The growing significance of separating the interface from the engine in open-source Learn about Project Circe, which is being featured at Scylla Summit 2021 right now Links: Scylla Seastar Scylla Summit 2021 Apache Cassandra DynamoDB MongoDB Redhat QEMU Redis Vectorized Apache Hadoop Apache HBase Apache Beam Apache Flink Apache Spark People mentioned: Avi Kivity (@AviKivity)
31:47
January 13, 2021
Gitpod with Sven Efftinge, Christian Weichel and Gero Posmyk-Leinemann
Eric Anderson (@ericmander) chats with Sven Efftinge (@svenefftinge), Christian Weichel (@csweichel) and Gero Posmyk-Leinemann (Github: @geropl) about their work on Gitpod, an open-source Kubernetes application that allows engineers to spin up a server-side dev-environment from a Git repository, all within their browser. The three team members are part of TypeFox, a consulting firm that specialized in developer tools for different companies before branching out into open-source projects. Upon Gero’s hiring at TypeFox, he was tasked with creating a minimum viable product for the idea that would eventually become Gitpod. Tune in to hear how shifting from consulting to working on their own open-source projects was a breath of fresh air for the developers at TypeFox. In this episode we discuss: How Gitpod solves the problem of switching between multiple dev environments, and improves deep code review The trap that many open-source founders fall into Why TypeFox wanted to switch from a consulting firm to a product shop Details on how Gitpod handles licensing Learn how you can instantly try out a Gitpod environment for any existing Github repository Links: Gitpod TypeFox Theia Kubernetes People mentioned: Anton Kosyakov (@akosyakov) Sid Sijbrandij (@sytses)
26:02
December 30, 2020
oso with Graham Neray
Eric Anderson (@ericmander) interviews Graham Neray (@grahamneray) about oso, the open-source policy engine for authorization. oso was originally born from a desire to make infrastructure and security easier for developers, which is why Graham and his company describe themselves as being in the “friction-removal business.” Listen to today’s episode to learn how the team at oso are working to put security in the hands of developers.  In this episode we discuss: Developers building RBAC (role-based access control) systems over and over again Why open-source is the best way to handle authorization logic The history behind oso’s core policy language, Polar How someone beat Graham to the punch submitting oso to a Python newsletter Comparing oso and OPA (Open Policy Agent) Links: oso Stripe Trulioo MongoDB Auth0 Show HN OPA Polar Adventure People mentioned: Sam Scott (@samososos) Alex Plotnick (Github: @plotnick) Stephen Olsen (@olsenator4) Other episodes: Presto on Contributor OPA on Contributor
28:45
December 16, 2020
TensorFlow with Rajat Monga
Eric Anderson (@ericmander) is joined by Rajat Monga (@rajatmonga), a co-creator of TensorFlow. Originally developed by the Google Brain team, TensorFlow is now one of the most popular open-source libraries for machine learning. The team at TensorFlow seek to “democratize” the world of AI as we know it, and by all accounts, they are succeeding. Listen to today’s episode to get inside one of the largest and most exciting open-source projects of the decade. In this episode we discuss: How TensorFlow compares to other open-source projects at Google Taking bets on launch day numbers Balancing the demands of different kinds of TensorFlow users Lessons from Keras and PyTorch Links: TensorFlow Keras  PyTorch Kafka Kubernetes MapReduce: Simplified Data Processing on Large Clusters Bigtable: A Distributed Storage System for Structured Data People mentioned: Jeff Dean (@JeffDean) Andrew Ng (@AndrewYNg) François Chollet (@fchollet)
29:41
December 02, 2020
Materialize with Frank McSherry
Eric Anderson (@ericmander) and Frank McSherry (@frankmcsherry) dive into Materialize, a source-available streaming database that lets engineers build real-time applications. Frank is a data processing expert whose work at Microsoft Research on the Timely and Differential Dataflow models culminated in the Materialize project. Tune in to today’s episode to learn how the team at Materialize are making the technology from cutting-edge data research accessible to a wider swath of users. In this episode we discuss: Sharing early ideas with an “academic open source” approach How Materialize made a commitment to correctness Frank’s developmental philosophy of iterative thinking Novel applications for the Materialize community Changing the way we approach problems with real-time data processing Links: Materialize Naiad: A Timely Dataflow System DryadLINQ Apache Arrow People mentioned: Arjun Narayan (@narayanarjun) Derek Murray (@mrry)
36:21
November 18, 2020
Cilium with Thomas Graf
Eric Anderson (@ericmander) speaks with Thomas Graf (@tgraf__) about Cilium, the open-source networking, observability, and security software for cloud-native applications based on eBPF. Thomas is the co-founder and CTO of Isovalent, which maintains both eBPF and Cilium. Listen to today’s episode for a discussion of how Thomas’ work has leveled up the Linux kernel and the possibilities of network infrastructure in a cloud-native world. In this episode we discuss: The impact of simultaneous development on Cilium and eBPF Google’s incorporation of Cilium Shortening the gap between writing kernel code and its deployment What JavaScript and eBPF have in common Cilium’s sister project, Hubble Links: Cilium eBPF Isovalent Red Hat OpenShift Kubernetes Docker New GKE Dataplane V2 increases security and visibility for containers SPIFFE Istio People mentioned: Brendan Gregg (@brendangregg) Other episodes: Istio on Contributor
29:29
November 04, 2020
Prefect with Jeremiah Lowin
Eric Anderson (@ericmander) and Jeremiah Lowin (@jlowin) discuss Prefect, a workflow management system and data orchestration tool under development as an open-source project. Jeremiah initially created Prefect to solve a technical challenge specific to his own work, but soon realized that it was appealing to a very wide range of different clients. Listen to today’s episode to learn why Jeremiah believes most attempts to build a unified framework for solving data orchestration fail. In this episode we discuss: Solving the “negative engineering problem” Learning from the complaints of data engineers at Apache Airflow The difficulty of having a product that serves two masters How COVID changed the direction of Prefect Links: Prefect Apache Airflow Why Not Airflow? People mentioned: Jim O'Shaughnessy (@jposhaughnessy) Patrick O’Shaughnessy (@patrick_oshag)
48:59
October 21, 2020
Open Policy Agent with Torin Sandall
Eric Anderson (@ericmander) catches up with Torin Sandall (@sometorin), co-creator of Open Policy Agent (OPA), the open-source, general-purpose policy engine. By focusing on demonstrating OPA’s value through case studies, targeted interviews, and word-of-mouth, Torin and the folks at Styra were able to grow OPA into the emerging standard for unified policy enforcement across the cloud-native stack. In this episode we discuss: When Netflix stumbled across OPA and delivered its “Cinderella moment” Why OPA was designed to be developer-centric The value of demonstrating OPA’s use cases to the industry How one user created an RPG engine with OPA Links: Open Policy Agent Styra OpenStack LinkerD Hacker News Kubernetes KubeCon OPA Gatekeeper conftest Corrupting the Open Policy Agent to Run My Games Envoy Styra Academy People mentioned: Tim Hinrichs (@tlhinrchs) William Morgan (@wm) Kevin Hoffman (@kevinhoffman) Other episodes: LinkerD on Contributor Envoy on Contributor
34:29
October 07, 2020
Temporal with Maxim Fateev
Eric Anderson (@ericmander) and Maxim Fateev (@mfateev) trace the development of Temporal, an open-source workflow orchestration engine. At Uber, Maxim co-created the project’s predecessor, Cadence, but Temporal’s roots stretch farther back to include lessons learned at Amazon and Microsoft. In this episode, learn how 18 years of experience in asynchronous messaging and workflows culminated in the foundation of Temporal. In this episode we discuss: Why Maxim quit Uber to start his own company Differences between Temporal and Cadence How Uber is filling the position that Google once had incubating open-source projects Maxim’s advice for aspiring open-source founders Related Links: Temporal Cadence Kafka HashiCorp BanzaiCloud Hacker News Andreesen Horowitz TChannel Hadoop People mentioned: Samar Abbas (@samarabbas77)
30:38
September 23, 2020
Dgraph with Manish Jain
Eric Anderson (@ericmander) and Manish Jain (@manishrjain) discuss the impact of Dgraph, an open-source database with a graph backend that Manish describes as “a search engine acting as a database.” Manish took a gamble when he chose GraphQL as his project’s query language shortly after its release by Facebook in 2015. Now, GraphQL has grown immensely in popularity and the bet has paid off, as Dgraph leads the cutting edge of databases in this new space. Make sure to check out the Dgraph team’s conference, “GraphQL In Space,” which will be held virtually on September 10th at graphqlcon.space. In this episode we discuss: How Manish was ahead of the curve at Google The chance circumstances in the Australian job market that led to Dgraph Building trust between open-source developers and their community Why the Dgraph team decided to hold their upcoming conference “In Space” The future of databases and GraphQL Related Links: Dgraph GraphQL In Space GraphQL Badger MongoDB BigTable Cassandra Spanner Elasticsearch People mentioned: Scott Kelly (@StationCDRKelly)
30:04
September 09, 2020
Presto with Martin Traverso, Dain Sundstrom and David Phillips
Eric Anderson (@ericmander) talks to Martin Traverso (@mtraverso), Dain Sundstrom (@daindumb) and David Phillips (@electrum32) about their collaboration on Presto, an open-source distributed SQL query engine for big data. The three engineers worked together at three different companies before deciding to solve an efficiency problem for data analytics at Facebook in 2012. Listen to today’s episode to learn about the careful planning and technical philosophy behind the development and design of Presto. In this episode we discuss: Starting an open-source project at Facebook in the early 2010s The importance of making Presto “dirt simple to install” What is “documentation driven development” Bootstrapping the growth of an open-source community How a single query caused a brownout across Facebook infrastructure Related Links: Presto Starburst Ning Netezza ProofPoint Hadoop Postgres Hive OpenCompute @Scale Arm Treasure Data Qubole People mentioned: Jay Parikh (@jayparikh)
38:29
August 26, 2020
Xanadu with Nathan Killoran
Nathan Killoran (@co9olguy) guides Eric Anderson (@ericmander) through the cutting-edge world of quantum machine learning at Xanadu, a quantum computing company that is innovating with its use of photonics. Nathan is Xanadu’s Head of Software, Algorithms, & Quantum Machine Learning, and has detailed insight on their main open-source software projects, StrawberryFields and PennyLane. On today’s episode, Nathan explains how the barrier to contributing may be lower than you think, even if you don’t have a PhD in quantum physics. In this episode we discuss: Designing software for Xanadu’s unique approach to quantum computing Machine learning, differentiable programming and more in the quantum domain How even high school students can contribute to an open-source quantum computing project Is there a road map for quantum machine learning? Nathan’s “blue sky” interview questions Links: Xanadu StrawberryFields PennyLane ProjectQ TensorFlow Quantum PyTorch Qiskit Pyquil Cirq Alpine Quantum Technologies Quantum Open Source Foundation Unitary Fund People mentioned: Christian Weedbrook, CEO of Xanadu (@_cweedbrook)
28:22
August 12, 2020
Clickhouse with Alexey Milovidov and Ivan Blinkov
Eric Anderson (@ericmander) talks to Alexey Milovidov (@alexey-milovidov) and Ivan Blinkov (@blinkov) about their work on Clickhouse, an open source analytical database from the team at Yandex. Originally designed to support Yandex.Metrica, word of this powerful tool spread rapidly inside the company, and the idea was hatched to make Clickhouse into a truly open source project. Tune in to learn about how Alexey petitioned management to accept what initially seemed like a “crazy” idea - and how the risk paid off. In this episode we discuss: Differences between Clickhouse and similar products Why some open source projects are more successful than others The history of open source at Yandex What makes a good open source developer Building an international community Links: Clickhouse Yandex.Metrica Altinity Postgres Oracle Infobright InfinityDB MongoDB Vertica Dremel: Interactive Analysis of Web-Scale Datasets (2010) CatBoost BEM Presto Druid Greenplum Apache Spark
42:10
July 29, 2020
LinkerD with William Morgan
Eric Anderson (@ericmander) chats with William Morgan (@wm), CEO of Buoyant and a creator of the open source service mesh, LinkerD. As a former infrastructure engineer at Twitter, William leveraged his experience there to help develop what would become effectively the first service mesh. Listen to today’s episode to find out how the team at Buoyant originally coined the term, and are continuing to define the concept today. In this episode we discuss: Pioneering the very first service mesh Why Buoyant rejected the open core model How the industry is shifting away from the “nights and weekends” community Rewriting LinkerD from scratch Links: LinkerD Buoyant Dive Kubernetes Docker Finagle HAProxy NGINX CNCF Prometheus Cisco Webex Istio
32:00
July 15, 2020
Chef with Adam Jacob
Full show notes and transcript. Eric Anderson (@ericmander) welcomes Chef co-founder Adam Jacob (@adamhjk) to talk about the popular open source service. He and co-founder Nathan Haneysmith originally started the company as a way to sell automation services to startups, but wanted to expand their abilities to serve more clients. From naming the company to governance and engaging with contributors, Adam dives into why it was important to him to go the open source route and how the business model works. In this episode we discuss: How Chef got started The decision to be open source What the business model looks like Contributors and community members Where Chef is today and where it’s headed Links Chef Puppet The Apache Software Foundation Docker Perl
40:45
July 01, 2020
Istio with Sven Mawson
Full show notes and transcript. Eric Anderson (@ericmander) and Sven Mawson (@smawson) dive into the past, present and future of Istio, an open source service mesh born of collaboration between IBM and Google. Sven is a Senior Staff Engineer at Google and co-founder of the Istio project. In today’s episode, he shares the story of how two titans came together for a tool that anyone can use and contribute to. In this episode we discuss: How Google asked IBM to drop their Amalgam8 project The involvement of Lyft, Envoy and Matt Klein (@mattklein123) Making moves at QCon A counter-intuitive marketing strategy What work still needs to be done Links Istio Google Cloud Endpoints Kubernetes Envoy QCon NGinX
31:24
July 01, 2020
Envoy Proxy with Matt Klein
Full show notes and transcript. Eric Anderson (@ericmander) and Matt Klein (@mattklein123) discuss the beginnings of Envoy Proxy, an open source proxy now governed by the CNCF. Matt is a software engineer at Lyft and creator of the Envoy. On today’s episode, Matt gives the inside scoop on the benefits and challenges of cultivating a self-sustaining open source community.   In this episode we discuss: How Matt’s experience at Twitter informed development of Envoy Working with Google The role of marketing in Envoy’s success Why building an open source community is like “total controlled anarchy” Finding the right contributors and maintainers Links: Envoy Proxy Finagle Hystrix NginX HA Proxy Istio CNCF
39:08
November 14, 2019
Alluxio with Haoyuan Li
Full show notes and transcript. Eric Anderson (@ericmander) hosts Haoyuan Li (@haoyuan), also known as H.Y., creator of Spark Streaming as well as the open source data orchestration system, Alluxio. H.Y. founded Alluxio, Inc. to further develop the research project that he first created as a doctoral student at UC Berkeley’s AMPLab. Listen to today’s episode to learn more about how H.Y. identified an opportunity to disrupt cloud storage with an open source project as his Ph.D. thesis. In this episode we discuss: H.Y.’s analysis of the data storage industry’s cyclical history How H.Y. balanced academics with the Alluxio community The 3 types of Alluxio contributors Use cases for Alluxio Links: Alluxio Spark Streaming Kubernetes Presto TensorFlow AMPLab
32:36
November 01, 2019
Contributor Trailer
Learn about Contributor, a podcast about the best open source projects and the communities that build them.
00:46
July 04, 2019