Data on Kubernetes Community
By Bart Farrell
The new Data on Kubernetes Community (DoKC) is an openly governed and self-organizing group of curious and experienced operators and engineers concerned with running data-intensive workloads on Kubernetes.
DoK Talks #90- Build Your Kubernetes Operator with the Right Tool! // Rafal Leszko
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK You want to build a Kubernetes Operator for your software. Which tool to choose? Operator SDK with Helm, Ansible, or Go? Or maybe start from scratch with Python, Java, or any other programming language? And what is the right phase in the Operator Capability/Maturity Model that you should provide? In my talk I'll present: - Different ways of building Kubernetes Operators - Demo of building the same Operator using different tools - Methods used by the most popular Operators (Couchbase, Prometheus, MongoDB) - Operator Capability Model and how it affects your toolkit - Our journey with Hazelcast Operator BIO Tech Lead of the Cloud-Native Team at Hazelcast, author of the book "Continuous Delivery with Docker and Jenkins", trainer, and conference speaker. He specializes in Java development, Cloud environments, and Continuous Delivery. A former employee in a number of companies and scientific organizations: Google, CERN, AGH University, and more.
September 17, 2021
DoK Talks #89- An introduction to Azure Arc Enabled Data Services // Chris Adkin
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Azure Arc enabled Data Services allows organizations to run the Azure dialect of database-as-a-service on Kubernetes on virtually any cloud or infrastructure. This session will cover the architecture of this new product, how it is deployed, the dialects of database engine that it currently supports and the storage considerations for deploying this service. BIO Chris has been working with databases for over twenty years in various roles ranging from DBA to developer to architect. Chris currently works at Pure Storage as a solutions architect and one of his main focus areas is databases running on Kubernetes. Chris is also a co-organizer of the Azure Cloud Native Data Platform meetup user group.
September 16, 2021
DoK Talks #88- NSA & CISA K8s hardening Guidance: data & secrets protection // Leonid Sandler
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK The National Security Agency (NSA) and the Cybersecurity and Infrastructure Security Agency (CISA) have published comprehensive recommendations for strengthening the security of an organization’s Kubernetes system to help companies make their Kubernetes environment more difficult to compromise. One of the key areas this report covers is how to configure K8s to prevent data breaches. In this session, Leonid Sandler, ARMO CTO and Co-Founder, will review this guidance and its impact on K8s data protection, and will reveal everything you should know before choosing a Kubernetes secret management method. BIO Leonid Sandler is CTO and co-founder of ARMO, with 20+ years of software security experience. NDS CTO Software Security. Designed Cloud Security products serving the world’s largest content providers. Built and managed a Product and Development team of over 130 people. KEY TAKE-AWAYS FROM THE TALK * How to use NSA & CISA K8s hardening guidance * How to protect K8s secrets
September 15, 2021
DoK Talks #87- Cloud Native Microgateway Architecture // Leonardo Silva
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Building a gateway API architecture is not an easy task. In this presentation Leonardo will talk about how the microgateway strategy can be used to resiliently and reliably expose APIs in a cloud native environment while increasing the productivity of engineering teams. BIO Experienced Senior Solutions Architect and DevSecOps Engineer with a demonstrated history of working in the information technology and services industry. Skilled in Microservices architecture, AWS Cloud Architecture, DevOps, and Kubernetes. KEY TAKE-AWAYS FROM THE TALK Use microgateway architecture to expose APIs instead of monolithic approaches. You can combine API gateways, Kubernetes and Istio features to increase your system's resilience without losing control of security and compliance. You can apply GitOps techniques to improve team performance in real world projects.
September 13, 2021
DoK Student Sessions- Intro to Apache Cassandra // Aleks Volochnev
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK What do companies like Apple, Netflix, Huawei or Instagram have in common? Aside from billions of requests, petabytes of data, and millions of users around the world, all of these companies use Apache Cassandra™ - the NoSQL decentralised database, well known by data experts for its scaling capabilities, high availability and incredible performance. Join us for the Introduction to Apache Cassandra workshop to learn the fundamentals of one of the most powerful databases of our time, take your first step into the NoSQL world and take your chance to get a voucher for a free Cassandra Developer or Cassandra Administrator certification exam. BIO After many years in information technologies as a developer, technical lead, DevOps Engineer and architect, Aleks focused on cloud computing and distributed systems. Professional Cloud Architect and Developer Advocate, he shares his knowledge and expertise in the field of high-performance and disaster-tolerant systems.
September 13, 2021
DoK Talks #86- Understanding performance design and benchmarking in Kubernetes // Dean Lewis
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK This session will talk about the considerations and tools available when it comes to performance and design of Kubernetes running in a virtualised environment. BIO Working at VMware as a Cloud Management specialist focusing on automation and cloud native technologies. KEY TAKE-AWAYS FROM THE TALK From this session you will learn what tools are readily available today to help you benchmark your platforms.
September 9, 2021
DoK Talks #83- (Further) Exploring DataOps and Kubernetes // Randy Abernethy
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Randy will discuss the increasing importance of DataOps to the modern analytics-driven enterprise and how it integrates with continuous development and operations environments. He’ll illustrate how Kubernetes enables DataOps across four common patterns used for continuous data integration: batch, streaming, microservice and ETL solutions. Randy will also explain how these design patterns – in concert with automation, CI/CD and containerization – can be applied to high-scale data integration scenarios, creating an environment that supports rapid development of elastic analytics solutions. BIO Randy Abernethy is a Managing Partner at RX-M, a cloud native advisory and training firm. He is a tech entrepreneur, startup advisor, financial technology pioneer, Apache Thrift committer, Cloud Native Ambassador and author. Randy’s work allows him to pursue his passion for large scale distributed systems and all things cloud native. Interacting with hundreds of technology professionals each year, he stays highly connected with the latest software and platform trends and developments. Randy enjoys collaborating and networking with individuals around the world.
September 9, 2021
DoK Talks #85- I lost my cluster, can I get it back? // Salman Iqbal
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK We have all heard about how Kubernetes is self healing. However, what happens if you lose your Kubernetes cluster and your data? Is there a way to recover that? This talk will explore the why and how of backing up and restoring a Kubernetes cluster. The talk will focus on three scenarios for backup and restore: - Kubernetes control plane - Applications - Data A comparison of different techniques will be presented with some live demos! BIO Salman works as an MLOps Engineer at Appvia and a Kubernetes Instructor at Learnk8s. He has worked with a number of organisations in setting up Machine Learning platforms for teams to operate at scale. He is also a Certified Kubernetes Administrator and a strong advocate for Cloud Native technologies. He co-founded Cloud Native Wales, an initiative to bring together a community that shares ideas on how to use cloud native technologies effectively. You can also find him on YouTube as Soulman Iqbal where he tries to explain cloud native concepts by simplifying them.
September 8, 2021
DoK Talks #84- Build your IDP with Otomi, the first and only Kubernetes-based O.S // Sander Rodenhuis
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Kubernetes is a platform for building platforms and is not really developer-friendly. That's why Red Kubes created Otomi, the first and only Kubernetes-based operating system. Install Otomi with a single Helm Chart, log in, and start deploying applications with only a few clicks. Use Otomi to build your Enterprise Container Platform or Internal Developer Platform in minutes. In this talk, I'm going to introduce Otomi. We'll start by looking at the architecture and how you can install Otomi on any Kubernetes cluster. But don't expect a long and boring slide deck: we'll quickly take a look at a live install and navigate through all the Otomi features. BIO Sander is the founder and CEO of Red Kubes, the company behind Otomi. Otomi is an open-source extension for Kubernetes to support the most common DevOps and SRE use cases. Before Sander founded Red Kubes, he worked as a freelance consultant for the last 12 years, doing a lot of cloud-native stuff with Azure, AWS, and of course Kubernetes. KEY TAKE-AWAYS FROM THE TALK - Now I can turn my Kubernetes cluster into my own Internal Developer Platform in minutes - With Otomi, developers can deploy containers without having to write any YAML manifests - With Otomi, you can provide developers with self-service and onboard new development teams onto the platform within minutes - Use Otomi to build your Enterprise-grade Container Platform - Otomi is completely FREE and open-source - And Otomi runs on any Kubernetes cluster in any cloud
September 3, 2021
DoK Talks #82- What we learned from 100 Kubernetes postmortems and how to prevent them // Shimon Tolts
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK A smart person learns from their own mistakes, but a truly wise person learns from the mistakes of others. When launching our product, we wanted to learn as much as possible about typical pains in our ecosystem, and did so by reviewing many post-mortems (100+!) to discover the recurring patterns, anti-patterns, and root causes of typical outages in Kubernetes-based systems. In this talk we have aggregated for you the insights we gathered, and in particular will review the most obvious DON’Ts and some less obvious ones, that may help you prevent your next production outage by learning from others' real world (horror) stories. BIO At 15 years old, Shimon started his first company, providing web hosting on top of Linux servers in the pre-cloud era. He established and managed the Software Engineering Infrastructure department for 400 engineers at ironSource. Also as an AWS Community Hero, Shimon runs the largest AWS user-group worldwide and is an avid speaker at conferences. Shimon co-leads the Israeli CNCF chapter. Today, Shimon is the CEO and Co-Founder at Datree, which prevents Kubernetes misconfiguration from reaching production. KEY TAKE-AWAYS FROM THE TALK How to prevent misconfiguration in Kubernetes workloads
September 1, 2021
DoK Talks #81- How to Alert on SLOs using Error Budget Burn Rate // Yuri Grinshteyn
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Join this session to learn: - why reliability is a key feature of your service - why SLOs are the best way to measure it and set reliability targets - how error budgets increase your velocity - what an error budget burn rate is - how to set up alert using burn rates to optimize your response and operational load BIO Yuri works with Google Cloud Platform customers to help them design, architect, build, and operate reliable applications and services and advocates for SRE principles and practices. He occasionally writes code in Node.js, Python, and Go and likes to play video games on the train ride to work.
August 27, 2021
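The burn-rate idea named in the SLO alerting session above can be sketched in a few lines. This is an illustrative sketch only, not material from the talk: it assumes the common definition in which the error budget is 1 minus the SLO target and the burn rate is the observed error rate divided by that budget. The `Window` type, the function names, and the paging threshold are hypothetical and chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class Window:
    """Request counts observed over one lookback window (hypothetical type)."""
    total_requests: int
    failed_requests: int


def error_budget(slo_target: float) -> float:
    # With a 99.9% availability SLO, 0.1% of requests may fail.
    return 1.0 - slo_target


def burn_rate(window: Window, slo_target: float) -> float:
    # A burn rate of 1.0 spends the budget exactly over the SLO period;
    # higher values spend it proportionally faster.
    if window.total_requests == 0:
        return 0.0
    error_rate = window.failed_requests / window.total_requests
    return error_rate / error_budget(slo_target)


def should_page(short: Window, long: Window, slo_target: float,
                threshold: float) -> bool:
    # Assumed policy: page only when both a short and a long window exceed
    # the threshold, so brief blips and stale incidents do not page anyone.
    return (burn_rate(short, slo_target) >= threshold
            and burn_rate(long, slo_target) >= threshold)


if __name__ == "__main__":
    long_window = Window(total_requests=10_000, failed_requests=144)
    short_window = Window(total_requests=1_000, failed_requests=20)
    print(burn_rate(long_window, slo_target=0.999))
    print(should_page(short_window, long_window, slo_target=0.999, threshold=10.0))
```

Requiring two windows to agree is one common design choice; the threshold and window lengths would need tuning per service.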
DoK Talks #80- Introduction To Keptn // Brad McCoy & Adam Gardner
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Keptn is an event-driven orchestration platform with SRE best practices (declarative pipelines, quality gates and automated remediation) available "out of the box". This talk will be an introduction to Keptn, the core use cases and Keptn in the wider IT ecosystem. BIO Brad started his career as a Software Engineer at TripAdvisor and then got into DevOps and moved into Fintech where he led the Cloud Engineering Innovation team at Temenos. Brad is an advocate for cloud-native and open source technologies. He contributes to the CNCF Security Technical Advisory Group. He also runs the GitOps working group for Ortelius, a project of the CD Foundation. Adam is an Automation Architect and Keptn contributor with a hint of Java development and Search Engine Optimisation. SCUBA diver and craft beer drinker. Not all at the same time.
August 27, 2021
DoK Talks #79- Running Apache Pulsar in Kubernetes // Chris Bartholomew
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK When I founded Kesque in 2019, my goal was to use Kubernetes as the base platform for all our software. Because Kesque was a cloud-based SaaS powered by Apache Pulsar, that meant getting Apache Pulsar, a high-performance streaming solution and Kafka alternative, up and running in Kubernetes. In this talk, I will give an overview of Apache Pulsar and describe how we got Pulsar up and running in Kubernetes. We will cover some of the features of Pulsar that make it "cloud-native" and easy to run in Kubernetes as well as some of the challenges we faced and how we solved them. Kesque was acquired by DataStax, which is a strong supporter of Kubernetes. I will also cover how we continue to use Kubernetes as the foundation for the work we are doing at DataStax around Apache Pulsar. BIO Chris Bartholomew is a Streaming Engineering Leader at DataStax. He has been working with high-performance pub–sub systems for over a decade. He has tested, supported, and operated messaging systems that are deployed in banking, capital markets, and transportation industries. He was the founder and CEO of Kesque, a cloud-based managed service built around Apache Pulsar that was acquired by DataStax.
August 26, 2021
DoK Talks #78- Leverage AI, Big Data workload on Kubernetes with Volcano // Kevin (Zefeng) Wang
ABSTRACT OF THE TALK Volcano is a system for running high-performance workloads on Kubernetes. It features powerful batch scheduling capability that Kubernetes cannot provide but is commonly required by many classes of high-performance workloads, including Machine Learning, Deep Learning, Big Data, Bioinformatics Computing, etc. These types of workloads typically run on generalized domain frameworks like TensorFlow, Spark, PyTorch, MPI, etc. In this talk, Kevin will introduce how Volcano is integrated with these frameworks to allow users to run their applications without extra adaptation efforts while enjoying remarkable batch scheduling. BIO Kevin Wang has been a contributor to the CNCF community since its beginning. He is the leader of the cloud native open source team at Huawei and co-founder of the KubeEdge and Volcano projects. Kevin has contributed to Kubernetes upstream for years and now spends 100% of his work and focus on wider open source cloud native community development.
August 26, 2021
DoK Talks #77- Kubernetes Essential Tools // Javier Ramos
ABSTRACT OF THE TALK In this talk I will try to summarize my favorite tools for Kubernetes with special emphasis on the newest and lesser known tools which I think will become very popular. BIO Javier has over 15 years of experience in the IT industry as an enterprise architect with special emphasis on Big Data and DevOps. He has worked in small startups and also large multinationals such as Oracle, IBM or Cisco. He holds several certifications in AWS, GCP and Azure and writes regularly on Medium.com. Certified Java Architect/AWS/GCP/Azure/Kubernetes: Microservices/Kubernetes/Serverless, AWS/GCP Kafka/Akka/Flink/Spark, AI, JS/React/Angular/PWA
August 25, 2021
DoK Talks #71 Introducing Kubestr: A new way to benchmark your Kubernetes storage // Michael Cade
Abstract of the talk… Benchmarking storage is not a new concept; it has been done for a long time. But have we overlooked the benchmarking capabilities, or at least the ease with which to achieve this, in a cloud-native, container-based, Kubernetes landscape? There has been a rise in stateful workloads and support around persistent storage in Kubernetes is improving. Now we can take our traditional workloads such as SQL Server, Oracle and SAP alongside our data stores for microservices with the same storage system for MongoDB, Cassandra, Redis, MySQL and PostgreSQL. With each of these stateful applications having different performance requirements, it becomes necessary to benchmark the storage backing these Persistent volumes. The CSI (Container Storage Interface) is the standard for creating custom components to work with data storage. This has enabled many more storage vendors to adapt their platforms to the cloud-native approach and offerings. All of this is great, but how do we ensure that the right datastore is used to achieve the performance required for our microservices running these stateful workloads? Bio… A community first technologist for Kasten by Veeam Software. Based in the UK with over 16 years of industry experience with a key focus on technologies such as cloud-native, automation & data management. His role at Kasten is to act as a technical thought leader, community champion and project owner to engage with the community to enable influencers and customers to overcome the challenges of Cloud-Native Data Management and be successful, speaking at events sharing the technical vision and corporate strategy whilst providing ongoing feedback from the field into product management to shape the future success. Key take-aways from the talk… Kubestr can assist here in three ways: · Identify the various storage options present in a cluster. · Validate if the storage options are configured correctly. · Evaluate the storage using common benchmarking tools like FIO.
August 20, 2021
DoK Student Sessions- Why you should learn the Julia Programming Language // Logan Kilpatrick
Abstract of the talk… In this talk, we will go over what Julia is, why you would want to learn it, and how to contribute to the ecosystem. Bio… Logan is the Community Manager for the Julia Programming Language, a member of the Board of Directors at NumFOCUS, and a Machine Learning Engineer. Outside of work, Logan is a graduate student at Harvard University and Northwestern University's Pritzker School of Law. Key take-aways from the talk… Viewers will leave with a high-level understanding of the Julia ecosystem, the benefits the language provides, how to contribute to it, and more.
August 20, 2021
DoK Talks #76 AHOY - open source release manager for Kubernetes // Neil White
Abstract of the talk… AHOY is a release manager for Kubernetes, fully open source. Bio… I thought, this is easier explained, in, interpretive rap... Look, if you had one shot, one opportunity To fix everything that was ever broke In one deployment Would you rebuild it, or just let it fail? Yo His palms are sweaty, config weak, server load is heavy There's vomit on this keyboard already, Kubernetes He's nervous, but the control plane says Nodes are ready Fork Bomb, but he keeps on forgetting Commands he wrote down, to restart all the pods He opens up Wikis, but the page won't load out Server's chokin now, everybody's phoning now. Release window runs out, times up, over bloaw! Roll back to the previous, oh this is grievous Oh there goes docker, it's borked! He's mad cause K8 don't, downgrade that easily? No He won't have it, he knows, manual deploys back’s on the ropes It don’t’ matter He's dope, he knows that, but he broke He's whole Openstack So when he goes back to his mobile app,that's when it's Back to the Trello Board yo, this whole Agile thing sux He better go capture the screen logs next time using tmux You better lose yourself in this DevOps, the SecOps You own it, you better never let it go Ho! You only get one shot, do not miss your change to Window This opportunity comes once in a weeks time Yo! So that is what I do on a typical day. For those who want the more traditional summary.... Open source consultant specializing in enterprise grade solutions based on open source software and open source integration. Worked with multiple DevOps teams inside numerous South Africa companies, to help bring these worlds together. Then watching the worlds explode. Sometimes for the good, sometimes for the bad. Currently a strong focus on Kubernetes. And then all the usual toppings, GitOps, CI/CD tooling, automation, test-driven infrastructure, and everything cloud native. 
Key take-aways from the talk… It automatically incorporates GitOps, auto-generation of Helm charts, all stored on Git.
August 19, 2021
DoK Talks #75 Run production-grade Databases on Kubernetes using KubeDB // Tamal Saha
Abstract of the talk… Kubernetes has emerged as the de-facto way to deploy modern containerized apps on cloud or on-premises. Despite all that growth on the application layer, it remains challenging to run a database in a distributed environment like Kubernetes. However, many developers want to treat data infrastructure the same as application stacks. Operators want to use the same tools for databases and applications and get the same benefits as the application layer in the data layer: rapid spin-up and repeatability across environments. This is where KubeDB by AppsCode comes as a solution. KubeDB by AppsCode is a production-grade cloud-native database management solution for Kubernetes. KubeDB simplifies and automates routine database tasks such as provisioning, patching, backup, recovery, failure detection, and repair for various popular databases on private and public clouds. Bio… Tamal Saha is the Founder/CEO of AppsCode. Tamal was born and raised in Bangladesh. He completed his BSc in CSE from the top engineering school in Bangladesh. He came to the USA as a Ph.D. student at the University of Virginia in 2009. After finishing his MSc, he dropped out of the PhD program and spent the next 5 years building core infrastructure software that powers Amazon retail and Google Ads business. While at Google in 2015, he became an early adopter and contributor to Kubernetes. Since then, he has developed well known projects in the Kubernetes space like KubeDB, Stash, Kubeform, KubeVault, Voyager, etc. Key take-aways from the talk… - Kubernetes is a viable platform for running production-grade databases. - Helm charts are not enough for Day 2 life cycle management of databases. - Users really need an operator based solution for running databases in Kubernetes.
August 18, 2021
DoK Talks #74 Introducing LitmusChaos 2.0 // Prithvi Raj & Sayan Mondal
Abstract of the talk… Chaos Engineering in the past few years has evolved with the development of the Cloud-Native paradigm and with the introduction of the LitmusChaos tool which helps curate Cloud-Native Chaos Engineering. Belonging to the CNCF ecosystem, Litmus stresses upon applying Chaos Engineering to modern architectures and building resilient applications simultaneously. The features and ideas majorly surround Community Collaboration, Observability, GitOps, Open APIs & much more. Chaos testing is seen as a must for all Cloud-Native practitioners emphasizing reliability as a container challenge. This talk stresses the evolution of Chaos Engineering with the introduction of the LitmusChaos 2.0 Bio… Prithvi Raj has been an active community manager in not only building LitmusChaos, the CNCF project based on Cloud-Native Chaos Engineering but also the overall Chaos Engineering community by working extensively as a co-organizer of the global conference Chaos Carnival as well as hosting monthly meetups at the CNCF Kubernetes Chaos Engineering Meetup group. His love for Chaos Engineering started in 2020 and has been growing ever since leading to the active adoption of chaos testing as a means of testing. He has also worked as a co-organizer for KCD Bengaluru 2021 and has delivered talks at various forums including KCD Africa, Cloud-Native Students, etc. Other than this Prithvi loves listening to Indie pop music and is an ardent fan of Liverpool FC. Watching cricket and wrestling are also his leisure activities. He loves collecting currencies as a hobby! Sayan currently works at ChaosNative as a Chaos Engineer and contributor to Litmus. He helps design, architect, and execute the Litmus Web with precision. He is not only passionate about development but also curious about breaking stuff with the practice of Chaos Engineering.
August 18, 2021
DoK Talks #73- Build Reproducible Experiments with Kubeflow and lakeFS // Barak Amar
Abstract of the talk… Kubeflow is a cloud-native ML platform that simplifies the training and deployment of machine learning pipelines on Kubernetes. lakeFS is a wrapper layer around an object store that enables git-like operations such as branching and committing over datasets. Learn how to build ML workflows that are portable, scalable, and reproducible by integrating lakeFS operations into your Kubeflow pipeline components. Bio… Barak is a software developer of the open source lakeFS project at Treeverse. He has more than 15 years of experience in a wide range of systems, from scheduling to security. He’s passionate about three things: programming languages, distributed systems, and long-distance running.
August 13, 2021
DoK Talks #72- Highly available, pluggable and long term storage metrics for everyone. Extending Prometheus with Thanos // Wiard van Rij
Abstract of the talk… Prometheus was initially made for short metric retention to answer questions on “what is happening ‘now’”. It is a strong project that solves certain problems really well, but still as a monolith when doing so. Thanos has been made to enable scaling, highly available setups and long term (cheap) storage for Prometheus. Everyone could leverage Thanos for these features. It does not stop there; Thanos has multiple components that could be used for multi-cluster telemetry, remote writes, and multi-tenancy. We want to introduce everyone to Thanos, explaining the use cases and how it could benefit your stack now that observability has become such an important factor in tech. Bio… Wiard van Rij’s main focus is in the field of observability at Fullstaq. As a consultant he is helping people, teams, and organizations with various cloud-native challenges with a strong focus on Kubernetes and Observability. Wiard is a Thanos team member, open source enthusiast and has extra fun with security and hacking. Key take-aways from the talk… - Introduction to Prometheus and Thanos - How to extend your stack(s) for highly available and long term metrics - By leveraging the right tools and services, one can have a rich set of features which are also cost effective
August 11, 2021
Intro to Persistence in Kubernetes
Abstract of the talk… So you want to run stateful containers? A brief intro to persistence in Kubernetes. Kubernetes and persistent storage go together like oil and water. Kubernetes is inherently an ephemeral system and persistent storage by definition must survive. After this talk, you should have a clear understanding of how to get started on the path to successfully managing a persistent data storage solution on your Kubernetes cluster. Bio… Kunal - Junior pursuing Computer Science & Engineering. Co-founder at Code for Cause. CNCF Intern 2020. MLH Coach. Google Summer of Code Mentor. YouTuber. Gold Microsoft Learn Student Ambassador. Eric has worked with distributed systems for the better part of a decade. From databases to containerization. He has been an architect, developer, and network engineer. Currently he is in developer relations trying to bring his enthusiasm for tech to the rest of the world.
August 6, 2021
DoK Talks #70 - YugabyteDB - Distributed SQL Database on Kubernetes // Amey Banarse
Abstract of the talk… Kubernetes has hit a home run for stateless workloads, but can it do the same for stateful services such as distributed databases? Before we can answer that question, we need to understand the challenges of running stateful workloads on, well anything. In this talk, we will first look at which stateful workloads, specifically databases, are ideal for running inside Kubernetes. Secondly, we will explore the various concerns around running databases in Kubernetes for production environments, such as: - The production-readiness of Kubernetes for stateful workloads in general - The pros and cons of the various deployment architectures - The failure characteristics of a distributed database inside containers In this session, we will demonstrate what Kubernetes brings to the table for stateful workloads and what database servers must provide to fit the Kubernetes model. This talk will also highlight some of the modern databases that take full advantage of Kubernetes and offer a peek into what’s possible if stateful services can meet Kubernetes halfway. We will go into the details of deployment choices, how the different cloud-vendor managed container offerings differ in what they offer, as well as compare performance and failure characteristics of a Kubernetes-based deployment with an equivalent VM-based deployment. Bio… Amey is a VP of Data Engineering at Yugabyte with a deep passion for Data Analytics and Cloud-Native technologies. In his current role, he collaborates with Fortune 500 enterprises to architect their business applications with scalable microservices and geo-distributed, fault-tolerant data backend using YugabyteDB. Prior to joining Yugabyte, he spent 5 years at Pivotal as Platform Data Architect and has helped enterprise customers across multiple industry verticals to extend their analytical capabilities using Pivotal & OSS Big Data platforms. 
He is originally from Mumbai, India, and has a Master's degree in Computer Science from the University of Pennsylvania (UPenn), Philadelphia. Twitter: @ameybanarse LinkedIn: linkedin.com/in/ameybanarse/
August 4, 2021
DoK Talks #67- Run Apache APISIX in Kubernetes // Jintao Zhang
Abstract of the talk… Apache APISIX is a dynamic, real-time, high-performance API gateway. You can use Apache APISIX to handle traditional north-south traffic, as well as east-west traffic between services. It can also be used as a k8s ingress controller. In this talk, Jintao Zhang will introduce how to run Apache APISIX on k8s and how to use Apache APISIX as an ingress controller. Bio… Apache APISIX committer, Kubernetes ingress-nginx reviewer
August 2, 2021
DoK #69- To Certify or Not to Certify, is Kubernetes Certification Worth it? // Keith McClellan
Abstract of the talk… As an engineer, should I consider getting a certification? What makes a certification valuable to me or my employer? How do I pick which one to get? Will these really help me build stateful applications on Kubernetes? In this talk, we will discuss the relative value of certifying on different technologies, with a specific focus on CNCF certifications for administration of k8s and developing Kubernetes-native applications. In this session we will discuss: - The pros and cons of getting certified - Why your current and future employers might care about your certifications - What are other things you can do to make yourself a more attractive candidate in this cloud-native landscape And of course, since Keith is a long-time database geek, we'll talk about how these might help you (or not) build stateful applications on Kubernetes. Bio… Keith McClellan is the Director of Partner Solutions Engineering at Cockroach Labs. He is responsible for building CockroachDB-based solutions with our largest technology partners, including Kubernetes and the broader open-source ecosystem. He spearheaded Cockroach Labs' Kubernetes operator project, acting as the technical lead on the project and being a primary contributor to making that the best way to run CockroachDB on Kubernetes. Prior to Cockroach Labs, Keith has held technical leadership positions in cloud-native and big data companies including DataStax, Mesosphere (now D2IQ), and Platfora.
August 2, 2021
DoK Talks #68- The Kubernetes-native way to providing database services to developers // Adam Sandor
Bio… Adam is a Solutions Architect at Styra, helping companies adopt Cloud Native tech. Coming from a Java-dev background he is most excited about the space where software development and operations meet. When he is not flying in DCS World he is hacking on CI/CD, GitOps and Kubernetes Operators. Abstract of the talk… As Kubernetes is becoming the infrastructure platform of choice in many companies, database teams are struggling with the question of whether to run databases on the cluster or outside of it. This talk will not answer that age-old question. I would rather focus everyone's attention on the developer experience. Modern CI/CD processes need development teams to be flexible and able to deliver without having to ask other teams for resources. So how can a database team provide reliable service while upping their game in DevEx? The answer in Kubernetes-land is providing Custom Resources backed by operators which handle database provisioning on or off the cluster in a way that fits into a GitOps CI/CD workflow. In this talk I will explain the concepts and dive into how you can make your own operator to provide a self-service interface for developers.
July 28, 2021
DoK #66 Crossplane Packages as a Distribution Mechanism // Daniel Mangum
Abstract of the talk… A typical user's journey with Crossplane starts with provisioning infrastructure using the Kubernetes API, then evolves to composing infrastructure into higher level abstractions, and culminates with building a complete platform using packages. Crossplane packages are distributed as OCI images, meaning that a platform API can easily be reproduced in any cluster, and they can declare dependencies, which specify the lower level services that support the higher level abstractions. This functionality allows for companies to distribute their product in an infrastructure provider-agnostic manner, and for infrastructure admins to build internal platforms made up of both generic and organization-specific components. Bio… Daniel Mangum is a senior software engineer at Upbound where he is a maintainer of Crossplane, an open source CNCF project. He has held leadership positions in the Kubernetes community, and is an active participant in multiple other open source efforts. When not working in the Cloud Native space, Daniel spends his time writing, speaking, and building tooling for the RISC-V ISA. Key take-aways from the talk… This talk will be useful for folks building an internal infrastructure platform, as well as folks that build a product that depends on some form of infrastructure (databases, caches, blob storage, etc.). We will cover how to both build and consume packages, paving the way for advanced usage of Crossplane.
July 21, 2021
DoK #65 Using Kubernetes and ClickHouse to enable high performance app analytics // Robert Hodges
Abstract of the talk… Embedded analytics are a major source of value to application users. Virtually every SaaS offering has them or is adding them now. This talk shows how to build low latency analytic applications on Kubernetes with ClickHouse, a popular, open source data warehouse. We'll start with the ClickHouse Kubernetes Operator to manage data warehouses, then cover ingest and visualization options to build complete apps. Since this is a K8s talk, we'll of course geek out on the underlying plumbing as well. Bio… Robert is a database geek. He started with M204 in the early 1980s and is now working on ClickHouse 20 years later, with digressions into virtualization, SaaS, and security topics. He met up with Kubernetes in 2018 at VMware. Robert is CEO of Altinity, which enables enterprises to use ClickHouse successfully. Key take-aways from the talk… 1. What are embedded analytics? 2. How can you build them with Kubernetes? 3. What are some of the design issues you'll need to consider?
July 16, 2021
DoK #63 Stranger Danger - Kubernetes Edition // Matt Jarvis
Abstract of the talk… Kubernetes is a powerful set of abstractions, but its flexibility and configurability mean it's pretty insecure by default. In this hands-on talk, I'll show how an attacker can expand the blast radius of an exploit from a vulnerable web application in a container to owning the entire cluster. I'll also cover some ways in which you can prevent this happening to you! Bio… Matt Jarvis is a Senior Developer Advocate at Snyk. Matt has spent more than 15 years building products and services around open source software, on everything from embedded devices to large scale distributed systems. Most recently he has been focused on the open cloud infrastructure space, and on emerging patterns for cloud native applications. Matt is a regular speaker at conferences across the world, including Open Infrastructure Summit, FOSDEM and All Things Open, a past winner of the OpenStack Outstanding Community Contributor award, and in 2021 was named one of the Top 100 influencers in Open Technologies in the UK. Matt is also a board director of OpenUK, and the founder of Cloud Native Manchester and Cloud Natives UK. Key take-aways from the talk… Kubernetes security considerations
July 13, 2021
DoK #62 Easy Kubernetes Volumes using Longhorn // Saiyam Pathak
Abstract of the talk… Longhorn is a lightweight, reliable, and powerful distributed block storage system for Kubernetes. It is an open source tool that can be installed on any Kubernetes cluster. It offers incremental snapshots and backups, which can be stored on NFS or S3-compatible object storage. In this talk, you will learn about Longhorn, its features including backup/recovery, and how to get the most out of it for your persistent Kubernetes volumes. You will also be shown the UI to understand the features better. Bio… Saiyam is working as Director of Technical Evangelism at Civo with a focus on defining the Civo cloud platform for simplifying Kubernetes and making it accessible for developers. Previously at Walmart Labs, Oracle, and HP, Saiyam has worked on many facets of k8s including machine learning platform, scaling, multi-cloud, managed k8s services, and k8s documentation. He’s worked on implementing Rancher and Influx in different organizations. When not coding, Saiyam works on contributing to the community by writing blogs and organizing local meetups for k8s, Rancher, and Influx. He is also an Influx ACE, Traefik Ambassador, CNCF ambassador, and can be reached on Twitter @saiyampathak. Key take-aways from the talk… The audience will get to know about Longhorn, what it is, its features and how to use it. If time permits I can walk through the UI that will give more insights into the product.
July 13, 2021
DoK #61 Perfecting Machine Learning Workloads on Kubernetes // Lars Suanet
Abstract of the talk… More and more applications are powered by Machine Learning (ML) models. Where the gap between Software Engineers and a Production environment on Kubernetes is already big, the gap between Data Scientists and that same production environment is enormous. In this talk, we will provide you with a framework for translating ML requirements into infrastructural requirements and concrete Kubernetes resources. In the first half of this talk, we will discuss how ML applications are different from most other applications, how ML workloads are structured and how ML requirements translate into Kubernetes resource configurations. In the second half of the talk, we will put this theory into practice. We will do a live demonstration of an ML Deployment on Kubernetes using Istio, Knative and Kubeflow Serving. Bio… Lars Suanet is a Software Engineer at Deeploy. With his background in Computer Science and his interest in AI, he tries to bridge the gap between Data Scientists and DevOps. His personal interests are Chinese culture, Distributed systems, Meditation and Plants.
July 2, 2021
DoK #60 Intro to Kubernetes // Aitor Artola & Kunal Kushwaha
Abstract of the talk… In this event we will introduce Kubernetes, containers and the cloud native initiative. You will get an overview of the benefits of containers running on Kubernetes and the new mindset that it requires, a new mindset driven by the cultural change the cloud native initiative is promoting. Concepts related to microservices and automation will be covered, giving an overview of the different kinds of open tools you can find in the cloud native ecosystem to build and run modern applications in the cloud. Sections 1.- Introduction to Kubernetes & Cloud Native 2.- Docker & Containers. Microservices 3.- Kubernetes. The container orchestrator 4.- Cloud Native with Kubernetes. Modern applications 5.- Cloud Native tool landscape
July 2, 2021
Postgres on Kubernetes Hands-On-Lab // Álvaro Hernández
Abstract of the talk… From 0 to 60/100 (depending on where you live) in just 2h! It may sound "slow" if you talk about cars, but when you talk about databases in general and Postgres in particular, it isn't! From an empty Kubernetes cluster, you will leave the session with one or more Postgres clusters created, all with: high availability and automatic failover; automated backups with lifecycle; distributed logs with lifecycle; Web Console to manage it all; connection pooling; tuned Postgres and connection pool configurations; and any number of installed Postgres extensions. All this in just a single session! BYOK (Bring Your Own Kubernetes): come prepared to the session with a Kubernetes cluster, and ready to create YAMLs and deploy via GitOps; and to do the same using the Web Console, for point-and-click lovers. And learn how to automate Postgres Day2 operations! This session is a tutorial on production quality Postgres clusters based on the open source StackGres.io platform. Go from Zero to Postgres Hero in just one Hands-on-Lab! Bio… Álvaro is a passionate database and software developer. He is the Founder & CEO of OnGres (https://ongres.com). He has been dedicated to PostgreSQL and R&D in databases for two decades. An open source advocate and developer at heart, Álvaro is a well-known member of the PostgreSQL Community, to which he has contributed by founding the non-profit Fundación PostgreSQL (https://postgresql.fund) and the Spanish PostgreSQL User Group. You can find him frequently speaking at PostgreSQL, database, cloud, and Java conferences. Every year, Álvaro travels approximately three to four times around the globe, and in 2020 (or now more likely 2021 due to COVID-19) he will hit the milestone of having delivered 100 tech talks. AWS Data Hero (2019).
July 1, 2021
DoK #59 Let's get Real: SRE | Do we need it? // Benoit Schipper
Abstract of the talk… More and more companies around the world are adopting SRE. Despite Google's great book series regarding SRE, there is no default implementation for SRE. Join me and watch me explain this by taking my home country as an example =). Bio… I work as an SRE for several customers at hcs-company.com. My past as a network engineer in the global Business Process Outsourcing (BPO) business, in combination with my studies and passion for getting things done, has landed me in the SRE role. For me a perfect fit! From Business to Code, from Infrastructure to end-users. A great position where you get to see the big picture! Key take-aways from the talk… In this talk I describe that there is no singular way of implementing SRE throughout the world. Many companies have had structures in place for several decades that can make it several times harder (or easier) to implement SRE. In my talk I explain how I see SRE within the Netherlands, my home country.
June 23, 2021
DoK #58 Benchmarking for PostgreSQL workloads in Kubernetes // Gabriele Bartolini & Francesco Canovai
Abstract of the talk… Databases like PostgreSQL cannot run on Kubernetes. That’s the refrain we hear all the time, as well as the motivation for us to break this barrier, once and for all. Hear the story of our journey so far about bringing PostgreSQL to Kubernetes. Discover why we believe that benchmarking both the storage and the database before production leads to a healthier and longer experience of the DBMS, even in Kubernetes. We’ll be sharing our process, and the results obtained so far, and unveil our plans for the future. Bio… A long time open-source programmer and entrepreneur, Gabriele has a degree in Statistics from the University of Florence. After having consistently contributed to the growth of 2ndQuadrant and its members through nurturing a lean and devops culture, he is now leading the Cloud Native initiative at EDB. Gabriele lives in Prato, a small but vibrant city located in the northern part of Tuscany, Italy - famous for having hosted the first European PostgreSQL conferences. His second home is Melbourne, Australia, where he studied at Monash University and worked in the ICT sector. He loves playing the Blues with his Fender Stratocaster, but his major passions are called Elisabeth and Charlotte! Francesco is a senior software engineer and member of the EDB Cloud Native team. With a background in system and PostgreSQL administration, he works on automation, CI/CD, testing, and compliance for the Cloud Native PostgreSQL operator. Key take-aways from the talk… 1. A methodology for benchmarking a PostgreSQL database in Kubernetes 2. Open source set of tools for benchmarking a PostgreSQL database in Kubernetes 3. Reasons why benchmarking the storage and the database is important
June 18, 2021
DoK #57 Key Criteria for Evaluating Kubernetes Data Storage // Enrico Signoretti
Abstract of the talk… Enterprises of all sizes are embracing hybrid cloud strategies that are ever more complex and structured, moving quickly from a first adoption phase, where data and applications are distributed manually and statically across different on-premises and cloud environments, to a new paradigm in which data and application mobility is the key to flexibility and agility. Now organizations want the freedom to choose where applications and data should run dynamically, depending on any number of business, technical, and financial factors. Kubernetes is instrumental in executing this vision, but it needs the right integration with infrastructure layers—such as storage—to make it happen. Bio… Enrico has more than 25 years in technical product strategy and management roles. He has advised mid-market and large enterprises across numerous industries, and worked with a range of software companies from small ISVs to global providers. Enrico is an internationally renowned expert on data storage—and a visionary, author, blogger, and speaker on the topic. He has tracked the evolution of the storage industry for years, as a Gigaom Research Analyst, an independent analyst, and as a contributor to the Register. Key take-aways from the talk… - Major differences between traditional storage with CSI compatibility and K8 storage - How to evaluate data storage solutions for K8s - Important evaluation metrics to consider and why - The market landscape
June 17, 2021
DoK #56 It's just a SQL - Crash course on Synapse Serverless for T-SQL ninjas! // Nikola Ilic
Abstract of the talk… Are you a seasoned T-SQL developer, used to solving each and every challenge by writing plain old SQL? But now you need to leverage data coming from semi-structured or unstructured sources? What if I tell you that you can get your mission accomplished by writing your favorite T-SQL syntax? In this session, you will learn what a Serverless SQL pool within Azure Synapse Analytics is, how it works behind the scenes, and how you can preserve your "T-SQL Ninja" status even when dealing with data coming from CSV and Parquet files, or from a NoSQL database. Bio… I'm making music from the data! PowerBI and SQLServer addict, MCT, Pluralsight Author, blogger, speaker... Interested in everything related to data - always eager to extract valuable info from raw data in the most effective way. Multi-year experience working with (predominantly) Microsoft Data Platform (SQL Server, SSAS, SSIS, SSRS, and Power BI). Father of 2 and true football (and Barca) fan!
June 16, 2021
DoK #55 How to optimise operations and life cycle management for containers? // Rajalakshmi Srinivasan
Abstract of the talk… Modern applications are built to run on containerized infrastructure. Businesses are also migrating their existing apps from traditional to container deployments. In such a scenario, gaining end-to-end visibility of the complete Kubernetes container environment is an important challenge for IT Operators/Administrators. In this talk, we will cover the following. - New-age business complexities. - How applications are moving from monolith to microservice architecture. - Operational challenges in monitoring the container architecture. - Strategies to efficiently manage the life cycle of the containers. Bio… Rajalakshmi Srinivasan, or Raji as she’s fondly called, is a Director of Product Management at Zoho Corp. In her 21-year journey with the company, she has gained extensive experience ranging from database scaling and network monitoring to application performance tracking and end-user-experience management. She currently leads the application performance monitoring section in Site24x7 - a cloud-based all-in-one monitoring product from Zoho Corp. Her greatest asset though is the profound wisdom that she has gathered through the years on how to innovate and improvise on an idea, how to rally a team around it, and how to execute it to perfection consistently ahead of the competition. She has an engineering degree in Computer Science from The College Of Engineering, Guindy (CEG), Chennai. She is a fitness freak. When she is not working, you can spot her running marathons, participating in clean-up drives, and traveling places along with her 3 little ones.
June 11, 2021
DoK #54 Putting Chaos into Continuous Delivery - How to increase the resiliency of your applications // Jürgen Etzlstorfer
Abstract of the talk… Continuous Delivery practices have evolved significantly with the cloud-native paradigm. GitOps & Chaos Engineering are at the forefront of this new CD approach, with an ever-increasing pattern involving Git-backed pipeline definitions that implement “chaos stages” in pre-prod environments to gauge service-level objective (SLO) compliance. In this talk, Juergen Etzlstorfer (maintainer of Keptn CNCF project) will discuss how you can construct pipelines that include chaos experimentation (using LitmusChaos) while simulating real-world load, and implement quality gates (based on SLOs) to ensure only resilient applications are deployed into production. He will also demonstrate how you can add chaos tests to your existing CD pipelines without needing to rewrite them. Bio… Jürgen is a Technology Strategist at Dynatrace and a maintainer of the Keptn open-source project. Within Keptn, he is working with the broader open-source tooling landscape on tool integrations and takes care of its growing community & ecosystem. He also loves to share his experience, most recently at conferences on Kubernetes-based technologies, open-source and automation.
June 9, 2021
DoK #53 Day Zero - Azure Kubernetes Service // Raj Balakrishnan
Abstract of the talk… Are you new to Azure Kubernetes Service and just want to see how the nuts and bolts come together? This is the talk for you. A single slide and an end-to-end demo on how to run your first container on AKS. Bio… I contribute in building Digital Experiences for our Farmers Worldwide, through Platforms and Services. Passionate about building Distributed and Scalable Systems in Microsoft Azure. My Day to Day Work is around - Apps, Data, Storage, Identity, Integration, Serverless and Site Reliability Engineering #PlatformEngineering #AKS #Kubernetes #EventDrivenArchitecture #Microservices #Observability #Leadership I Blog at https://dev.to/irajbalakrish Key take-aways from the talk… 1. Setting up your AKS cluster 2. Running your first container
June 9, 2021
DoK #52 Enterprise-grade Kubernetes requirements // Haseeb Budhani
Abstract of the talk… We'll discuss best practices companies are adopting for enterprise-grade Kubernetes Management. Bio… Haseeb Budhani is the CEO of Rafay Systems, which he co-founded in late 2017. Prior to Rafay, Haseeb oversaw Akamai Technologies' expansion into the enterprise market as the company's Vice President of Enterprise Strategy. Akamai acquired Haseeb's previous company, Soha Systems, in 2016. Haseeb co-founded Soha in 2013 and served as the company's CEO. Prior to Soha, Haseeb served as the Chief Product Officer for Infineta Systems, where he was responsible for overseeing all aspects of the company's product marketing, marketing communications and partner management activities. Previously, Haseeb held senior product management, marketing and engineering roles at NET, Personal IT, Citrix Systems, Orbital Data, IP Infusion and Oblix. Haseeb holds an MBA from UC Berkeley’s Haas School of Business and a B.S. in Computer Science from the University of Southern California. Key take-aways from the talk… A list of must-have tools and integrations for any enterprise running production apps on Kubernetes.
June 5, 2021
DoK #51 Promscale: Using Prometheus + Promscale + PostgreSQL to go from Observation to Understanding // Matvey Arye
Abstract of the talk… Often when I talk about putting observability data into PostgreSQL people ask me: are you crazy? And yet this somewhat heretical view has the potential to unlock a lot of the power and promise of observability. Thanks to TimescaleDB (an extension to PostgreSQL), storing time series, metric data inside of a relational database is now efficient, fast, and scalable. This is thanks to its unique partitioning, compression, and horizontal-scalability features. But even if this is possible to do, why would you? The answer to that lies in the power of a flexible data model, joins, and SQL (which Promscale supports in addition to PromQL). A flexible data model allows you to combine metric data with various other data - from machine information such as the number of cores, memory, etc. to location information using GPS coordinates. This allows you to enrich your metrics with supplemental information using joins and performing much more sophisticated analysis using SQL for capacity analysis, BI, and more. A flexible data model brings us to our second heretical idea: combining multiple modalities of observation in a single database. Combining metrics, logs, traces, event data, etc. in one DB has two major advantages: the first being a similar analytical advantage to what is described above: the ability to join and cross-correlate various types of signals together. The second major advantage is operational simplicity. As we all know, databases are the hardest things in our infrastructure to maintain and operationalize because of that pesky thing called state. So why maintain multiple different types of database systems if you could maintain just one? While these ideas about observability data on Kubernetes may seem unusual and counter-intuitive, I hope they will generate interest and start a good conversation. Bio… Mat has been working on data infrastructure in both academia and industry for the past decade. Currently, he is leading the Promscale team, to make it easy for people to store and analyze their Prometheus data in both PromQL and SQL. Previously, he completed his Ph.D. at Princeton and then worked as one of TimescaleDB's core architects where he concentrated on performance, scalability, and query power.
May 29, 2021
DoK #49 Deployments vs StatefulSets vs Daemonsets // Ali Kahoo
Abstract of the talk… Kubernetes provides different resources for deploying applications. We will look at them, the differences between them, and how we can persist data using each of them. Bio… I am a Senior DevOps Engineer & Trainer with hands-on experience of architecting, developing, automating and optimizing production grade deployments on Kubernetes & Openshift using best DevOps practices. I am working as Lead DevOps Engineer at Tarabut Gateway where I have set up the infrastructure from scratch to leverage Open Banking in the MENA region. I am also a DevOps Instructor and have successfully trained more than 300 professionals worldwide on DevOps, especially Kubernetes. Previously, I led a team at Stakater, which provides tools and services to help customers create and manage their Kubernetes based infrastructure. I am an avid open source contributor with contributions in developing and leading some open source Kubernetes Controllers/Operators & helm charts.
May 29, 2021
DoK #50 Going Full Circle with Kafka // Ravi Trivedi
Abstract of the talk… Tecton is building a data platform for machine learning. This talk shares some of the adventures and lessons learned while introducing Kafka into our data pipelines. Bio… Engineering Manager @ Tecton. Previously: Staff SWE @ Google working on Search, Platforms, and Photos.
May 20, 2021
DoK #48 Airflow vs Argo - Battle Royale // Tim van de Keer
Abstract of the talk… We are going to be looking at and comparing Airflow (the established) versus Argo Workflows (the new kid on the block) to see how they measure up: what you would use each for, why you would want to use one or the other, and who would win in a battle for data workflow management supremacy. Bio… DataOps and Data Engineering enthusiast. I get enthusiastic about data and data technologies and mostly about how we can move all that data around and have people smarter than me do smart things with it. Made the switch unknowingly into a Data engineering role some 2 years ago and haven't looked back since. Also loves making tacos, anything food related basically, and playing the guitar again. Key take-aways from the talk… - What is Airflow - What is Argo Workflows - What are their pros and cons - Why you would pick either one
May 20, 2021
#1 DoK Community in Hindi: "Pehle Kadam Data on Kubernetes Community mein!" // Kunal Kushwaha
Abstract of the talk… What is Kubernetes? Where do you start? How do you become part of the community? Do these questions come to your mind too? Join us in this meetup, where we will talk about everything Data on K8s (in Hindi)! On May 3rd we will discuss how you can become part of the community, what the CNCF is, what an SRE's job involves, and much more! But that's not all! Take part in the quiz at the end of the meetup, where you can win some special swag from DoK! Bio… A student from India who is enthusiastic about Open Source and communities! Kunal has worked in DevOps and many other fields. He teaches students through his YouTube channel and also shows them the right path into Open Source.
May 4, 2021
DoK Community #47 FullStack OpenSource Observability using SigNoz // Ankit Nayan
Abstract of the talk… In the talk, we shall dive deep into the latest open-source tools like Prometheus and Jaeger and our journey in using them and ultimately building our own open-source observability tool, SigNoz. We shall discuss: - What is Observability? The 3 pillars of Observability - Metrics, Traces, and Logs - How is monitoring different than observability? - The hard things about Prometheus? - Why Distributed Tracing became so important? - Running both Prometheus and Jaeger to get metrics + traces. How complex can it go? - Pros and cons of using SaaS vs OSS solutions. Why self-host in the 21st century? - Why we built SigNoz? - What is OpenTelemetry? How to instrument a sample app using OpenTelemetry? - Demo of SigNoz to get detailed insights into your applications Bio… Ankit is co-founder and maintainer at SigNoz which is a part of the Y Combinator W21 batch. Ankit loves to discuss new technologies and the problems they solve. He believes technology will be more profoundly used in the future to scale business needs. He is now passionate about the application performance monitoring space and loves talking to developers about how they do it today. When not working he loves to play badminton and go on adventurous trips. He has done many Himalayan trips earlier and is a philanthropist. Key take-aways from the talk… - Hard things about Prometheus - How is monitoring different than Observability - How to get started with OpenTelemetry
May 4, 2021
DoK Community #45 K8s DX Chronicles: Evolution From CLI to GitOps & Cloud Native IDEs // Katie Gamanji
Abstract of the talk… Within its 7 years of existence, Kubernetes has been the gravitational center of the Cloud Native landscape, elevating a pluggable system that contributed to the diversification of the entire ecosystem. Wider adaptability of the tool prompted the diversification of the end-user base, and a consistent DX for cluster interaction became essential for Kubernetes. The community channeled herculean efforts towards the enhancement of the developer experience by extending the cluster CLI, building portals, and highly-responsive UIs. Bio… Katie is the Ecosystem Technical Advocate at CNCF (Cloud Native Computing Foundation). Katie's focus within this role is to grow and lead the End User Community while bridging the gap with other ecosystem units, such as project maintainers, TOCs, and SIGs. In past roles as a cloud platform engineer, Katie has built platforms that gravitate towards cloud-native technologies and principles, with Kubernetes as the focal point. These projects started with the automation of application delivery on OpenStack-based infrastructure, which transitioned into the creation of a centralized, globally distributed platform at Condé Nast and American Express. Additionally, Katie is the creator/instructor of the Cloud Native Fundamentals Udacity course and is on the advisory board for Keptn, a CNCF sandbox project. Key take-aways from the talk… This talk will focus on the cluster DX chronicles, showcasing tools that contributed to wider adoption of Kubernetes. An emphasis will be placed on the cluster CLI and how it can be extended using kubectl plugins and wrappers. This will be followed by an introduction of widespread cluster state managers, covering mechanisms such as GitOps and cloud native IDEs, such as GitPod.
May 1, 2021
DoK Community #44 DataOps // Vijay AB Kumar
Abstract of the talk… The talk will cover the various aspects of DataOps and why DataOps is important. It will also talk about some client experiences and how a DataOps strategy is helping address some of the challenges. The talk will also cover DataOps implementations, tools and technologies. Bio… IBM Distinguished Engineer, Master Inventor with 60+ patents, 23 years in IBM. Architect, Developer, Blogger, Author Key take-aways from the talk… Good understanding of DataOps, Why we need DataOps, Some understanding of the Technology advancements in the DataOps Area
May 1, 2021
DoK Community #46 Recovering and Porting Applications in the Fast-Paced DevOps World // Prashanto Kochavara
Abstract of the talk… Are you a Cloud Architect, DevOps Engineer or SRE who is developing cloud-native applications, managing complex app migration projects or in need of infrastructure resiliency? Cloud-native applications present extraordinary performance, scale and compliance challenges in hybrid- and multi-cloud environments that legacy tools simply cannot support. In this session and demo, we’ll take you through a case study of a large aerospace and defense company that is managing and migrating Kubernetes applications and databases in a multi-cloud environment. You’ll also learn how to handle common cloud-native development challenges like recovering from accidental namespace deletions during test/dev or migrating your application to another cloud for scale and performance testing. Bio… Prashanto Kochavara is Director of Product at Trilio, a leader in cloud-native data protection. Kochavara is responsible for the product strategy, development and roadmap of the company's backup and recovery platform for Kubernetes.
April 30, 2021
DoK Community #43 Kubecost: open source cost monitoring for Kubernetes // Webb Brown
Abstract of the talk… Measuring costs in Kubernetes environments is complex. Applications and their resource needs are often dynamic. Teams share resources without transparent prices attached to workloads, while organizations are increasingly running resources on a range of machine types and even cloud providers. Kubecost provides an approach built on open source for ensuring consistent and accurate visibility across all your workloads. This discussion will cover practical examples for implementing cost monitoring & optimization and managing the data that is generated from these efforts. Bio… Webb Brown is a Co-founder and CEO at Kubecost. He is based in San Francisco and was previously a Product Manager at Google working on infrastructure monitoring.
April 25, 2021
DoK Community #42 Spark on Kubernetes is Now Generally Available: Why & How to Migrate to It // Jean-Yves Stephan
Abstract of the talk… Apache Spark has run natively on top of Kubernetes (instead of Hadoop YARN) since 2018, but it's only since Spark 3.1 (released in March 2021) that the integration is officially generally available & production-ready. What is the high-level architecture of Spark on Kubernetes, how does it compare to alternatives, what does the migration look like? These are some of the questions we will answer together. We will first introduce the core concepts, then go through the stories of customers who migrated, and then give you concrete technical tips to help you be successful with Spark (on Kubernetes). If time permits, I may do a risky live demo. This will be a technical talk with very fresh content - I hope you will like it. I plan to make it short enough to make room for Q&A and improvisations based on your requests. So let me know if there's something specific you're interested in. Bio… I'm one of the co-founders at Data Mechanics (https://www.datamechanics.co), a Cloud-Native Spark Platform for Data Engineers. We're a Y Combinator-backed startup. We strive to finally make Apache Spark as developer friendly and cost-effective as it should be, by automating the infrastructure management side (autoscaling, automated sizing of containers, autotuning of Spark configurations) and building intuitive dashboards to help monitor your data pipelines. Prior to Data Mechanics, I was a software engineer at Databricks, where I led their Spark infrastructure team.
April 23, 2021
#3 DoK Community Brasil: "How CNCF Brasil can help us in our careers as SREs, DevOps engineers, or Devs" // Paulo Alberto Simoes
Talk in Portuguese. Bio… Empowering architects, developers, and SREs around the world to deliver scalable applications and system architectures optimized for the unique capabilities of the cloud, and helping organizations navigate their cloud-native journey. Passionate about systems architecture and software engineering, I dedicate myself to learning and sharing knowledge with a focus on cloud, Java, and other emerging technologies. I have been recognized as a Cloud Native Computing Foundation ambassador, and I am passionate about helping grow the community as an active speaker on CNCF projects and by fostering local leadership. In addition, I am a Chapter Lead for the Hyperledger Foundation, an open source blockchain organization, and a board member of SouJava, one of the largest JUGs in the world.
April 23, 2021
DoK Community #41 Designing Stateful Apps for the Cloud and Kubernetes // Evan Chan
Abstract of the talk… Almost all applications have some kind of state. Some data processing apps and databases have huge amounts of state. How do we navigate a cloud-based world of containers where stateless and functions-as-a-service are all the rage? As a long-time architect, designer, and developer of very stateful apps (databases and data processing apps), I’d like to take you on a journey through the modern cloud world and Kubernetes, offering helpful design patterns, considerations, tips, and where things are going. How is Kubernetes shaking up stateful app design? - What kind of state is there, and what are some important characteristics? - Kubernetes, containers, and the stateless paradigm (pushing state into DBs) - Where state lives and the persistence characteristics - Stateless vs serverless - why stateless is not really stateless, but serverless really is - Improving on the stateless paradigm using the local state pattern - Logs and event streaming for reasoning about state and failure recovery - The case for local disks: ML, Databases, etc. - Kubernetes and the Persistent Volume/StatefulSets - Leveraging Kubernetes PVs as a basis for building distributed data systems - Mapping the solution space Bio… Evan has been a distributed systems / data / software engineer for twenty years. He led a team developing FiloDB, an open source (github.com/filodb/FiloDB) distributed time series database that can process a million records per second PER NODE and simultaneously answer a large number of concurrent queries per second. He has architected, developed, and productionized large scale data and telemetry systems at companies including Apple, and loves solving the most challenging technical problems at both large and small scales, from advanced custom data structures to distributed coordination. He is an expert in bleeding edge #jvm #java #scala and #rust performance. Current interests include Rust and columnar compression.
He has led the design and implementation of multiple big data platforms based on Apache Storm, Spark, Kafka, Cassandra, and Scala/Akka. He has been an active contributor to the Apache Spark project, and a two-time Datastax Cassandra MVP.
April 20, 2021
#40 DoK Community: Cloud-Native Chaos Engineering in Databases // Karthik Satchitanand
Abstract of the talk… Chaos Engineering is revolutionizing how we test, and doing it the cloud-native way is the best approach in today's rapidly changing world, with a huge shift in the paradigm of Kubernetes resiliency. Karthik S, one of the maintainers of LitmusChaos, will introduce how to carry out Chaos Engineering the cloud-native way. Further, he will touch upon how Chaos Engineering is carried out in Cloud-Native Databases with LitmusChaos. He will also touch upon observability considerations for chaos engineering and what hooks Litmus provides for the same. Bio… Karthik Satchitanand is one of the maintainers of the CNCF sandbox project LitmusChaos. He is passionate about all things Kubernetes, and is generally interested in DevOps, storage performance/benchmarking & chaos engineering.
April 18, 2021
#39 DoK Community: A fireside chat with Jérôme Petazzoni // Jérôme Petazzoni
Abstract of the talk… A fireside chat with Jérôme Petazzoni in which we will get to know him up close and personal, ask him about how his personal music projects influence his professional work, and answer questions from the audience. Bio… Jérôme was part of the team that built, scaled, and operated the dotCloud PAAS, before it became Docker. He worked seven years at the famous container company, wearing various hats. When he's not busy with computers, he collects musical instruments. He can arguably play the theme of Zelda on a dozen of them.
April 11, 2021
#38 DoK Community: Patterns to create stateful applications on Kubernetes // Prashant Ghildiyal
Abstract of the talk… In this talk we will discuss the best patterns for creating stateful applications on top of Kubernetes. This will include application-layer caching and embeddable databases, as well as leveraging Kubernetes objects to store and sync state across multiple replicas. Bio… Prashant is passionate about democratizing best AppOps practices for Kubernetes and has started Devtron, an open source platform for AppOps on top of Kubernetes. Recently open-sourced, Devtron has been instrumental in the adoption of Kubernetes in some of the largest startups in India.
April 8, 2021
DoK en español #2 Release the Krake! Bringing Energy into the Compute Loop // Juan A. Fraire
Abstract of the talk… Cloud&Heat has always focused on providing energy-efficient data centers. In the last 8 years, we have developed an innovative water cooling technology for servers, converting waste heat into a valuable asset. By doing so, we have already greatly improved the energy efficiency of individual data centers. However, this isn’t enough. To globally maximize the efficiency of distributed data center infrastructures, this talk presents Krake. Krake is an orchestration software for compute-intensive jobs. It improves the global cost and energy efficiency of infrastructures by balancing the load between data centers. Krake evaluates and selects the most efficient site to run jobs based on certain metrics, such as energy availability, heat demand, and latency. It also reacts to changes in the system by migrating jobs. In other words, it ensures a job is run in the most energy- and/or cost-efficient way at any given time. Bio… Juan A. Fraire is an assistant researcher at CONICET in Argentina, an associate professor at Saarland University in Germany, and a cloud solutions architect at Cloud&Heat in Germany. His main interest revolves around networking and distributed applications enabled by state-of-the-art informatics techniques. Besides applying this to the cloud domain on the ground, Juan leads an international effort to push the vision into the space domain. Indeed, Juan has been the founder and chair of the annual Space-Terrestrial Internetworking Workshop (STINT) since 2014, has co-authored more than 55 papers published in international journals and leading conferences, and has set out a novel networking vision in his “Delay-Tolerant Satellite Network” book, co-authored with colleagues from NASA.
March 27, 2021
#29 DoK Community: How Absa Developed Cloud Native Global Load Balancer for Kubernetes // Yury Tsarev
Abstract of the talk… Global load balancing solutions, commonly referred to as GSLB (Global Server Load Balancing), have typically been the domain of proprietary network software and hardware vendors, installed and managed by siloed network teams. k8gb is a completely open source, cloud native, global load balancing solution for Kubernetes. k8gb focuses on load balancing traffic across geographically dispersed Kubernetes clusters using multiple load balancing strategies to meet requirements such as region failover for high availability. Global load balancing for any Kubernetes Service can now be enabled and managed by any operations or development team in the same Kubernetes native way as any other custom resource. The talk will cover both technical and business aspects of k8gb creation, including ongoing adoption within a very large organization. Bio… Yury is an experienced software engineer with a strong focus on Linux, Kubernetes and software quality. He is passionate about open source and contributes to several upstream projects on a regular basis. Currently, he is focused on developing open-source distributed systems in cloud-native environments.
March 27, 2021
DoK en español #1- Our learnings with Kubernetes // Aitor Artola, Miriam González, Raquel López Ruiz and Isidro Nistal
Our learnings from Kubernetes
March 27, 2021
#37 DoK Community: Running Data Replication Pipelines on Kubernetes with Argo // Stephen Bailey
Abstract of the talk… Hundreds of data teams have migrated to the ELT pattern in recent years, leveraging SaaS tools like Stitch or Fivetran to reliably load data into their infrastructure. These SaaS offerings are outstanding and can accelerate your time to production significantly. However, many teams prefer to roll their own tools. One solution in these cases is to deploy singer.io taps and targets: Python scripts that can perform data replication between arbitrary sources and destinations. The Singer specification is the foundation for the popular Stitch SaaS, and it is also leveraged by a number of independent consultants and data projects. Singer pipelines are highly modular. You can pipe any tap to any target to build a data pipeline that fits your needs, making them a good fit for containerized workflows. This article walks through the workflow at a high level and provides some example code to get up and running with some shared templates. I also drill into reasons for choosing the Argo approach over other orchestration tools like Airflow or Dagster, and the implications from a team perspective. Bio… Stephen Bailey is Director of Growth Analytics at Immuta, where he strives to implement privacy best practices while delivering business value from data. He loves to teach and learn, on just about any subject. He holds a PhD in educational cognitive neuroscience from Vanderbilt and enjoys reading philosophy.
March 25, 2021
My questions about Data on K8s // Kunal Kushwaha
Bio… Junior pursuing Computer Science & Engineering. Co-founder at Code for Cause. CNCF Intern 2020. MLH Coach. Google Summer of Code Mentor. YouTuber. Gold Microsoft Learn Student Ambassador.
March 21, 2021
#36 DoK Community: A Snapshot of DevOps // Tiffany Jachja
Abstract of the talk… DevOps is like a camera. We focus on what's important, we capture the good times, we develop from the negatives, and if things don't work out, we take another shot. Many teams that establish working best practices for their tools improve their delivery time and ability to scale. However, the real challenges exist outside of tools and technology, and many teams today still have questions about DevOps. So, join this session to learn the fundamentals of shaping a DevOps culture. We'll discuss key attributes around people, process, and technology, likening you and DevOps to pro photographers and cameras. Bio… Tiffany Jachja is a technical evangelist at Harness. She is an advocate for better software delivery, sharing applicable practices, stories, and content around modern technologies. Before joining Harness, Tiffany was a consultant with Red Hat's Consulting practice. There she used her experience to help customers build their software applications living in the cloud.
March 20, 2021
St. Patrick's Day Special - A diplomatic answer to the meaning of data, Kubernetes, and everything // Luke Feeney
Abstract of the talk… I will talk about my experiences entering the world of databases and data management after a very different life as a diplomat. I will introduce TerminusDB and its world history origins. Finally, I will situate the project and the roadmap from a k8s perspective. Bio… Prior to joining TerminusDB, Luke worked in the Irish Foreign Ministry for a number of years. He was Ireland’s acting Ambassador to Greece. Luke was also the Head of the Government of Ireland’s Brexit Communications Team and the Government Brexit Spokesperson.
March 19, 2021
#35 DoK Community: Make Kubernetes your development environment // Ramiro Berrelleza
https://go.dok.community/slack Abstract of the talk… Developers spend a lot of time making their local machine look like a cluster. But why do we do that? Our local machine is not where our code is supposed to run! We built okteto (github.com/okteto/okteto) so we can make our Kubernetes clusters look like our local machine. In this talk, we'll show you how okteto helps you take advantage of all the goodness of Kubernetes and the cloud without having to sacrifice a really fast development and feedback loop. Bio… Ramiro Berrelleza is one of the makers of Okteto. He has spent most of his career (and his free time) building cloud services and developer tools. Before starting Okteto, Ramiro was an Architect at Atlassian and a Software Engineer at Microsoft Azure. Originally from Mexico, he currently lives in San Francisco. Key take-aways from the talk… If you're building a Cloud Native app, you should use Kubernetes as part of your development environment, instead of just using it as a deployment target. Don't build, push, redeploy on every change.
March 18, 2021
#34 DoK Community: Opstrace, An open source alternative to services like Datadog, SignalFx, and others... // Sébastien Pahl
Abstract of the talk… Open source observability should not be hard. What companies package as their enterprise offering should be available to anyone who wants to monitor their systems. Opstrace is a complete monitoring platform designed for the end user instead of the expert. Its goal is to be as easy to use and operate as a hosted SaaS provider, but within one's own cloud account. This is not only up to 10x more cost-efficient but also allows full control over one's data. Bio… Sebastien Pahl is the co-founder and CEO of Opstrace, an open source alternative to services like Datadog, SignalFx, and others... Previously he worked at Cloudflare, Mesosphere and Red Hat in San Francisco, building teams and projects. Co-founded Docker, a Y-Combinator startup. Passionate about large scale platforms, developer tools, automation, open source, distributed systems, cooking and photography.
March 13, 2021
#33 DoK Community: Making observability accessible is the fourth pillar // Alex Jones
Abstract of the talk… Observability systems are typically a collection of tools that cover the three pillars of logs, metrics and tracing. These enable skilled engineers to correlate telemetry insights to perform data-driven diagnostics and rectify degraded services. In this talk, I discuss how, over the course of three years, I have worked towards removing the built-in gatekeeping that comes with creating monitoring solutions and enabling them to work for an entire organisation. We shine a light on the overlooked developer community that interacts with Observability but does not necessarily hail from SRE disciplines. I engage with anecdotes from my past and illustrate the inherent bar to success that comes with connecting multiple tools together and the context required to achieve results. With years of experience working to improve adoption and create consumer-friendly facades for tools such as Grafana, Prometheus and Jaeger, I draw upon my background within large financial institutions and how building engaging and simplified DX can compel and excite engineers to work with observability. Bio… Alex works as both a contributor and end-user of cloud-native technology. He spends his work-time translating tooling, practices and behaviours into generators of strategic value and positive cultural change. His passion is to create positive engineering cultures that enable inclusivity and diversity as a core attribute rather than a vague goal, enabling engineers to build more reliable services through shared accountability, observability and automation of toil. He has worked at companies such as JPMorgan, American Express, Microsoft, and British Sky Broadcasting.
March 10, 2021
#32 DoK Community: How to choose a Kubernetes distribution for on-prem environments? // Tomasz Cholewa (Presenter: Bart Farrell)
Abstract of the talk… Buy a ready off-the-shelf product, customize an existing open source project, or build your own distribution? When you can't go to the cloud and leverage its powerful features you have to make a choice. On-prem environments need more attention, but they also often can be more cost-effective and are highly coveted by the development and operations teams. In this talk, I will cover some of the most important topics related to building an on-prem Kubernetes platform and I will describe the most popular distributions. Bio… Tomasz Cholewa is the co-founder of Cloud Code Labs and he believes that good applications deserve even better environments where they can spread their wings and it's his job to help you create them. To achieve it he uses cloud and containers, but mostly he listens carefully to understand and use the right mix of proper technology and processes around it. His field of expertise is public cloud (AWS), containers and their orchestration engines - OpenShift and Kubernetes.
March 7, 2021
#31 DoK Community: The Data Lifecycle - Where Do We Go From Here // Benjamin Rogojan. (Presenter: Bart Farrell)
Abstract of the talk… Going from raw data to machine learning models successfully in companies of all sizes requires more than just an understanding of programming. Teams need to manage their data products' lifecycle, their software, as well as the data. Data products like machine learning models aren’t created out of thin air. They are built on layers of best practices that ensure the models are using accurate data, they are outputting reliable numbers and they have some method to interact with the outside world. So how do we get there? The purpose of this talk is to discuss the current state of the data lifecycle as it pertains to creating data products. These could be machine learning models, dashboards and data APIs. We will outline the general architecture that helps take data from raw to some form of machine learning model. In addition, we will discuss some of the concepts that are being applied from DevOps as well as being created in MLOps to help better facilitate your data lifecycle. Bio… Ben has spent his career focused on all forms of data. He has focused on developing algorithms to detect fraud, reduce patient readmission and redesign insurance provider policy to help reduce the overall cost of healthcare. He has also worked in various industries including transportation, Big Tech, start-ups, insurance, SaaS and more. In all of these industries he has helped companies develop their data strategy, often starting from scratch to develop an end-to-end data solution. Ben privately consults on data science and engineering problems both solo with Seattle Data Guy as well as with a company called Acheron Analytics. He has experience both working hands-on with technical problems as well as helping leadership teams develop strategies to maximize their data.
Key take-aways from the talk… - Creating successful data products and models requires more than just programming skills - Best practices from DevOps can help improve data science and ML models maintenance and lifecycle
March 4, 2021
#2 DoK Community Brazil: Let's understand databases in the cloud with the help of Wagner Bianchi! // Wagner Bianchi (Talk in Portuguese)
Abstract of the talk… A relaxed conversation about the future of databases as a service, data on Kubernetes from a DBA's point of view, and several other related topics. Bio… DBA and IT professional with more than 15 years of experience working with systems, integrations, and databases. An AWS certified solutions architect, a MariaDB certified database professional, a Red Hat certified specialist in Ansible Automation, a Splunk certified architect, and a MySQL certified specialist (CMDBA, CMDEV, CMCDBA). He also leads a team of professionals, getting his hands dirty with daily technical tasks and working globally with the best database engineers and specialists in the world. He is also currently managing #GCP environments.
February 27, 2021
#30 DoK Community: Kyverno for Kubernetes! // Jim Bugwadia. (Presenter: Bart Farrell)
Abstract of the talk… Kubernetes is powerful but can be complex to manage! In this talk, Jim Bugwadia from Nirmata will show how policy managers can help address the complexity via admission controls and dynamic configurations. Jim will introduce Kyverno, a Kubernetes native policy engine and CNCF sandbox project. Jim will then demonstrate how you can use Kyverno to ensure security and best practice compliance for your clusters. Bio… Software engineer with background in distributed systems and scale computing. Founder at Nirmata. Previously worked at Cisco, Pano Logic, Trapeze Networks, Bell Labs, and Motorola. Key take-aways from the talk… 1. Policy engines are necessary for production-grade Kubernetes 2. Kyverno is an easy to use and powerful policy engine built for Kubernetes 3. You can use Kyverno to secure your clusters and address other configuration management issues
February 24, 2021
#28 DoK Community: Getting Started Contributing to Kubernetes // Rin Oliver & Savitha Raghunathan. (Presenter: Bart Farrell)
https://go.dok.community/slack Abstract of the talk… This talk will walk through how to get started contributing to Kubernetes, combatting imposter syndrome, the many other ways you can get started contributing to K8s other than by writing code, and the benefits of joining a community such as K8s. Bio… Rin is a Technical Community Builder at Camunda. They enjoy discussing all things open source, with a particular focus on diversity in tech, improving hiring pipelines in OSS for those that are neurodivergent, and removing accessibility barriers to learning programming. Rin is also a Member of Kubernetes, a contributor to Spinnaker, involved in the Kubernetes Contributor Experience SIG, and is a Storyteller on the Kubernetes Upstream Marketing Team. When not immersed in all things OSS and cloud-native, they can be found hanging out with their wife and pets, making candles, cooking, or gaming. Savitha is a Senior Platform Engineer at MathWorks. She has been working with container technologies for the past 5 years and uses Kubernetes in her day-to-day job. She also contributes to the Kubernetes ecosystem, currently involved with release, security, mentoring, and documentation efforts. Key take-aways from the talk… - How to get started contributing to K8s - Why you should contribute to K8s - Combatting imposter syndrome - And more!
February 11, 2021
#27 DoK Community: Cost management for OpenShift, a new SaaS service to understand your Kubernetes costs // Sergio Ocón
Abstract of the talk… For IT decision-makers, this goes above and beyond just keeping infrastructure running and efficient; it is about understanding how your IT budget affects your business, and how well your resources maximize the use of your budget. This makes it critically important that IT teams can more quickly and easily see the totality of their IT costs across the hybrid cloud. We’re pleased to introduce a new software-as-a-service (SaaS) offering intended to help our customers better understand the costs of their OpenShift environments: OpenShift cost management. Available free of charge as part of a Red Hat OpenShift Container Platform subscription, OpenShift cost management provides a simplified, more intuitive view into the costs, from the macro to the granular, of an OpenShift deployment. Bio… A Telco Engineer with an MBA, I've held different positions in the market, from R&D to sales, and now work as a product manager for a SaaS service offered with Red Hat OpenShift, focused on cost management. Key take-aways from the talk… Cost management is a SaaS service that is offered with OpenShift Container Platform at no additional cost. It allows you to see the costs of your infrastructure and map them to OpenShift resources. Use tagging and cost models to provide better insights on your infrastructure.
February 4, 2021
#1 DoK Community Brazil: DevOps, kubernetes and data // Rogeria Portilho (Talk in Portuguese)
Abstract of the talk… My experience in this contemporary technology journey of the last 4 years, fears, mistakes, IT paradigms, and agile methodologies impact my goals. Bio… I love working with technology. I am a person who learns much more by doing and teaching. I have in mind that in technology you can’t know everything. I love dogs and cycling. Key take-aways from the talk… Talk about religion and politics
January 31, 2021
DoK Nederkube Edition #1: Is Kubernetes ready for Data Management? // Michel de Ru, Jeffry Molanus & Arie van den Bos
Abstract of the talk… Kubernetes has become the standard for microservices architectures. But what about handling massive and scalable data management on top of it? Is it possible and what does it mean for operations? Cassandra has been adopted widely and accepted globally as the most scalable and reliable database. Now it adds ease of use by offering a Kubernetes native plug and play solution for enterprise use! Bios… Michel has been specializing in NoSQL technologies and Distributed Data Management for about 10 years. Working with enterprise organizations like ABN AMRO, Rabobank, ING, Booking.com and more, he deployed NoSQL technologies into business critical applications. Arie is a Cloud Solutions Engineer with over 10 years of experience. He has been a team leader and helped previous companies reduce their cloud costs through optimization strategies. Jeffry Molanus is CTO of MayaData, the creator of the OpenEBS and Litmus Chaos projects and a top 5 contributor overall to Cloud Native Computing Foundation projects. Jeffry’s experience includes technical leadership positions at Cloudian and Nexenta Systems. He has been a frequent speaker about the underpinnings of emerging trends at open source conferences such as FOSDEM and storage technology events such as the Storage Developer Conference and elsewhere. Key take-aways from the talk… The time is right for Kubernetes to support gigantic Data Management workloads and Cassandra now makes it easy to deploy and operate!
January 30, 2021
#26 DoK Community: How to unblock your release pipelines with data // Olaf Molenveld
https://go.dok.community/slack Abstract of the talk… Even though microservices are becoming a pattern, we still see a lot of "monolithical" deploys and manual reactive actions. This blocks the ability to achieve maximum velocity in your release. We can leverage data and smart use of traffic-shaping to achieve a higher release velocity AND quality. Bio… Olaf has over 20 years of experience in the internet industry in technical, architectural and IT management roles. With a background as a software developer, enterprise/solution architect, and technical consultant, Olaf is in a good position to align business challenges with technical innovations and organisational processes. In his former life, he helped teams design, build and release innovative online and e-commerce platforms for digital enterprises. In his role as CTO of Vamp.io he is focusing on realising the vision to bring "controlled GoLive" and advanced release automation to the next level. Key take-aways from the talk… Get an understanding of how we can use cloud-native traffic-shaping mechanisms and all kinds of data sources to increase the velocity and quality of releasing software.
January 28, 2021
#25 DoK Community: Deconstructing Postgres into a Cloud Native Platform // Álvaro Hernández
https://go.dok.community/slack Abstract of the talk… Is deploying Postgres in Kubernetes just repackaging it into a container? Can’t Postgres leverage the wide range of Cloud-Native software and integrate well with K8s? Join this journey that will cover and demonstrate, with demos running on StackGres: How to structure Postgres into an init-less container, plus several sidecar containers for connection pooling, backups, agents, etc. Defining high level CRDs as the single API to interact with the Postgres operator. Using K8s RBAC for user authentication of a web UI management interface. Using Prometheus for monitoring; bundling a node, Postgres and PgBouncer exporters together. Proxying Postgres traffic through Envoy. Terminate Postgres SSL with an Envoy plugin, that also exports wire protocol metrics to Prometheus. Using Fluentbit to capture Postgres logs and forward them to Fluentd, which stores them on a centralized Postgres database. Key take-aways from the talk… Understand how and why to "de-construct" Postgres into a cloud native application. Learn how to leverage K8s CRDs to interact with Postgres in Kubernetes, without being a Postgres expert. Understand the benefits of proxying Postgres traffic with the Envoy proxy and its Postgres filter. How to use FluentBit and Fluentd to capture and collect logs from Postgres pods and consolidate into a central location. Learn about the open source StackGres platform, a Postgres Distribution for Kubernetes. The goal of this talk is to share with the Community all the knowledge and best practices learned while integrating Postgres with the Kubernetes ecosystem. They stem from the development of the open source StackGres (https://stackgres.io) Postgres platform. 
StackGres followed instead a novel approach: re-analyze all Postgres architecture and find what components and subsystems could be externalized to K8s ecosystem tools, and construct sidecars for all the agents and related software, avoiding the use of init systems. Bio… Álvaro is a passionate database and software developer. He founded and works as the Founder & CEO of OnGres (https://ongres.com). He has been dedicated to PostgreSQL and R&D in databases for two decades. An open source advocate and developer at heart, Álvaro is a well-known member of the PostgreSQL Community, to which he has contributed founding the non-profit Fundación PostgreSQL (https://postgresql.fund) and the Spanish PostgreSQL User Group. You can find him frequently speaking at PostgreSQL, database, cloud, and Java conferences. Every year, Álvaro travels approximately three-four times around the globe—in 2020 (or now more likely 2021 due to COVID-19), he will hit the milestone of having delivered 100 tech talks. AWS Data Hero (2019).
January 21, 2021
#1 DoK Community India: "Best practices for overprovisioning in k8s" // Miguel Ángel Mingorance & José Luis Talavera
https://go.dok.community/slack We will discuss how to implement an efficient way to overprovision a Kubernetes cluster so that there is always enough room in the cluster for applications to grow in number of replicas. Presenter: Bart Farrell
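One widely used way to keep spare room in a cluster, which may or may not be the approach covered in this episode, is to run low-priority placeholder pods that reserve capacity and are evicted as soon as a real workload needs the space. The sketch below builds the two manifests involved as plain Python dictionaries. The names `overprovisioning` and `overprovisioning_manifests`, the priority value, and the pause image tag are illustrative assumptions, not details taken from the talk.

```python
import json


def overprovisioning_manifests(replicas, cpu, memory, namespace="default"):
    """Build a PriorityClass and a Deployment of placeholder pods.

    All names and values here are hypothetical examples.
    """
    # A negative priority lets any normally prioritized pod preempt the placeholders.
    priority_class = {
        "apiVersion": "scheduling.k8s.io/v1",
        "kind": "PriorityClass",
        "metadata": {"name": "overprovisioning"},
        "value": -10,
        "globalDefault": False,
        "description": "Placeholder pods that real workloads may preempt.",
    }
    # Each replica requests resources but does no work (the pause image just sleeps).
    deployment = {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": "overprovisioning", "namespace": namespace},
        "spec": {
            "replicas": replicas,
            "selector": {"matchLabels": {"app": "overprovisioning"}},
            "template": {
                "metadata": {"labels": {"app": "overprovisioning"}},
                "spec": {
                    "priorityClassName": "overprovisioning",
                    "containers": [
                        {
                            "name": "pause",
                            "image": "registry.k8s.io/pause:3.9",
                            "resources": {
                                "requests": {"cpu": cpu, "memory": memory}
                            },
                        }
                    ],
                },
            },
        },
    }
    return [priority_class, deployment]


if __name__ == "__main__":
    # Print each manifest as JSON, which kubectl accepts as well as YAML.
    for manifest in overprovisioning_manifests(3, "500m", "512Mi"):
        print(json.dumps(manifest, indent=2))
```

The headroom reserved is simply the replica count multiplied by the per-pod requests, so the amount of spare capacity can be tuned by scaling the placeholder Deployment.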
January 15, 2021
#24 DoK Community: The architecture of a distributed database // Jim Walker, Lisa-Marie Namphy & Keith McClellan
Abstract of the talk… Cockroach Labs has built a database architected from the ground up to be distributed. It is a perfect fit for the cloud and Kubernetes as it naturally scales and survives without manual interaction. The unique architecture of CockroachDB delivers some key innovations that may not only provide value for your applications but might also give you insight into the challenges/solutions in distributed systems. In this session, we will deliver a deep-dive exploration into the internals of the database, exploring the following, and more: how the database uses KV at the storage layer to effectively distribute data; how Raft and MVCC are used to guarantee serializable isolation for transactions; how Cockroach automates scale and guarantees an always-on resilient database; and how to tie data to a location to help with performance and data privacy. Bio… Jim has been a product marketer for almost twenty years and before that he coded professionally in Smalltalk, C++ and Java. He still codes and likes to dive deep into tech so that he can help translate complex topics into consumable forms. Over the course of his career he has focused on emerging tech and has been directly involved in creating six categories. He prides himself on being an advocate of the developer and a rabid open source software promoter. The startups he has helped build include Servgate, Vontu (acquired), Initiate Systems (acquired), Talend (IPO), Hortonworks (IPO), EverString (acquired) and CoreOS (acquired), and he is currently the VP of Product Marketing at pre-IPO Cockroach Labs. Key take-aways from the talk… We will dive deep into the architecture of the database and explicitly cover the following areas: Ranges (partitions): SQL to KV RAFT Distributed Data: Range Distribution, Scale and Resilience Distributed Transactions Distributed SQL Execution Distributed Latency Distributed Performance Optimizations
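For readers unfamiliar with MVCC (multi-version concurrency control) over a key-value layer, here is a minimal, single-process sketch of the general idea: every write is kept as a timestamped version, and a read sees the newest version at or before its own timestamp. This is a toy under stated assumptions, not CockroachDB's implementation. The class name `MVCCStore` and its methods are hypothetical, and it omits replication, transactions, and the extra machinery needed for serializable isolation.

```python
from bisect import bisect_right
from collections import defaultdict


class MVCCStore:
    """Toy multi-version key-value store (illustrative only)."""

    def __init__(self):
        # For each key, two parallel lists kept in commit order:
        # the commit timestamps and the values written at those timestamps.
        self._timestamps = defaultdict(list)
        self._values = defaultdict(list)
        self._clock = 0  # logical clock standing in for a real timestamp source

    def put(self, key, value):
        """Append a new version of key and return its commit timestamp."""
        self._clock += 1
        self._timestamps[key].append(self._clock)
        self._values[key].append(value)
        return self._clock

    def get(self, key, as_of=None):
        """Return the newest value of key whose timestamp is <= as_of.

        With as_of omitted, read at the latest timestamp.
        Returns None if the key had no version at that time.
        """
        read_ts = self._clock if as_of is None else as_of
        versions = self._timestamps.get(key, [])
        # Index of the first version strictly newer than read_ts.
        idx = bisect_right(versions, read_ts)
        if idx == 0:
            return None
        return self._values[key][idx - 1]


if __name__ == "__main__":
    store = MVCCStore()
    first = store.put("balance", 100)
    store.put("balance", 250)
    # A reader pinned to the first timestamp still sees the old value.
    print(store.get("balance", as_of=first), store.get("balance"))
```

Because old versions are never overwritten, a reader pinned to an earlier timestamp is not disturbed by later writes, which is the property that makes consistent snapshots possible.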
January 14, 2021
#23 DoK Community: 2021 DoK Community Kickoff! Trends, friends, and more! // Arie van den Bos, Jake Page and Ariel Munafo
For our 23rd installation of the Data on K8s community meetup, we will be talking with Ariel Munafo, who is a CNCF ambassador and the founder of EuropeClouds (among many other things); Arie van den Bos, Senior Systems Engineer on Cloud Systems at Kurago; and Jake Page, who is a DevOps and Cloud Native Enthusiast. Ariel Munafo is a very busy human being. He is a Founder, Investor, Board Member, Cloud Computing Evangelist, CNCF Ambassador, Community Builder, Personal Advisor to CIOs & CTOs and still a Geek! He lives in Tel Aviv, Israel, but is constantly interacting with people all over the globe. Jake Page is a Cisco/AWS/Microsoft certified DevOps and Cloud Native Enthusiast living in Florianópolis, Brazil. He wants to be involved in anything that has to do with growth, efficiency, and developing solutions that make people ask themselves, "What did I do before this existed?". Arie van den Bos is a Cloud Solutions Engineer with over 10 years of experience. He has been a team leader and helped previous companies reduce their cloud costs through optimization strategies.
January 7, 2021
Dok Season 1 Extras - #4 - How I got started on k8s, and you can too! // Sonia Singla
Sonia Singla - Developer advocate, Embark Studios
December 28, 2020
#22 DoK community: Vitess Operator for Kubernetes // Alkin Tezuysal
In this talk, I would like to uncover our newly announced Vitess Operator for Kubernetes. This talk demonstrates the sample implementation of Vitess in Kubernetes topology. I also explore common DBA tasks by demonstrating how they are handled in the Vitess ecosystem. Vitess, out of the box, comes with a lot of tools and utilities that one has to either incorporate or develop to manage MySQL topology. Let’s take a look at the capabilities of Vitess in these areas and demonstrate how they are performed under the operator realm. Alkin Tezuysal has extensive experience in enterprise relational databases, working in various sectors for large corporations. With more than 20 years of industry experience, he has acquired skills for managing large projects from the ground up to production. For the past decade, he's been focused on e-commerce, SaaS, and MySQL technologies. Alkin has managed and architected database topologies for high volume sites at eBay Intl. He has several years of experience in 24X7 support and operational tasks, as well as improving database systems for major companies. He has led global operations teams on Tier 1/2/3 support for MySQL customers. In late 2020, he joined PlanetScale's expert engineering team over Vitess. Open source database evangelist, global database operations expert, storyteller, inspiring technical and strategic leader, creative team builder, speaker, mentor, and coach. Born to sail, forced to work.
December 15, 2020
#21 DoK community: Data on Kubernetes: my insights // Eduard Tomàs
Data handling is one of the hardest things in Kubernetes. This talk will be an informal conversation about things (related to data management) that Eduard found while helping customers embrace Kubernetes. I hope you find them useful! Eduard Tomàs has been a passionate developer since he was a child. He started programming with a Spectrum 48 and he is still here! Now he is focused on helping customers embrace the cloud. Because of that, he works with cloud-ready, cloud-native technologies and distributed architectures.
December 9, 2020
Dok Season 1 Extras - #3 - My work on the Strimzi operator // Jakub Scholz
Jakub Scholz is a Software Engineer at Red Hat
December 4, 2020
#20 DoK community: Tips and tricks to get Kubernetes certifications // Eneko Pérez & Carlos Gómez Carrero
CKA (Certified Kubernetes Administrator) has a bad reputation as the hardest certification many people have faced. In this talk, we will go through the process of passing the exam successfully, tips on the exam itself, the environment, and any other questions that might arise. How to fly into a Kubernetes certification. Eneko Pérez is a DevOps Engineer who evolved from many years as a systems administrator. Carlos Gómez is a Kubernetes (CKA | CKAD) | Solutions Architect | DevOps Architect trying to find his place in the Cloud
December 1, 2020
#19 DoK community: Towards a K8s Native Streaming Application // Francesco Nobilia & Jeremy Frenay
Starting from a simple application that can be deployed on any machine running Docker, we will go through all the steps required to transform it into a Kubernetes-native streaming application. We will explain the theory and then apply the concepts to define a recipe for running streaming applications on Kubernetes. We will focus on both cultural and technical tricks to help you successfully adopt streaming applications at scale. At the end of the talk, you will have a comprehensive view of all the platform building blocks and application requirements needed to successfully run a streaming application on Kubernetes. Spoiler: you will hear the words Apache Kafka, Kafka Streams and Strimzi several times. Francesco Nobilia is an enthusiastic engineer focused on building the next generation of a self-service and cost-effective streaming data platform. Event-driven addict. Apache Kafka fan. Kafka Summit and Meetup speaker. Currently, he is Principal Engineer at Nutmeg. Jeremy Frenay is a platform engineer at Babylon Health. He has been leading Babylon's Data Operations efforts since late 2017, building the Kafka-based data infrastructure, the automation and the tooling required to support teams of software and data engineers working on data integration pipelines in a cloud-native environment.
November 30, 2020
#18 DoK community: DoK Panel - The State of State // Rosemary Wang, Lili Cosic, Tomasz Cholewa & Jacquie Grindrod
Stateful vs stateless? We will stately be stating our statutes regarding the status of the state of statefulness and statelessness on k8s- oh yea! In the DoK Community, one of the main issues that folks have are how in the world can they flatten the learning curve when it comes to running stateful applications in k8s. That's why we've brought on 3 experts from 3 different countries to tell us what state state (intentionally doubled) is in! Rosemary Wang works to bridge the technical and cultural barriers between infrastructure engineers and application developers. She has a fascination for solving intractable problems with code, whether it be helping an infrastructure engineer learn to code or an application developer troubleshoot infrastructure failures. She also interfaces with vendors, clients, startups, and open source projects to find creative software solutions for infrastructure. When she is not drawing on whiteboards, Rosemary valiantly attempts to hack stacks of various infrastructure systems on her laptop while watering her houseplants. Lili Cosic is a Principal Software Engineer at Red Hat, working on OpenShift monitoring. She is also a kube-state-metrics, prometheus-operator and kube-prometheus project maintainer. Previously she worked at Weaveworks on the Weave cloud integration with Kubernetes and before that, she found her passion for Kubernetes operators at Kinvolk creating first operators. In her free time, Lili enjoys experimenting with Kubernetes, cooking and distributed systems. Tomasz Cholewa is the co-founder of Cloud Code Labs and he believes that good applications deserve even better environments where they can spread their wings and it's his job to help you create them. To achieve it he uses cloud and containers, but mostly he listens carefully to understand and use the right mix of proper technology and processes around it. His field of expertise is public cloud (AWS), containers and their orchestration engines - OpenShift and Kubernetes. 
Jacquie Grindrod is a developer advocate for HashiCorp with a focus on delivery and orchestration, where she’s able to apply her passion for solving problems with a holistic approach by bridging the gaps between teams and systems. In 2019, Jacquie was recognized as one of Canada’s Top 30 Under 30 Developers and has spoken at conferences including DevOpsDays Toronto and Grace Hopper Celebration.
November 30, 2020
#17 DoK community: Is k8s Even Ready For Data? Round II // Patrick McFadin & Jeffry Molanus
In our inaugural DOKC meet-up, Patrick McFadin Developer Advocate at Datastax emphasized the challenges of running Cassandra on Kubernetes, concluding at one point that “Kubernetes might not be ready for Cassandra.” Since that meeting, the use of the open-source Container Attached Storage project OpenEBS as a simple and high performance per workload storage for Cassandra has proliferated. Also the Cassandra Operator from Datastax, aka “CaSS”, has progressed as well. So - where are we now? Is CaSS on CAS working well? What is the future of collaboration between Datastax / Cassandra and MayaData / OpenEBS? Is Kubernetes now ready for Cassandra? What are the emerging technologies that might shape storage and Kubernetes in the near future? What are the reasons people avoid running DBs on Kubernetes? What makes it easier?
November 30, 2020
#16 DoK community: HyperStore-C: S3 object storage managed by Kubernetes // Gary Ogasawara
Cloudian’s HyperStore is S3-compatible object storage software focused on the enterprise market. In this talk, I'll discuss how and why we are working on Kubernetes-managed versions of HyperStore, including where we are now and what we're looking. Gary Ogasawara is Cloudian’s first Chief Technology Officer, responsible for setting the company’s long-term technology vision and direction. Before assuming this role, he was Cloudian’s founding engineering leader. Prior to Cloudian, Gary led the Engineering team at eCentives, a search engine company. He also led the development of real-time commerce and advertising systems at Inktomi, an Internet infrastructure company. Gary holds a Ph.D. in Computer Science from the University of California at Berkeley, specializing in uncertainty reasoning and machine learning.
November 30, 2020
Dok Season 1 Extras - #2 - The journey from Pivotal Cloud Foundry to Kubernetes // Alexander Herranz
In this episode, Alexander Herranz talks to us about where companies keep their data, by way of a comparison between OpenShift and Kubernetes. Some of the topics we cover: the journey from Pivotal Cloud Foundry to Kubernetes and on to OpenShift; the advantages and disadvantages of starting to work with containers under the OpenShift umbrella; and a way to ensure good practices in the development process: continuous integration and deployment (CI/CD), monitoring, and security.
November 5, 2020
#15 DoK community: Reaching limits in K8s: A case study with Ingress Controller // Laurent Rouquette
When talking about data, we usually think about big data and scale, and what do we do next. Such limits are sometimes a good problem to have. In this talk, we'll discuss our approach to this situation using the Ingress Controller. Laurent is a Software Development Engineer at Adobe. He started in Dev, went to Ops (sometimes referred to as "SRE" these days), then became a manager and now he's back to being an individual contributor. This is all on purpose: he likes to do new things! He grew up in Marseille, France (go Olympique de Marseille!), but moved to the US right after college and actually he's never had a real job in Europe... When he isn't coding, he likes playing soccer (although he's been doing a lot of "watching" since March 2020).
October 27, 2020
#14 DoK community: Kubernetes Cost Control // Arie van den Bos
For our 14th installation of the data on k8s community meetup, we talked with Cloud System Engineer / Architect Arie van den Bos. // Abstract: In this meetup, Arie discussed the following: The importance of cost control while working with the cloud. K8S, Data & Cost Control. Hints/Tips around controlling your K8S costs. // Bio: Arie is a Cloud Solutions Engineer with over 10 years of experience. He has been a team leader and helped previous companies reduce their cloud costs through optimization strategies. As a Solution Architect, Arie has been designing solutions which affect different areas like: - Azure as a Disaster Recovery location - Azure AD for Identity Management and SSO - Azure Governance (Cost Control) - Office 365 Integration - OKTA Access/Identity Management - Cloud Automation (IaC) //Other links you can check Arie on: https://www.linkedin.com/pulse/control-costs-your-azure-cloud-environment-save-part-van-den-bos/?trackingId=hfPo7ynCviKUngjsMeabiA%3D%3D https://www.linkedin.com/pulse/control-costs-your-azure-cloud-environment-save-part-van-den-bos-1f/?trackingId=hECZZjKdBa7lmE2rJf5leQ%3D%3D https://www.linkedin.com/pulse/control-costs-your-azure-cloud-environment-save-part-van-den-bos-2f/?trackingId=5xhIm3h2zS5NCtaOuwkApQ%3D%3D // Final thoughts This will be a Fireside chat all audience can participate and ask questions, and you can also join the conversation beforehand in our slack group: https://go.dok.community/slack ▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://go.dok.community/slack Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Arie on Linkedin: https://www.linkedin.com/in/arievandenbos/ Connect with Bart on LinkedIn: https://www.linkedin.com/in/bart-farrell/ ▬▬▬▬▬▬ Supporters of the DoKc ▬▬▬▬▬▬ This meetup is sponsored by MayaData, which helped start the DOK.community and remains an active supporter. 
MayaData sponsors two Cloud Native Computing Foundation (CNCF) projects, OpenEBS (http://www.openEBS.io) - the leading open-source container attached storage solution - and Litmus (https://litmuschaos.io/) - the leading Kubernetes native chaos engineering project, which was recently donated to the CNCF as a Sandbox project. As of August 2020, MayaData is the fifth-largest contributor to CNCF projects. Well-known users of MayaData software include the CNCF itself, Bloomberg, Comcast, Arista, Orange, Intuit, and others. Check out more info at https://mayadata.io/.
October 26, 2020
#13 DoK community: Distributed Workloads on Kubernetes Operators to the Rescue // Sebastien Guilloux
For our 13th installation of the data on k8s meetup, we will be talking with Senior Software Engineer Sebastien Guilloux from Elastic about Distributed workloads on k8s and how operators play a part in that! // Abstract: How easily can you run distributed workloads on Kubernetes? The initial deployment of your 10-nodes database might be easy to setup, but day-2 operations (changing the configuration, adding and removing nodes, version upgrades, etc.) are much more complicated. We'll discuss how operators can help you manage distributed workloads, and a few operator tricks we learned while working on ECK (Elastic Cloud on Kubernetes) - an operator for the Elastic stack. // Sebastien Guilloux Bio: Sebastien Guilloux is a senior software engineer at Elastic. He has spent most of his career working with distributed systems, building resilient applications, and orchestrating Apache Kafka and Elasticsearch nodes around the world. He currently works on writing a Kubernetes operator for the Elastic Stack, Elastic Cloud on Kubernetes (ECK). @_sebgl on twitter // Final thoughts This was a damn good chat and Sebastian was sooo easy to talk to and shared so much knowledge with us! thanks to all that joined and watched! ▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://go.dok.community/slack Follow us on Twitter: @dokcommunity Register for the next meetup: // Register here: https://go.dok.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Seb on Linkedin: https://www.linkedin.com/in/s%C3%A9bastien-guilloux-b6617359/ Connect with Bart on LinkedIn: https://www.linkedin.com/in/bart-farrell/ ▬▬▬▬▬▬ Supporters of the DoKc ▬▬▬▬▬▬ This meetup is sponsored by MayaData, which helped start the DOK.community and remains an active supporter. 
October 14, 2020
#12 DoK community: PostgreSQL as a Service on K8s at Zalando // Alexander Kukushkin
// Abstract: PostgreSQL is a powerful, open-source object-relational database system with over 30 years of active development that has earned it a strong reputation for reliability, feature robustness, and performance, but a production-grade deployment requires many complementary technologies to the database core: high availability and automated failover, backup and recovery, monitoring and alerting, centralized access control and logging, connection pooling, and so on. Being not initially accustomed for running stateful workloads, Kubernetes with its infrastructure as a code paradigm, CustomResourceDefinition, and Operator pattern turned out to be extremely convenient for deploying and running PostgreSQL at scale. I will talk about a few open-source project developed and maintained by database team at Zalando which anybody could use to build own PgaaS: 1. https://github.com/zalando/patroni - Tool for PostgreSQL high availability and cluster management. Integrates with K8s API and makes PostgreSQL cloud-native. 2. https://github.com/zalando/spilo - The Docker image that packages Patroni, multiple versions of PostgreSQL, and tools for backup and recovery. 3. https://github.com/zalando/postgres-operator - Implements Kubernetes operator pattern, orchestrates hundreds and thousands deployments of Patroni/Spilo clusters. Aforementioned projects would never get to the current state without an effort of dozens of external contributors. // Bio: During his professional career, Alexander touched PostgreSQL from all possible sides: as a Web Developer, as System Administrator, and as a Database Engineer now. Five years ago he started working on Patroni project and achieved quite a big success with it. Nowadays Patroni is used in the heart of PostgreSQL-as-a-Service on Kubernetes at Zalando to run 1000+ PostgreSQL HA clusters. Besides that, Alexander periodically reports bugs and contributes patches to PostgreSQL and some other open-source projects (usually Postgres related). 
▬▬▬▬▬▬ Show Notes ▬▬▬▬▬▬ Remember to put your toys away: https://twitter.com/braddwyer/status/1303751133765283840 Getting Write happy and burning out your SSDs https://blog.okmeter.io/real-world-ssd-wearout-a3396a35c663 GitLab Postmortem https://about.gitlab.com/blog/2017/02/10/postmortem-of-database-outage-of-january-31/ ▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://go.dok.community/slack Follow us on Twitter: @dokcommunity Register for the next meetup: // Register here: https://go.dok.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Alexander on Twitter: https://twitter.com/cyberdemn Connect with Bart on LinkedIn: https://www.linkedin.com/in/bart-farrell/ // Final thoughts This is a Fireside chat all audience can participate and ask questions and you can also join the conversation beforehand in our slack group: https://go.dok.community/slack You can also check out some of our old meetups on youtube here: https://www.youtube.com/channel/UCUnXJbHQ89R2uSfKsqQwGvQ
October 8, 2020
DoK Season 1 Extras - #1 - Is my data secure in K8s? // Asier Azaceta
Bart Farrell interviews Asier Azaceta, Cloud Security Architect in the IBM European Centre of Competence
October 4, 2020
#11 DoK community: Doing Data Wrong // Jeremy Tanner & David McKay
For our 11th installation of the data on k8s meetup, we talk with both Sr Tech Evangelists Jeremy Tanner and David McKay from Packet about doing data wrong on k8s. // Key takeaways: Data is hard with Kubernetes, especially when adding bare metal to the equation. Attendees will leave with an idea of some pain points they can avoid. // Abstract: In this talk, we'll look at great ways to lose data (like running databases on Kubernetes and bare metal), pain points for developers, lessons we've learned, and have a Festivus in September airing of grievances sessions for those who have felt this pain. // Jeremy Bio He / Him. Helps developers understand bare metal at Packet, an Equinix company. Based in Austin, TX and eagerly awaiting a time he can again appear in a city near you. Always an advocate (Developer, Community, BBQ). Likes Motorcycles, bicycles, and mechanical keyboards. Not Questlove. @Penguin on twitter // David Bio David is a Senior Developer Advocate at Packet and a member of the Kubernetes org and release team. As a professional technology magpie, David was an early adopter of cloud, container, and cloud-native technologies; crossing the murky waters of AWS in 2008, Docker in 2014, and Kubernetes in 2015. With an insatiable love for technology, David is always on the hunt to learn and share knowledge with others in fun and exciting ways. @rawkode on twitter ▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://go.dok.community/slack Follow us on Twitter: @dokcommunity Register for the next meetup: // Register here: https://go.dok.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Jeremy on Linkedin: https://www.linkedin.com/in/jeremytanner/ Connect with David on LinkedIn: https://www.linkedin.com/in/rawkode/ ▬▬▬▬▬▬ Supporters of the DoKc ▬▬▬▬▬▬ This meetup is sponsored by MayaData, which helped start the DOK.community and remains an active supporter. 
September 30, 2020
#10 DoK community: Data on Kubernetes and container attached storage - an update // Evan Powell
For our 10th installation of the data on k8s community meetup, we talk with CEO of Mayadata Evan Powell about container attached storage, Portworx acquisition, openEBS, can open source make it, and we geek out a bit with questions from the audience. // Key takeaways: Kubernetes - extended by CAS and other software - is not just solving the problems of running data on Kubernetes, it is fast becoming the preferred platform for data. // Abstract: Back in 2018 the CNCF published a blog we wrote called Container Attached Storage. Today - September 22nd 2020 - a new blog is appearing on their site updating Container Attached Storage. https://bit.ly/2FYGgeR This talk borrows very heavily from that blog. What is CAS? Why would anyone use Kubernetes itself for storage? How does a microservices architecture help? Why is shared storage at the end of the road - though still used underneath CAS sometimes? // Evan Bio Evan has helped conceptualize, fund, position, lead and scale a few eventually important enterprise infrastructure software companies which were acquired. He is also a part time investor and mentor. Evan is currently CEO at MayaData. MayaData enables the use of Kubernetes as a data layer and sponsors open source projects including the CNCF project OpenEBS and the LitmusChaos engineering project. MayaData also helps to sponsor DOKC. Previously Evan helped conceptualize and build StackStorm, a DevOps workflow automation company, where he was co-founder and CEO. StackStorm was acquired and the project became a Linux Foundation project. Before StackStorm, Evan was founding CEO of Nexenta Systems which defined and led the open storage and software-defined storage space. 
▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://go.dok.community/slack Follow us on Twitter: @dokcommunity Register for the next meetup: // Register here: https://go.dok.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Evan on Linkedin: https://www.linkedin.com/in/epowell/ Follow Evan on Twitter: https://twitter.com/epowell101
September 24, 2020
#9 DoK community: Geospatial Sensor Networks and Partitioning Data // Alex Miłowski
For our 9th installation of the Dokc data on k8s meetup, we will be talking with Alex Milowski from Redis Labs. // Key takeaways: How are data collection and consumption workloads fundamentally different? What are the main challenges for sensor networks? How are those challenges address within the context of K8s? // Abstract: We use resources like weather reports or air quality measurements to navigate the world. These resources become especially important when faced by extreme events like the current wildfires in the Western USA. The data for the reports, predictions, and maps all start as realtime sensor networks. In this talk, Alex will present some of his research into scientific data representation on the Web and how the key mechanism is the partitioning, annotation, and naming of data representations. We’ll take a look at a few examples, including some recent work on air quality data relating to the current wildfires in the western USA. We’ll explore the central question of how geospatial sensor network data can be collected and consumed within K8s deployments. // Alex Bio Dr. Milowski is a researcher, developer, entrepreneur, mathematician, and computer scientist. He has been involved in the development of Web and Semantics technologies since the early 1990's, primarily focusing on data representation, algorithms, and processing data at scale; also, an experienced developer skilled in a variety of functional and imperative languages. He received his PhD in Informatics (Computer Science) from the renowned University of Edinburgh School of Informatics (Scotland) on large-scale computation over scientific data on the Web in 2014. Various experience in scientific computing - geospatial and genome data pipelines - and big data platforms. Recently, he has been working in telecommunications on various mobile financial applications and researching how to improve the productivity of machine learning systems and data scientists by utilizing Kubernetes as a platform. 
He has experience teaching, mentoring, and developing within various data science/ML domains including topics such as cloud computing, Kubernetes, Spark, Hadoop, text processing/NLP, deep learning, data acquisition, and a whole lot of Python. ▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://join.slack.com/t/dokcommunity/shared_invite/zt-g3ui5r0g-jDKz5dhh2W1ayElqwKYYAg Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Alex on Linkedin: https://www.linkedin.com/in/alexmilowski/ ▬▬▬▬▬▬ Supporters of the DoKc ▬▬▬▬▬▬ This meetup is sponsored by MayaData, which helped start the DOK.community and remains an active supporter. MayaData sponsors two Cloud Native Computing Foundation (CNCF) projects, OpenEBS (http://www.openEBS.io) - the leading open-source container attached storage solution - and Litmus (https://litmuschaos.io/) - the leading Kubernetes native chaos engineering project, which was recently donated to the CNCF as a Sandbox project. As of August 2020, MayaData is the fifth-largest contributor to CNCF projects. Well-known users of MayaData software include the CNCF itself, Bloomberg, Comcast, Arista, Orange, Intuit, and others. Check out more info at https://mayadata.io/
September 17, 2020
#8 DoK community: Appropriate workloads for databases in K8s // Rick Vasquez
For our 8th installment of the Data on K8s meetup, we spoke with Rick Vasquez, Enablement Lead - Services Portfolio at Percona. // Key takeaways: Large unsharded data footprints are not great for Kubernetes. Deploying databases in Kubernetes is much different than just running an application in a container. Open source is playing catch-up with regard to persistence and Kubernetes. // Abstract: As more companies move to Kubernetes and cloud native as a standard for developing net new functionality, something has to happen to the legacy workloads. We often see a lift-and-shift mentality toward Kubernetes; we will talk about how that mentality can be dangerous or cause more work than expected. // Rick Bio: Hands-on technical practitioner with real-world experience in some of the largest open source database deployments in the world. Translating technical and business requirements to the relevant groups is my passion. // Check out the Percona Live event: http://percona.com/live ▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://join.slack.com/t/dokcommunity/shared_invite/zt-g3ui5r0g-jDKz5dhh2W1ayElqwKYYAg Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Rick on LinkedIn: https://www.linkedin.com/in/rickvasquezii/
September 9, 2020
#7 DoK community: Conway’s Law & Kubernetes: Centralization vs. small team autonomy // Joseph Sandoval & Mike Tougeron
Data on Kubernetes #7: Conway’s Law & Kubernetes - Centralization vs. small team autonomy, with Mike Tougeron, Lead Site Reliability Engineer at Adobe, and Joseph Sandoval, SRE Manager, Platform Infrastructure, at Adobe. Loosely coupled teams, loosely coupled workloads, and loosely coupled data - on a built-for-everyone platform? Abstract: Big clusters or small clusters? Where do you draw the line, and how do you know what’s best for your use case? We speak with Joseph and Mike from Adobe about the inevitable questions that arise when running K8s at scale. If it is run by the platform team, is it inevitably a pet? Or more of a pet? Is that the idea, that we give stuff that “must not fail” to platform teams so they are common services with SLAs? Or how is it decided what is owned by the platform vs. the individual teams? While talking with Joseph and Mike we also dive into what their stack looks like, must-have tools they use on a daily basis, VMs vs. K8s, differences in stateful apps on K8s, and war stories! Mike T Bio: For many years Mike has been building Kubernetes platforms and deployments. With a passion for automation and developer engagement, Mike works towards continuously improving development pipelines to take the complication out of managing services on large-scale infrastructure backed by both VMs and containers across multi-cloud environments. Mike is a lazy programmer who would rather write thousands of lines of code for automation instead of running repetitive commands every day. When not coding or playing with his son, you’ll find Mike with his nose buried in a book or playing Civilization. Joseph Sandoval Bio: Joseph has been in the tech industry for 25 years, running large-scale infrastructure primarily in e-commerce and SaaS. The last 15 years have been spent in leading and managing roles with high-performing teams. His philosophy is to create space for SREs to do their best work and to espouse a product-centric point of view when it comes to backend infrastructure and tooling. He currently manages a team responsible for over 200k cores of infrastructure (bare metal, VMs, and containers) in 6 datacenters and 3 AWS regions. ▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://join.slack.com/t/dokcommunity/shared_invite/zt-g3ui5r0g-jDKz5dhh2W1ayElqwKYYAg Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Follow Mike on Twitter: @mtougeron Connect with Mike on LinkedIn: https://www.linkedin.com/in/mtougeron/
September 3, 2020
#6 DoK community: Operators, operators, operators… operators // Amit Gupta
Data on Kubernetes Community #6: Operators, operators, operators… Kubernetes operators! With Amit Gupta, Group Product Manager at Confluent. Key takeaways: Kubernetes Operators represent a great opportunity for the data community to solve for the complexities of managing data products for their customers in a way that standardizes UX and integration points -- historically the most powerful solutions had to be niche and highly customized. Abstract: - Evolution of this space and how we’ve come to operators - Managed services vs. running your own - Are people succeeding or stumbling with K8s? - Common challenges for data products on K8s (interested in the community conversation here: what are some common challenges and solutions people are seeing?) Bio: Amit is a Group Product Manager at Confluent, helping to bring the cloud-native experience to Kafka and event streaming. He’s passionate about cloud, container orchestration, and infrastructure automation, and wants to see the work of choosing, deploying, and successfully leveraging powerful distributed systems become a piece of cake for everyone in IT. In the past, Amit has worked as Director of Product Management for Platform at Pivotal, and as a Software Engineer working on cloud orchestration technology (think Kubernetes but for VMs) and container scheduling (think Kubernetes, pre-Kubernetes). He has a Master’s degree in mathematics from the University of California, Berkeley. He loves soccer, camping, and coffee. ▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://join.slack.com/t/dokcommunity/shared_invite/zt-g3ui5r0g-jDKz5dhh2W1ayElqwKYYAg Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Follow Amit on Twitter: https://twitter.com/amitkgupta84 Connect with Amit on LinkedIn: https://www.linkedin.com/in/amitkgupta84/ ▬▬▬▬▬▬ Supporters of the DoKc ▬▬▬▬▬▬ This meetup is sponsored by MayaData, which helped start the DOK.community and remains an active supporter. 
MayaData sponsors two Cloud Native Computing Foundation (CNCF) projects, OpenEBS - the leading open-source container attached storage solution - and Litmus - the leading Kubernetes native chaos engineering project, which was recently donated to the CNCF as a Sandbox project. As of June 2020, MayaData is the sixth-largest contributor to CNCF projects. Well-known users of MayaData products include the CNCF itself, Bloomberg, Comcast, Arista, Orange, Intuit, and others. Check out more info at https://mayadata.io/
August 26, 2020
#5 DoK community: The full cycle of doing data on k8s: a case study // Dave Cook
Doing Data on Kubernetes: this week we dive into globally distributed business applications with Dave Cook, Founder of Gridworkz. Key takeaways: Current data scalability challenges outlined. What’s available today. Abstract: Scaling ACID-compliant databases in the cloud is challenging. We’ll look at a specific use case where we’re trying to scale a SaaS Odoo ERP offering on Kubernetes and build a scalable Postgres cluster as a backend service. Bio: Dave is the founder of Gridworkz, which provides business software solutions to the SMB market. He is passionate about DevOps and leading-edge open source technology and its practical application to business. Other interests include trying to pay it forward by applying his knowledge to help startups succeed. His latest interest is scaling Odoo and Postgres on Kubernetes. With a long history of working with traditional enterprise systems, he sees tremendous potential in redefining how systems will look today and well into the future. ▬▬▬▬▬▬ Connect with us 👋 ▬▬▬▬▬▬ Join our slack: https://join.slack.com/t/dokcommunity/shared_invite/zt-g3ui5r0g-jDKz5dhh2W1ayElqwKYYAg Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Dave on Twitter: https://twitter.com/davef_cook
August 21, 2020
#4 DoK community: The problem of stateful workloads - balance of keeping data HA vs. costs // Ren Lee
Balancing redundancy and HA with costs: did you really need all N replicas? AKA We were running what and it cost us how much?! With Ren Lee, SRE at Arista Networks. Key takeaways: - “Lazy but Simple” vs. “Proactive but Expensive” methods of scaling: knowing when to pay the seemingly scarier price of running infrastructure rather than spending engineering time, and vice versa - Hidden costs: the cost of bad deployments and things that just don’t work - When autoscaling becomes the demon: especially in public cloud environments, when access to pools of resources is no longer your barrier Abstract: In an engineer’s ideal world we would love all the resources and redundancies we can possibly get for our services and the infrastructure that supports them, for sanity and, of course, HA. However, how do you balance “enough” redundancy against the actual operational costs of supporting such engineering choices, and what are some of the tough engineering decisions that need to be made? This talk focuses primarily on services being run on Kubernetes (or a public cloud offering of Kubernetes), but the principles can be extended to any infrastructure environment. Key Topics: capacity planning, cost management, distributed services Bio: Ren is an SRE at Arista Networks on the CloudVision services team. Deeply passionate about fixing broken things without anyone noticing and using effective monitoring to preempt potential disasters. Wrangler of services that run on Kubernetes to keep the zoo running any day, every day. Join our slack: https://join.slack.com/t/dokcommunity/shared_invite/zt-g3ui5r0g-jDKz5dhh2W1ayElqwKYYAg Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Ren on LinkedIn: https://www.linkedin.com/in/therendeye/
August 13, 2020
#3 DoK community: Design considerations for operationalizing Distributed SQL on Kubernetes // Nikhil Chandrappa
Distributed databases on Kubernetes. And we just keep rolling along! Round 3 of the Data on Kubernetes community meetup! This time we will be talking with Nikhil Chandrappa, Lead Software Engineer at YugabyteDB. We will take a practical look at running distributed SQL on Kubernetes using YugabyteDB. Key takeaways: - Introduction to YugabyteDB Distributed SQL databases and its design principles - Design considerations for operationalizing Distributed SQL on Kubernetes - Deployment strategies for clustered databases - Storage orchestration on Kubernetes - Yugabyte's approach for DBaaS on Kubernetes - DB creation, scale up / scale down - Implementing Day 2 operations for distributed SQL databases - upgrades, backups, and monitoring - Distributed SQL demo: a real-world e-commerce application Abstract: This talk is targeted towards cloud-native developers and architects looking to deploy an operational database on Kubernetes. We are going to walk you through the design decisions YugabyteDB's team took when architecting the database as a service on Kubernetes. We are going to cover concepts related to Kubernetes volume provisioning, pod placement strategies for data resilience and high availability, and how cluster events are used for reconciling the K8s workloads during Day 2 operations like upgrades and scale-up/down. Bio: Nikhil is an ecosystem engineer at Yugabyte. He is leading the efforts on YugabyteDB integrations with open source developer tools like GraphQL, Spring Data, R2DBC, and Kubernetes. He also works with the developer community on the adoption of Distributed SQL databases in cloud native apps. Before joining Yugabyte, he worked as a senior data engineer at Pivotal, which is now part of VMware Tanzu, championing cloud native data APIs and in-memory data grids for Fortune 500 customers. He has presented at major developer conferences: SpringOne Platform, PostgreSQL conf, and JPMC tech fest. He is originally from Mysore, India, and graduated with a master's degree in Computer Engineering from Syracuse University. I am currently looking for speakers who can talk about things such as operators, databases, multicloud/hybrid, or anything else that could be interesting for the SRE engineering crowd. Join our slack: https://join.slack.com/t/dokcommunity/shared_invite/zt-g3ui5r0g-jDKz5dhh2W1ayElqwKYYAg Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Nikhil on LinkedIn: https://www.linkedin.com/in/nikhilmc/ ||SHOW NOTES|| Slides: https://docs.google.com/presentation/d/1MOYgKm3EuhQHY2ryxSC3qFId2snCI0nPzdKa28wL4EI/edit?usp=sharing YugaByte CTO's talk about logical clocks: https://blog.yugabyte.com/distributed-postgresql-on-a-google-spanner-architecture-storage-layer/ Link to Yugabyte hiring page: https://blog.yugabyte.com/insert-into-yugabyte-were-hiring-july-2020-edition/ Getting started with YugabyteDB: https://download.yugabyte.com/ Learn more about the internals of Distributed SQL: https://blog.yugabyte.com/distributed-postgresql-on-a-google-spanner-architecture-query-layer/ Learn more about Microservices + YugabyteDB: https://www.yugabyte.com/spring/
August 6, 2020
#2 DoK community: Data on k8s maturity check // Zach Dunn
Data on Kubernetes community virtual meetup #2 with Zach Dunn, Senior Director of Platform Operations, Optoro. We talk today about how the team at Optoro has implemented stateful apps on Kubernetes and what they have learned along the way. Let’s talk about storage. Optoro has moved to running stateful stores on Kubernetes. It’s a challenge, but it has a lot of value. Let’s talk about how we chose to do it, and what we figured out along the way. Key takeaways: Learn about how Optoro chose to run stateful services on Kubernetes. Laugh at other people's mistakes! Cry with us over our pain. Maybe question some of the assumptions around storage in K8s. Wonder about the future of data in Kubernetes. Bio: Zach has spent most of his tech career in something that vaguely resembles production infrastructure. This has ranged from hot aisle drudgery to professional nerd herder and budget owner. Once an individual, he is now collectively just known as "Arabella's Dad" or on occasion "William's Dad". He is currently the Senior Director of Platform Operations and CISO at Optoro, helping teams build and deliver modern software for enterprise clients. Join our slack: https://join.slack.com/t/dokcommunity/shared_invite/zt-g3ui5r0g-jDKz5dhh2W1ayElqwKYYAg Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Zach on LinkedIn: https://www.linkedin.com/in/zach-dunn-11507217/
July 29, 2020
#1 DoK community: Is k8s even ready for data? // Patrick McFadin
Our inaugural Data on Kubernetes event kicked off with Patrick McFadin, VP of Developer Relations at DataStax, talking about his vision for the future of doing data on K8s. Kubernetes has been a great solution for deploying application infrastructure. Trying to manage your data with the same control plane has been less than ideal. This has been even more true when using distributed databases like Apache Cassandra. Once you get past the storage and stateful sets, you still have a lot to do. Let’s have a frank talk about the new opportunities to make Kubernetes ready for data. Patrick McFadin is the VP of Developer Relations at DataStax, where he leads a team devoted to making users of Apache Cassandra successful. He has also worked as Chief Evangelist for Apache Cassandra and as a consultant for DataStax, where he helped build some of the largest and most exciting deployments in production. Patrick is active in the Apache Cassandra project and a member of the CNCF TOC. The new data on k8s community (DOKC) is an openly governed and self-organizing group of curious and experienced operators and engineers concerned with running data-intensive workloads on Kubernetes. We will have weekly meetups on Tuesdays at 5pm UK / 9am PST, and everything will be recorded and put up on YouTube and podcast land. I am currently looking for speakers who can talk about things such as operators, databases, multicloud/hybrid, or anything else that could be interesting for the SRE engineering crowd. Join our slack: https://join.slack.com/t/dokcommunity/shared_invite/zt-g3ui5r0g-jDKz5dhh2W1ayElqwKYYAg Follow us on Twitter: @dokcommunity Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Patrick on LinkedIn: https://www.linkedin.com/in/patrick-mcfadin-53a8046/ Come learn about running Cassandra in the DataStax hands-on workshop: https://www.datastax.com/events/cassandra-workshop-series
July 23, 2020