Skip to main content
OpenObservability Talks

OpenObservability Talks

By Dotan Horovits

On OpenObservability Talks we discuss harnessing the power of open source to advance observability initiatives for DevOps practitioners around the world.

We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and chime in with your comments and questions on the live chat.
www.youtube.com/@openobservabilitytalks

You can find us on Twitter @openobserv
Where to listen
Apple Podcasts Logo

Apple Podcasts

Google Podcasts Logo

Google Podcasts

Overcast Logo

Overcast

Pocket Casts Logo

Pocket Casts

RadioPublic Logo

RadioPublic

Spotify Logo

Spotify

Meta’s data driven approach to observability - OpenObservability Talks S3E06
Meta’s data driven approach to observability - OpenObservability Talks S3E06
At Meta (Facebook, Instragram et al) everything is data, and data driven approach is the rule, from product to engineering, from HR to finance. This is also how the team at Meta treats observability. Let’s see how we treat observability as a data analytics problem, and what you can implement, even if you’re not a hyperscaler. On this episode I’ll host David Ostrovsky from Meta. David is a software developer with over 20 years of industry experience, speaker, trainer, blogger and co-author of “Pro Couchbase Server”. He specializes in large-scale distributed system architecture. The episode was live-streamed on 7 November 2022 and the video is available at https://youtu.be/1l0HKUDoX4Q OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube. We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and chime in with your comments and questions on the live chat. https://www.twitch.tv/openobservability https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/ Show Notes: Addressing business observability Machine learning and predictions in observability From the business inwards, using SLOs Accelerate engineering quality with developer observability Organizational and communications aspects of high scale observability Actionable observability How small-medium size orgs can achieve a similar effect OpenTelemetry demo is GA PromLens is open sourced and contributed to Prometheus Resources: TEMPLE signals for observability: https://medium.com/@YuriShkuro/temple-six-pillars-of-observability-4ac3e3deb402 PromLens open sourced: https://promlabs.com/blog/2022/10/25/promlabs-and-chronosphere-open-source-the-promlens-query-builder Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/
01:00:20
November 23, 2022
Platform Engineering: DevOps evolution or a fancy rename? - OpenObservability Talks S3E05
Platform Engineering: DevOps evolution or a fancy rename? - OpenObservability Talks S3E05
Everyone’s talking about Platform Engineering these days. Even Gartner featured it in its Hype Cycle for Software Engineering 2022. But what is Platform Engineering really about? Is it the next stage in the evolution of DevOps? Is it just a fancy rebrand for DevOps or SRE? And how does observability pertain to platform engineering? On this episode of OpenObservability Talks Horovits hosted George Hantzaras, Director of Cloud Platform Engineering at Citrix. George is a distributed systems expert and a hands-on engineering leader with focus on delivering B2B cloud services at scale. Coming from a DevOps background, he focuses on implementing SRE at enterprise scale working with cloud native technologies. He has been organizing the Athens Cloud Computing Meetup since 2016 and the Athens Hashicorp User Group. Most recently, he has been a speaker at global events like Hashiconf, DeveloperWeek, Voxxed Days, DevNexus and more, focusing on reliability engineering, agile leadership, scaling engineering teams, and entrepreneurship. The episode was live-streamed on 6 October 2022 and the video is available at https://youtu.be/zJGMUVY6fDM OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube. We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and chime in with your comments and questions on the live chat. https://www.twitch.tv/openobservability https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/ Show Notes: Why Platform Engineering? Different teams employ different stacks. How does Platform Engineering fits in? Platform Engineering at Citrix How does it map to the common DevOps maturity models? Is Platform Engineering different from PaaS? Platform as a product Culture element is central in Platform Engineering Generating buy-in within the product’s engineering Metrics and quantifying the benefits of Platform Engineering Balancing flexibility and simplicity, and striking the right level of abstraction Delimitation between what’s developed by Platform vs. Product teams. Generating Golden Paths for engineering Observability and Platform Engineering Resources: George Hantzaras presentations Designing Golden Paths platformengineering.org Platform Engineering KPIs Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/
58:15
October 27, 2022
Where Are My App’s Traces?? Instrumentation in Practice - OpenObservability Talks S3E04
Where Are My App’s Traces?? Instrumentation in Practice - OpenObservability Talks S3E04
Instrumentation is that black magic that makes our application emit traces, logs, metrics or other telemetry. How does it work? What options are available in different programming languages, such as Java, Python and Go? What does OpenTelemetry offer in this domain?   On this episode of OpenObservability Talks I hosted Eden Federman, Co-Founder & CTO at keyval, a company focused on making observability simpler. Eden is the creator of two open source projects: Odigos and Go automatic instrumentation (now part of OpenTelemetry). Eden is passionate about everything related to observability and performance monitoring. He also created kubectl-flame, a profiler for Kubernetes. The episode was live-streamed on 22 September 2022 and the video is available at https://youtu.be/VFykWV1mLAI OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube. We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and chime in with your comments and questions on the live chat. https://www.twitch.tv/openobservability https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/ Show Notes: What is instrumentation Manual and Automatic instrumentation Different languages offer different options Java instrumentation capabilities Go instrumentation capabilities Instrumentation when using programming frameworks eBPF use in auto-instrumentation New OpenTelemetry SIG for Go auto-instrumentation Odigos open source project Best practices for instrumentation Resources: Instrumentation basics and OpenTelemetry support eBPF Automatic instrumentation for Go Odigos (Observability Control Plane) OpenTelemetry Go auto instrumentation SIG How OpenTelemetry works under the hood in JavaScript Spring Boot instrumentation Spring Cloud Sleuth 1.1.0 released
01:00:26
September 29, 2022
SigNoz: Open-source observability with Pranay - OpenObservability Talks S3E03
SigNoz: Open-source observability with Pranay - OpenObservability Talks S3E03
In this episode, hosted by Jonah Kowall we are excited to introduce Pranay Prateek the co-founder and creator of SigNoz to the audience. He is an entrepreneur who has been building an interesting and exciting open-source observability platform. The goal is a unified interface and data storage system which is scalable and manageable much more easily than today’s disjointed open-source tools. SigNoz has a goal of unifying tracing, metrics, and logging in a single platform and interface. We’ll dig into the evolution of this unique project from its creation until today, along with the adoption of the technology and contribution from the community. We will discuss the data architecture and their move from Druid towards ClickHouse, which is always a fascinating topic. Finally, we’ll elaborate on the future of SigNoz and where things are going with the project. Join me, Jonah Kowall, as I speak with Pranay Prateek on this episode of OpenObservability Talks. The podcast episodes are available for listening on your favorite podcast app and on this YouTube channel. We live-stream the episodes, and you’re welcome to join the stream here on YouTube Live or at https://www.twitch.tv/openobservability​. Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://forms.gle/9LDkYCmegyS5D8Li7​
56:17
August 25, 2022
Expensive Observability: The Cardinality Challenge - OpenObservability Talks S3E02
Expensive Observability: The Cardinality Challenge - OpenObservability Talks S3E02
We all collect logs, metrics and perhaps traces and other data types, in support of our observability. But this can get expensive pretty quickly, especially in microservices based systems, in what is commonly known as “the cardinality problem”.   On this episode of OpenObservability Talks I’ll host Ben Sigelman, co-founder and the GM of Lightstep, to discuss this data problem and how to overcome it. Ben architected Google’s own planet-scale metrics and distributed tracing systems (still in production today), and went on to co-create the open-source OpenTracing and OpenTelemetry projects, both part of the CNCF. The episode was live-streamed on 12 July 2022 and the video is available at https://youtu.be/gJhzwP-mZ2k OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube. We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and pitch in with your comments and questions on the live chat. https://www.twitch.tv/openobservability https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/ Show Notes: The difference between monitoring, observability and APM What comprises the cost of observability How common is the knowledge of cardinality and how to add metrics Controlling cost with sampling, verbosity and retention Lessons from Google’s metrics and tracing systems Using metric rollups and aggregations intelligently Semantic conventions for logs, metrics and traces OpenCost project New research paper by Meta on schema-first approach to application telemetry metadata OTEL code contributions - published stats Resources: Monitoring vs. observability: https://twitter.com/el_bhs/status/1349406398388400128 The two drivers of cardinality: https://twitter.com/el_bhs/status/1360276734344450050 Sampling vs verbosity: https://twitter.com/el_bhs/status/1440750741384089608 Observing resources and transactions: https://twitter.com/el_bhs/status/1372636288021524482 Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/
01:00:03
July 28, 2022
OpenTelemetry and the Vision for Unified Open Observability - OpenObservability Talks S3E01
OpenTelemetry and the Vision for Unified Open Observability - OpenObservability Talks S3E01
OpenTelemetry is one of the most fascinating and ambitious open source projects of this era. It’s currently the second most active project in the CNCF (the Cloud Native Computing Foundation), with only Kubernetes being more active. The entire industry is aligning behind this project, including incumbent monitoring vendors that were deeply vested in proprietary and closed-source agents to that end. In this episode of OpenObservability Talks I’ll host Alolita Sharma to discuss OpenTelemetry, its origins and mission statement, as well as updates hot off the press from the recent KubeCon conference in Valencia about releases and future plans. Alolita is co-chair of the CNCF Technical Advisory Group for Observability, member of the OpenTelemetry Governance Committee and a board director of the Unicode Consortium. She has served on the boards of the OSI and SFLC.in. Alolita has led engineering teams at Wikipedia, Twitter, PayPal, IBM and AWS. Two decades of doing open source continue to inspire her. The episode was live-streamed on 15 June 2022 and the video is available at https://youtu.be/IK2TWOzDUBI  OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube. We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and pitch in with your comments and questions on the live chat. https://www.twitch.tv/openobservability https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/ Show Notes: Hot updates from KubeCon EMEA 2022 Alolita Sharma introduction The state of OpenTelemetry When OpenTelemetry Logging is expecting GA The onboarding challenge of instrumentation Client side instrumentation and real user monitoring Adding continuous profiling telemetry to OpenTelemetry Interoperability between OpenTelemetry and Prometheus Challenges in OpenTelemetry and observability Where OpenTelemetry is heading next Jaeger OSS now accept OTLP (OpenTelemetry protocol) Resources: OpenTelemetry Metrics reaches RC: https://opentelemetry.io/blog/2022/metrics-announcement/ OpenTelemetry guide: https://logz.io/learn/opentelemetry-guide/ CI/CD Observability: https://horovits.medium.com/fighting-slow-and-flaky-ci-cd-pipelines-starts-with-observability-19da2ac94677 Jaeger can now accept OpenTelemetry protocol https://medium.com/jaegertracing/introducing-native-support-for-opentelemetry-in-jaeger-eb661be8183c OTel Community Day summary: http://paulsbruce.io/blog/2022/06/opentelemetry-community-day-austin-2022 Contextual Logging in Kubernetes 1.24 https://kubernetes.io/blog/2022/05/25/contextual-logging/  PolarSignals announced FrostDB https://www.polarsignals.com/blog/posts/2022/05/04/introducing-arcticdb/  Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/
01:00:37
June 28, 2022
Observability for Developers Demystified - OpenObservability Talks E2E12
Observability for Developers Demystified - OpenObservability Talks E2E12
Developers hate monitoring, but we need it. We need it in many points of the software development lifecycle: before deprecating an API, before launching a new feature, after launching the feature, and more. In fact, monitoring needs can vary much more than the classic Ops monitoring. In this episode I’ll host Liran Haimovitch to discuss how to determine what developers should be monitoring, the difference between observability for Dev and for Ops, and how observability fits into our current dev tools, dev stack and dev processes. Liran is the Co-Founder and CTO of Rookout. He’s an Observability and Instrumentation expert with a deep understanding of Java, Python, Node, and C++. Liran has broad experience in cybersecurity and compliance from his past roles. When not coding, you can find Liran hosting his podcast, speaking at conferences, writing about his tech adventures, and trying out the local cuisine when traveling. The episode was live-streamed on 10 May 2022 and the video is available at https://youtu.be/OaHQp-qnVN0  OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube. We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and pitch in with your comments and questions on the live chat. https://www.twitch.tv/openobservability https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/ Show Notes: Which data do we need to collect for our observability How is observability for dev different from ops How does observability fit into dev tool stack Snapshots provide deep-dive telemetry signal Dynamic instrumentation Snapshots support in programming languages and runtimes Open source standardization around snapshots The cost associated with observability Google is applying to contribute Istio to the CNCF Shopify case study for observability team Resources: Istio applying to the CNCF: https://istio.io/latest/blog/2022/istio-has-applied-to-join-the-cncf/ Shopify case study for Observability team: https://ericmustin.substack.com/p/notes-on-an-observability-team?s=r Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/
58:32
May 26, 2022
OpenSearch 2.0 and beyond with Eli - OpenObservability Talks E2E11
OpenSearch 2.0 and beyond with Eli - OpenObservability Talks E2E11
OpenSearch is a community-driven, open-source search and analytics suite derived from Apache 2.0 licensed Elasticsearch 7.10.2 & Kibana 7.10.2. The OpenSearch project started just over a year ago and is now the open-source alternative to ELK, which is no longer open source. The team has spent much of the last year getting the project going, but there was innovation as well. We will cover and discuss what OpenSearch has accomplished, but more importantly what’s coming next, including a big 2.0 release. We are joined in this episode by Eli Fisher, who is the product lead at AWS, working on the OpenSearch project. He’ll dive into recent launches, including several observability features, and innovations planned for 2.0 and beyond.    The podcast episodes are available for listening on your favorite podcast app and on this YouTube channel.   We live-stream the episodes, and you’re welcome to join the stream here on YouTube Live or at https://www.twitch.tv/openobservability​.    Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://forms.gle/9LDkYCmegyS5D8Li7​
01:01:12
April 28, 2022
SLO Driven Engineering: from Dev to Prod - OpenObservability Talks S2E10
SLO Driven Engineering: from Dev to Prod - OpenObservability Talks S2E10
Google’s SRE Book popularized the concept of Service Level Objective (SLO) and the SLO-driven approach. But what does it really mean to make SLO driven decisions? How can we generate observability and synchronize teams around joint SLOs? And how can we automate SLOs and integrate them into the software release pipeline? In this episode I’ll host Andreas Grabner. We’ll discuss the SRE practices, and how to automate SLO from dev all the way to prod. We’ll talk about the open source efforts to standardize the process under the Continuous Delivery Foundation, and about Keptn, the new CNCF open source project that promises to help with this automation. Andreas Grabner (@grabnerandi) has 20+ years of experience as a software developer, tester and architect and is an advocate for high-performing cloud scale applications. He is a contributor and DevRel for the CNCF open source project keptn (www.keptn.sh). Andreas is also a regular contributor to the DevOps community, a frequent speaker at technology conferences and regularly publishes articles on blog.dynatrace.com or medium. In his spare time you can most likely find him on one of the salsa dancefloors of the world. The episode was live-streamed on 15 March 2022 and the video is available at https://youtu.be/J81byOpVqrk  OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube. We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and pitch in with your comments and questions on the live chat. https://www.twitch.tv/openobservability https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/ Show Notes: What’s SRE Where is SRE placed in the organization SRE vs. DevOps Good and bad SLOs How to define SLOs top-down Who owns SLO definition, monitoring, remediation Where is SRE within less mature organizations Keptn OSS project background Who uses and contributes to Keptn project What’s the CDF (Continuous Delivery Foundation) Creating a standard CD event format under the CDF (CDF Events SIG) Cloud Native Observability survey by the CNCF Resources: SLO in the age of microservices: Keptn OSS project: https://keptn.sh/ Keptn 0.14.0 major release TechWorld with Nana on Keptn CD Foundation - SIG Events: https://github.com/cdfoundation/sig-events PurePerformance podcast Cloud Native Observability survey by the CNCF Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVR
59:56
March 30, 2022
Building web-scale observability at Slack, Pinterest & Twitter - OpenObservability Talks S2E09
Building web-scale observability at Slack, Pinterest & Twitter - OpenObservability Talks S2E09
What does it take to build observability in a web-scale company such as Slack, Pinterest and Twitter? On this episode of OpenObsevability Talks I'll host Suman Karumuri to hear how he built these systems from the ground up on these #BigTech co's, about his recent research papers and more. Suman Karumuri is a Sr. Staff Software Engineer and the tech lead for Observability at Slack. Suman Karumuri is an expert in distributed tracing and was a tech lead of Zipkin and a co-author of OpenTracing standard, a Linux Foundation project via the CNCF. Previously, Suman Karumuri has spent several years building and operating petabyte scale log search, distributed tracing and metrics systems at Pinterest, Twitter and Amazon.  In his spare time, he enjoys board games, hiking and playing with his kids. The episode was live-streamed on 16 February 2022 and the video is available at https://youtu.be/IvidkV3TfYg  OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube. We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and pitch in with your comments and questions on the live chat. https://www.twitch.tv/openobservability https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/ Show Notes: * Who owns observability in large organizations? * The gaps in current way of handling metrics  * MACH research paper for metrics storage engine * The gaps in current way of handling logs Slack KalDB * SlackTrace - Slack in house tracing system  Resources: Research paper: building Observability Data Management Systems CIDR paper: Video SlackTrace blog post, talk. Logging at Twitter Pintrace: A Distributed Tracing Pipeline talk by Suman at LISA Observability Engineering book Observability Trends for 2022 Yelp engineering with Elasticsearch and Lucene Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/
58:39
February 27, 2022
SaaS Observability Done Right - OpenObservability Talks S2E08
SaaS Observability Done Right - OpenObservability Talks S2E08
SaaS (software as a service) is a popular model for many businesses today. SaaS businesses need agility to move fast and remain competitive. This means agility in the software IT stack, but also agility in the business models and product-led growth (PLG). Observability plays a key role in enabling SaaS organizations to move fast. Achieving this agility, however, raises specific observability requirements. On this episode of OpenObservability Talks we’ll host Aviad Mizrachi, the CTO and Co-Founder of Frontegg, to help us map these requirements. Having escorted dozens of SaaS businesses across many verticals, Aviad brings a wealth of experience in how today’s SaaS is built and operated, and will share his insights and best practices on how to design and build the observability stack right. Aviad has been a developer for the last 20 years. He held a few management and architecture positions on startups such as Vicon and HTS as well as in larger companies such as NICE and CheckPoint. Today at Frontegg Aviad works closely with many customers to help them build their SaaS solutions. The episode was live-streamed on YouTube Live and Twitch on 11 Jan 2022 and the video is available at https://www.youtube.com/watch?v=ZcneTMeBPeg  OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube.  We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and pitch in with your comments and questions on the live chat. https://www.twitch.tv/openobservability https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/   Show Notes: What characteristics in today’s SaaS businesses dictate/influence the tech choices How are SaaS systems built? Tech stack and architecture Which observability is needed for SaaS? Kubernetes & infra observability Availability, responsiveness, low latency are critical in SaaS product and business observability Observability has many stakeholders Recommended tooling for SaaS Correlating different data signals Persistence and the cost of storage Final tips for SaaS observability AWS recent outages and learnings Log4j recent CVEs  Resources: AWS outages and learnings: https://horovits.medium.com/retrospect-on-the-aws-outage-and-resilient-cloud-based-architecture-cc513a32747 Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/
58:18
January 31, 2022
Prometheus Pitfalls and the Rise of Continuous Profiling - OpenObservability Talks S2E07
Prometheus Pitfalls and the Rise of Continuous Profiling - OpenObservability Talks S2E07
We’ve grown to rely on “the three pillars” for observability - logs, metrics and traces. Popular frameworks such as Prometheus have helped popularize these practices. But now people are starting to realize that it’s not enough. On this episode Dotan Horovits will host Frederic Branczyk for a discussion about the unspoken pitfalls of Prometheus and the challenges of current observability coverage. We will also discuss the rise of Continuous Profiling as a new observability signal, what it’s about and where it can help. We’ll also review the recent launch of Parca, an open source project for continuous profiling that traces its roots to Red Hat’s internal ConProf open source tool. Frederic is the founder and CEO of Polar Signals. Before founding Polar Signals he was a senior principal engineer and the main architect for all things Observability at Red Hat, which he joined through the CoreOS acquisition. Frederic is a Prometheus and Thanos maintainer as well as the tech lead for the special interest group for instrumentation in Kubernetes. In a previous life, he was a security researcher working on key management solutions as well as intrusion detection systems. When not working on software Frederic enjoys obsessing over brewing a perfect cup of coffee. The episode was live-streamed at https://www.twitch.tv/openobservability on 16 December 2021 and the video is available at https://www.youtube.com/watch?v=G02g63oI0IA  OpenObservability Talks episodes are released monthly, on the last Thursday of each month. The episodes are also live-streamed on Twitch and YouTube Live - tune in to see us live, and pitch in with your comments and questions on the live chat. Show Notes: The limitations of the three pillars model of observability Prometheus strengths and pitfalls how to start with continuous profiling how to correlate between different telemetry Parca OSS intro eBPF turned out perfect for instrumenting continuous profiling Parca OSS future plan how is the performance penalty of continuous profiling kept low what's the solution for high cardinality in Prometheus? will Parca OSS be contributed to an established OSS foundation? Prometheus Agent mode released OTEL operator now has an instrumentation CR continuous profiling support for interpreted languages Resources: https://www.parca.dev/ https://github.com/google/pprof https://increment.com/containers/observing-containers-pillars-of-observability/ https://ebpf.io/ https://research.google/pubs/pub36575/ Social: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability Website: https://openobservability.io/
01:01:48
December 21, 2021
BPF origin story and the future of telemetry analytics OpenObservability Talks S2E06
BPF origin story and the future of telemetry analytics OpenObservability Talks S2E06
OpenObservability Talks S2E06: Hosting Steve McCanne   We hear a lot about BPF in the industry today, applying this flexible technology to solve so many problems from routing, proxying, and of course observability. Correlating events and data from the operating system level across distributed systems is a key problem for the industry and community to solve. I am thrilled to announce Steve McCanne joining us for this episode. I have been lucky enough to spend time with Steve in my career and am delighted to have him join us to discuss the origin stories and where these foundational technologies might be applied in the future. Steve’s Bio and background speak for themselves.   Steve McCanne is the "Coding CEO" at Brim, a small startup working on the open-source Zed Project and a new application called "Brim" that leverages Zed. Back in the days before the Web, Steve worked at the Lawrence Berkeley National Laboratory where he developed BPF, libpcap, the PCAP file format, and the tcpdump language and compiler, while also working on the Real-time Transport Protocol (RTP) for Internet video when the telcos claimed that real-time Internet communication was impossible without end-to-end virtual-circuit guarantees. (Guess who was right?) After a brief stint in academia in the late '90s, Steve crossed over to the dark side, became a tech entrepreneur, and never looked back. He has founded several startups and took his '02 company and Sharkfest's sponsor, Riverbed, public in '06.   Related Links from this Episode   The USENEX paper from 1993 which Steve and Van Jacobsen wrote together on the BPF architecture:  https://www.usenix.org/legacy/publications/library/proceedings/sd93/mccanne.pdf   Tools for Steve's new company which are open source and used in the podcast: https://github.com/brimdata/zed https://github.com/brimdata/brim   Here is the BPF repo with directions which Steve was using in the screen share: https://github.com/brimdata/zbpf   Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://forms.gle/9LDkYCmegyS5D8Li7​    Follow us on social media: Twitter: https://twitter.com/OpenObserv​​​ - Twitch: https://www.twitch.tv/openobservability​ - Website: https://openobservability.io/​​
01:02:12
November 23, 2021
SRE at Google: Planet-scale observability - OpenObservability Talks S2E05
SRE at Google: Planet-scale observability - OpenObservability Talks S2E05
Have you ever wondered how services are operated at Google’s scale? Here’s your opportunity to find out. Ramón will share how his SRE team runs Google’s identity services, and the elaborate end-to-end observability they use to achieve it with strict SLA. We’ll also get a glimpse at the birthplace of Kubernetes, OpenCensus, Dapper, Monarch and other cornerstones of today’s cloud-native DevOps and observability. Ramón Medrano Llamas (@rmedranollamas) is a staff site reliability engineer at Google, focused on user identity and authentication. He concentrates on the reliability aspects of new Google products and new features of existing products, ensuring that they meet the same high bar as every other Google service. Before joining Google in 2013, he worked at CERN developing and designing distributed systems for physics. He holds a master’s degree in computer science and is pursuing a PhD on distributed systems. We live-stream the episodes, and you’re welcome to join the stream at https://www.twitch.tv/openobservability or on YouTube Live https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg You can find all the information and propose a talk topic on openobservability.io Show Notes: scale and size of Google Identity services operation evolution from monitoring to observability telemetry collection SRE job description is changing Google Dapper Google Census operating end-to-end observability at scale flexibility vs. runbook in SRE how SRE at google different transition from monolith to MSA Linux Foundation launching a DevOps bootcamp Parca OSS launched how to intro SRE culture Resources: Dapper paper: Dapper, a Large-Scale Distributed Systems Tracing Infrastructure Borg paper: Large-scale cluster management at Google with Borg MonArch paper: Monarch: Google’s Planet-Scale In-Memory Time Series Database SRE books  Systemantics
59:59
October 27, 2021
Observability Into Your Business And FinOps - OpenObservability Talks S2E04
Observability Into Your Business And FinOps - OpenObservability Talks S2E04
Observability is becoming a common practice for DevOps teams monitoring and troubleshooting IT systems. But Observability can offer much more than that. More advanced usage of telemetry, and in particular distributed tracing and its context propagation mechanism, can uncover insights into your business performance and can help solve business and FinOps problems. On this episode of OpenObservability Talks we’ll host Yuri Shkuro, a champion of Distributed Tracing, to discuss how tracing and observability can help beyond DevOps, whether on business cases, FinOps or even software development. We’ll also catch up on the latest updates from Jaeger, the CNCF’s distributed tracing OSS project, its synergy with OpenTelemetry and more topics. Yuri is a software engineer who works on distributed tracing, observability, reliability, and performance problems; author of the book "Mastering Distributed Tracing"; creator of Jaeger, an open source distributed tracing platform and a graduated CNCF project; co-founder of the OpenTracing and OpenTelemetry CNCF projects; member of the W3C Distributed Tracing Working Group. Show Notes: why distributed tracing? tracing through async flows why is the slow adoption of tracing? instrumentation challenge for tracing adoption using context propagation for business use cases observability tooling maturity Jaeger project updates OpenTelemetry accepted to CNCF incubation Cortex and Thanos accepted to CNCF incubation Google contributing SQLCommenter project to OTel K8s v1.22 releases API Server tracing in alpha Resource: Great addition in Kubernetes v1.22 release: API Server Tracing, based on OpenTelemetry Tracing at Uber and the beginning of Jaeger project: Distributed Tracing at Uber-Scale episode OpenTelemetry becomes a CNCF incubating project Cortex accepted to CNCF incubation in August Thanos accepted to CNCF incubation in August Google Donates Sqlcommenter to OpenTelemetry Project From Distributed Tracing to APM: Taking OpenTelemetry and Jaeger Up a Level Mastering Distributed Tracing by Yuri Shkuro
01:06:47
September 19, 2021
Fluentd for logging and metrics and path forward - OpenObservability Talks S2E03
Fluentd for logging and metrics and path forward - OpenObservability Talks S2E03
In this episode, we’ll talk with industry veteran and product manager Anurag Gupta who has been working in open source observability for over 4 years. We will go into depth on his background, and how he views the ecosystem of open source. Then we will dig into the Fluentd and Fluent Bit projects and discuss some of the amazing innovations coming from this project. Learn what’s next for logging, and how a consolidated data collection plane is being driven by the Fluentd project.
46:22
August 26, 2021
Prometheus, OpenMetrics, and the CNCF Observability Ecosystem - OpenObservability Talks S2E02
Prometheus, OpenMetrics, and the CNCF Observability Ecosystem - OpenObservability Talks S2E02
The CNCF has a rich suite to address monitoring Kubernetes and cloud-native workloads. First of which is Prometheus, which is widely adopted, with great out-of-the-box compatibility with Kubernetes. But under the CNCF you can also find OpenMetrics that offers standardization of the metrics format, Thanos and Cortex which offer long-term storage for Prometheus, and other complimentary solutions and integrations.    On this episode of OpenObservability Talks we’ll host “RichiH” Hartmann and discuss the different OSS projects, the synergy between them, and the future roadmap in building the community and making CNCF a leading offering.   Richard "RichiH" Hartmann is Director of Community at Grafana Labs, Prometheus team member, OpenMetrics founder, CNCF SIG Observability chair, and other things. He also organizes various conferences, including FOSDEM, DENOG, DebConf, and Chaos Communication Congress. In the past, he made mainframe databases work, ISP backbones run, and built a datacenter from scratch. Show Notes: OpenTelemetry accepted to CNCF incubation OpenTelemetry structure OpenTelemetry community adoption OpenMetrics and Open* confusion OpenMetrics and OpenTelemetry synergy OpenMetrics updates CNCF’s Observability TAG (Technical Advisory Group) How to sync between projects on CNCF Prometheus state and roadmap Prometheus conformance program Thanos and Cortex projects how the tech stack benefits humans Grafana, Loki and Tempo projects Resources: OpenTelemetry.io OpenTelemetry status page Guide to OpenTelemetry CNCF TAG Observability Open* Explainer by RichiH OpenMetrics
01:09:58
July 25, 2021
Codeless Kubernetes Observability with eBPF - OpenObservability Talks S2E01
Codeless Kubernetes Observability with eBPF - OpenObservability Talks S2E01
Current observability practice is largely based on manual instrumentation, which creates a barrier to entry for many wishing to implement observability in their environment. This is especially true in Kubernetes environments and microservices architecture. eBPF (extended Berkeley Packet Filter) is an exciting new technology for Linux kernel level instrumentation, which bears the promise of no-code instrumentation and easier observability into Kubernetes environments (alongside other benefits for networking and security). On this episode of OpenObservability Talks we’ll host Natalie Serrino, Principal Engineer at Pixie Labs, which was recently acquired by New Relic. We’ll talk about observability in Kubernetes environments, eBPF and its use cases for observability. We’ll also talk about Pixie, the Kubernetes-native in-cluster observability platform, and the exciting news of it being open sourced and contributed these days to CNCF under Apache 2.0 license. Natalie is a Principal Engineer and Tech Lead at New Relic. She works on the Pixie auto-telemetry observability platform, which was acquired and open sourced by New Relic. She focuses primarily on Pixie’s data layer, including its query language, compiler, and query execution engine. Show Notes: challenges in k8s observability state of instrumentation automatic instrumentation eBPF overview eBPF vs. service mesh side cars Pixie project overview Pixie’s roadmap and integration plans with CNCF ecosystem Netflix engineering sharing use case of eBPF instrumenting with Istio opensearch RC1 released K8s unpredictable spend logs aren't enough, need tracing - recommended article Resources: http://www.brendangregg.com/ebpf.html https://blog.px.dev/ https://docs.px.dev/about-pixie/roadmap/ https://www.businesswire.com/news/home/20210504005480/en/New-Relic-Joins-Cloud-Native-Computing-Foundation-Governing-Board-and-is-in-the-Process-of-Contributing-Pixie-Open-Source-for-Kubernetes-Native-Observability https://netflixtechblog.com/how-netflix-uses-ebpf-flow-logs-at-scale-for-network-insight-e3ea997dca96 https://logz.io/blog/istio-instrumenting-microservices-distributed-tracing/ https://opensearch.org/blog/update/2021/06/opensearch-release-candidate-announcement/  https://thenewstack.io/tracing-why-logs-arent-enough-to-debug-your-microservices/  https://www.theregister.com/2021/06/29/kubernetes_spend_report/ Social: Twitter: https://twitter.com/OpenObserv  Twitch: https://www.twitch.tv/openobservability  Website: https://openobservability.io/
56:51
June 30, 2021
OpenSearch: The Open Source Successor of Elasticsearch? - OpenObservability Talks S1E12
OpenSearch: The Open Source Successor of Elasticsearch? - OpenObservability Talks S1E12
OpenSearch project was born out of the passion for Elasticsearch and Kibana and the desire to keep them open source in the face of Elastic’s decision to close-source them. After a couple of months of hard work led by AWS, the Beta release was announced earlier this month under Apache2 license. On this episode of OpenObservability Talks we hosted Kyle Davis, Senior Developer Advocate for OpenSearch at AWS. We talked about how OpenSearch came to be, what it took to fork Elasticsearch and Kibana, what the engineers discovered when they dug into the code, what’s planned ahead, and much more. About Kyle Davis: While being a relative newcomer to Amazon, Kyle has a long history with software development and databases. When not working, Kyle enjoys 3D printing, and getting his hand dirty in his Edmonton, Alberta-based home garden. Resources: https://github.com/opensearch-project Beta announcement Roadmap available Put the OPEN in Observability: Elasticsearch and Kibana relicensing and community chat - OpenObservability Talks S1E08
01:01:19
May 27, 2021
Diving deep into Jaeger and OpenTelemetry with Juraci Paixão Kröhling - OpenObservability Talks S1E11
Diving deep into Jaeger and OpenTelemetry with Juraci Paixão Kröhling - OpenObservability Talks S1E11
We are thrilled to have Juraci Kröhling a Software Engineer at Red Hat; CNCF, Maintainer for Jaeger, and OpenTelemetry. He will be live and in-person this month on the podcast in a discussion with Jonah Kowall who is the CTO at logz.io and contributor to Jaeger, OpenTelemetry, and OpenSearch.
56:29
April 30, 2021
Interoperability of open-source observability and new signal in the neighborhood, profiling! - OpenObservability Talks S1E10
Interoperability of open-source observability and new signal in the neighborhood, profiling! - OpenObservability Talks S1E10
Join Jonah Kowall and Bartek Plotka for a discussion on the latest happening topics on open source observability. Bartek works on many projects in open source and is Principal Software Engineer at Red Hat; CNCF SIG Observability Tech Lead. He is very active in the community as one of the leaders of Prometheus, Thanos, OpenMetrics, and many other projects.
58:47
March 29, 2021
How Much Observability Is Enough? - OpenObservability Talks S1E9
How Much Observability Is Enough? - OpenObservability Talks S1E9
The ninth of our OpenObservability Talks has Jujhar Singh, Global DevSecOps Practice Lead at The Economist. How much observability is enough? What is the investment required to achieve it? How can we drive observability in the company in a measured and pragmatic way?  This was first streamed at https://www.twitch.tv/openobservability on February 25th 2021 and the video is available at https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg The live recording of the OpenObservability talks is the last Thursday of each month, and you can join us on Twitch or YouTube Live. Show Notes: How to drive observability in your organization What is the minimum observability needed for your organization? The tech stack impact on observability needs Direct correlation between organization, product and observability How to assess your observability needs The investment involved in observability eBPF and tools for deep Linux inspection OpenSearch (Elasticsearch fork) status update OpenTelemetry’s Tracing specification reaches v1.0 Stanza contributed its logging agent to OpenTelemetry Docker was contributed to CNCF Resources: https://www.kiwico.com/ https://sre.google/sre-book/monitoring-distributed-systems/ https://devsecops.jujhar.com/observability-strategy/ https://devsecops.jujhar.com Elasticsearch/Kibana fork updates: https://discuss.opendistrocommunity.dev/c/forking-elasticsearch-kibana/50 OpenTelemetry v1.0 for Tracing: https://medium.com/opentelemetry/opentelemetry-specification-v1-0-0-tracing-edition-72dd08936978 Donating Docker Distribution to the CNCF: https://www.docker.com/blog/donating-docker-distribution-to-the-cncf/ Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/
59:49
February 28, 2021
Put the OPEN in Observability: Elasticsearch and Kibana relicensing and community chat - OpenObservability Talks S1E8
Put the OPEN in Observability: Elasticsearch and Kibana relicensing and community chat - OpenObservability Talks S1E8
The eighth of our OpenObservability Talks has Tomer Levy, CEO & Founder of Logz.io. The community is in turmoil around Elastic's announced plan to take Elasticsearch and Kibana off open source. In this episode, both Dotan and Mike have the pleasure of hosting Tomer where we discuss the recent news of Elastic moving Elasticsearch and Kibana to a dual non-OSS license - SSPL and Elastic License - and the implications that have on the open source community around it, including plans to fork Elasticsearch and Kibana, AWS announcement and more. We also talk about what Logz.io hopes to do, and how it wants the OSS to be better than ever. Tomer Levy is co-founder and CEO of Logz.io. Before founding Logz.io, Tomer was the co-founder and CTO of Intigua, and prior to that he managed the Intrusion Prevention System at CheckPoint. Tomer has an M.B.A. from Tel Aviv University and a B.S. in computer science and is an enthusiastic kitesurfer. The live streaming of the OpenObservability Talks is on the last Thursday of each month, and you can join us on Twitch or YouTube Live. Socials: Website: https://openobservability.io/   Twitter: https://twitter.com/OpenObserv   Twitch: https://www.twitch.tv/openobservability   YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg
37:57
January 28, 2021
An Observability chat with Andy Thurai - OpenObservability Talks S1E7
An Observability chat with Andy Thurai - OpenObservability Talks S1E7
The seventh of our OpenObservability Talks has Andy Thurai, Senior Analyst at GigaOM. Jonah Kowall, CTO and Andy Thurai will talk about Andy's career and journey as a vendor and an analyst. We will discuss the observability market along with APM and other aspects of monitoring. We will then dive into the open-source ecosystem and how this is changing vendor thinking. Of course, we'll also be discussing OpenTelemetry! Andy Thurai is the Founder & Principal at the FieldCTO providing content and advisory services to enterprise customers in particular on AIOps, CloudOps, AI, ML, and Observability areas. He is an accomplished IT executive, strategist, advisor and evangelist with 25+ years of experience in executive, technical and architectural leadership positions at companies such as IBM, Intel, BMC, Nortel and Oracle; he advises many start-ups, and he is a Steering Committee Member for AIOps Exchange. He has been a keynote speaker in many major conferences, as well as a host of many webcasts, podcasts and video chats. He is a regular Forbes contributor and has written 100+ articles on emerging technology topics for publications such as Forbes, AI World, VentureBeat and Wired. Andy Thurai can be reached on Twitter at @AndyThurai, or on LinkedIn. This was first streamed at https://www.twitch.tv/openobservability on December 29th and the full video is available at https://youtu.be/Hr4lGqLiMa0 The live streaming of the OpenObservability Talks is on the last Thursday of each month, and you can join us on Twitch or YouTube Live. Socials: Website: https://openobservability.io/ Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg
44:21
December 31, 2020
All Metrics Are Wrong, Some Are Useful - OpenObservability Talks S1E6
All Metrics Are Wrong, Some Are Useful - OpenObservability Talks S1E6
The sixth of our OpenObservability Talks has Avishai Ish-Shalom, Developer Advocate at ScyllaDB. We trust our metrics to show us the status of our system and where it misbehaves. But do our metrics show us what really happened? You'd be surprised how often it's not the case.  On this episode we discussed the math behind metrics, some common misconceptions, what it take to have accurate metrics, and if there even is such a thing. Avishai Ish-Shalom has served as Engineer in Residence in Aleph VC, engineering manager at Wix.com, co-founded Fewbytes and consulted many other companies on software operations, reliability, design and culture. Currently Avishai is a Developer Advocate for ScyllaDB, the open source NoSQL database. This was first streamed at https://www.twitch.tv/openobservability on November 26th and the full video is available at https://www.youtube.com/watch?v=t9hpWv7fVSk  The live streaming of the OpenObservability Talks is on the last Thursday of each month, and you can join us on Twitch or YouTube Live. Socials: Website: https://openobservability.io/ Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Links Shared in the recording: DevOps Pulse 2020 Engineer's guide to data analysis  Mature optimization handbook The art of monitoring SRE books
01:13:05
November 29, 2020
Microservices Observability for Developers and Pre-Production - OpenObservability Talks S1E5
Microservices Observability for Developers and Pre-Production - OpenObservability Talks S1E5
The fifth of our OpenObservability Talks has Michael Haberman, CTO & Co-Founder of Aspecto. There's a lot of discussion on how to achieve observability in microservices. Most of the discussion revolves around production workflows such as granular monitoring or debugging of complex environments. A much less discussed aspect is the development stage, before even reaching production. In this talk we will discuss the challenges of microservices environments, and how to leverage tracing for pre-production purposes such as improving your tests, understand dependencies and more. Michael Haberman is a veteran in the microservices space and brings significant experience as a software architect and a consultant. Michael  shared his experience on achieving observability in microservices environments in general, and during development stages in particular, with some useful tips on how to solve production issues, before they reach production. This was first streamed at https://www.twitch.tv/openobservability on October 29th and the full video is available at https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg The live recording of the OpenObservability talks is the last Thursday of each month, and you can join us on Twitch or YouTube Live. Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/ Links Shared in the recording: OpenTelemetry's Tracing Specification Reaches Release Candidate A new open source project: Promscale, An analytical platform and long-term store for Prometheus, with the combined power of SQL and PromQL AWS announcing Public Preview of AWS Distro for OpenTelemetry
58:12
October 31, 2020
Distributed Tracing at Uber-Scale - OpenObservability TalksS1E4
Distributed Tracing at Uber-Scale - OpenObservability TalksS1E4
The fourth of our OpenObservability Talks has Albert Teoh from the Jaeger project. Distributed tracing has been gaining momentum with the growing popularity of microservices. Jaeger is a popular open source tool originally developed at Uber and now part of the CNCF (Cloud Native Computing Foundation). Albert had been working at Uber for the past 2.5 years, where he got hands-on with Jaeger. On this talk Albert will share with us his experience with distributed tracing, from introducing it into new code all the way to production. Albert will explain important concepts and considerations, and will discuss common challenges and solutions in introducing distributed tracing in an existing large scale system. This was first streamed at https://www.twitch.tv/openobservability on September 24th and the full video is available at https://youtu.be/35aInRLbTQo The live recording of the OpenObservability talks is the last Thursday of each month, and you can join us on Twitch or YouTube Live.  Socials: Twitter: https://twitter.com/OpenObserv  Twitch: https://www.twitch.tv/openobservability  YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Website: https://openobservability.io/ Links Shared in the recording: why Jaeger was built by Uber Engineering: https://eng.uber.com/distributed-tracing/ Domain oriented microservice architecture: https://eng.uber.com/microservice-architecture/ Reference to resource usage attribution, from the book "Mastering Distributed Tracing" by Yuri Shkuro: https://subscription.packtpub.com/book/networking_and_servers/9781788628464/2/ch02lvl1sec28/resource-usage-attribution CNCF published the second quarterly CNCF End User Technology Radars; the topic for this Technology Radar is observability https://www.cncf.io/blog/2020/09/11/cncf-end-user-technology-radar-observability-september-2020/ Crtex & Thanos voted into CNCF incubation https://www.cncf.io/blog/2020/08/20/toc-welcomes-cortex-as-an-incubating-project/  https://www.cncf.io/blog/2020/08/19/toc-approves-thanos-from-sandbox-to-incubation/ Kubernetes v1.19 release offers structured logs for the system components https://logz.io/blog/a-practical-guide-to-kubernetes-logging/
58:50
September 30, 2020
Long-Term Metrics with M3 and Prometheus - OpenObservability Talks S1E3
Long-Term Metrics with M3 and Prometheus - OpenObservability Talks S1E3
Episode 3 of OpenObservability Talks, where we'll be hosting Matt Schallert of  Chronosphere. The topic for this episode is Long-Term Metrics with M3 and Prometheus. Prometheus is the de facto solution when getting started with metrics, supporting use cases where a single node can store the entire dataset. When it comes to scaling out workloads or storing long-term data, Prometheus offers the Remote Write and Read interfaces to integrate with other storage solutions.  In this talk, Matt will show attendees how they can scale their existing Prometheus metrics workloads with M3, an open-source metrics platform originated at Uber. Matt will discuss the background of Prometheus’s remote storage interfaces, and how they enable users to integrate with any number of systems they’re familiar with. He will demonstrate how quickly users can get an M3 stack up and running and integrate it with Prometheus. The audience will walk away with a better understanding of the challenges of effectively scaling their metrics workloads, and how they can overcome these challenges with open source tools such as M3. This was first streamed at https://www.twitch.tv/openobservability on August 27th and the full video is available at https://www.youtube.com/watch?v=XQ_yCLyAMLU Links Shared in the recording: M3 Virtual Meetup https://www.meetup.com/M3-Community/ M3 Slack Community https://bit.ly/m3slack If you have a talk or a subject you'd like to talk about send it here https://forms.gle/pTVwDMCP1fK32tAF9 Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability Website: https://openobservability.io/
53:39
August 27, 2020
A chat with Paul Bruce and Jonah Kowall about the OpenTelemetry community - OpenObservability Talks S1E2
A chat with Paul Bruce and Jonah Kowall about the OpenTelemetry community - OpenObservability Talks S1E2
Episode 2 of OpenObservability Talks, where we'll be hosting Paul Bruce of  Neotys and Jonah Kowall of Logz.io. Our topic of discussion today is OpenTelemetry and the thriving community around it. This was first streamed at https://www.twitch.tv/openobservability on July 30th and the full video is available at https://www.youtube.com/watch?v=OtxYgT-mopY If you have a talk or a subject you'd like to talk about send it here https://forms.gle/pTVwDMCP1fK32tAF9 Socials: Twitter: https://twitter.com/OpenObserv Twitch: https://www.twitch.tv/openobservability Website: https://openobservability.io/
01:11:29
July 30, 2020
Five Tips on Application Monitoring with Prometheus - OpenObservability Talks S1E1
Five Tips on Application Monitoring with Prometheus - OpenObservability Talks S1E1
Welcome to the first-ever OpenObservability Talks! This was first streamed at https://www.twitch.tv/openobservability on June 25th and the full video is available at https://www.youtube.com/watch?v=Otb57MNZfKA You'll get an introduction to the series, a talk by Jack Neely of 42 Lines on Five Tips on Application Monitoring with Prometheus, and a run-through of current events in the world of Open Source Observability and the things that affect it. If you'd like to submit a session for a later event then just drop it in the CFP here https://forms.gle/Le691dn7CU35GwKq7 Socials: Twitter: https://twitter.com/OpenObserv YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg Twitch: https://www.twitch.tv/openobservability Website: https://openobservability.io/
39:33
June 29, 2020