Skip to main content
The Engineering Side of Data

The Engineering Side of Data

By Bob Haffner

Discussions around Data Engineering
Available on
Google Podcasts Logo
Pocket Casts Logo
RadioPublic Logo
Spotify Logo
Currently playing episode

Functional Data Engineering with Sven Balnojan

The Engineering Side of DataJul 25, 2022

00:00
28:41
2023 re:Invent Data Recap with Alex DeBrie

2023 re:Invent Data Recap with Alex DeBrie

Alex Debrie and Bob Haffner recap their favorite announcements from 2023 re:Invent

#data #dataengineering #aws

Connect with Alex Twitter: @alexbdebrie Blog: alexdebrie.com Book: dynamodbbook.com Podcast: youtube.com/@SoftwareHuddle

Connect with Bob Twitter - @bobhaffner LinkedIn - linkedin.com/in/bobhaffner

Alex’s talk https://www.youtube.com/watch?v=PVUofrFiS_A

Dec 07, 202339:29
Data Quality with JP Urrutia

Data Quality with JP Urrutia

Juan Pablo (JP) Urrutia and Bob Haffner discuss Data Quality #data #dataengineering #dataquality


Connect with JP

Twitter - @the_datachef Linkedin - https://www.linkedin.com/in/jpurrutia

Substack - https://substack.com/@thedatachef

Connect with Bob

Twitter - @bobhaffner

LinkedIn - https://www.linkedin.com/in/bobhaffner


Follow the show on Twitter @EngSideOfData


Data quality camp slack community:https://join.slack.com/t/dataqualitycamp/shared_invite/zt-2452i21wm-iZjqABfR7Hr8gcsshYuh2A


Data Quality Fundamentals Book

https://www.oreilly.com/library/view/data-quality-fundamentals/9781098112035/

Oct 02, 202351:47
2023 Snowflake Summit Recap with Roy Hasson
Jul 07, 202350:37
Starting a Career in Data Engineering with Arjun Bansil

Starting a Career in Data Engineering with Arjun Bansil

Arjun Bansil and Bob Haffner chat about Ajun's start in Data Engineering #data #dataengineering Connect with Arjun Linkedin - https://www.linkedin.com/in/arjun-bansil/ Connect with Bob Twitter - @bobhaffner LinkedIn - https://www.linkedin.com/in/bobhaffner


Follow the show on Twitter

@EngSideOfData

Mar 20, 202328:53
Developer Ergonomics with Joseph Machado

Developer Ergonomics with Joseph Machado

Joseph Machado and Bob Haffner discuss the many aspects of developer ergonomics and how they relate to data engineering.  


#data #dataengineering 


Connect with Joseph 

Blog - https://www.startdataengineering.com/

Linkedin - https://www.linkedin.com/in/josephmachado1991/ 


Connect with Bob

Twitter - @bobhaffner

LinkedIn - https://www.linkedin.com/in/bobhaffner

Jan 17, 202331:47
2022 re:Invent Data Recap with Alex DeBrie

2022 re:Invent Data Recap with Alex DeBrie

Alex Debrie and Bob Haffner recap their favorite announcements from 2022 re:Invent  


#data #dataengineering #aws   


Connect with Alex

Twitter: @alexbdebrie 

Blog: alexdebrie.com 

Book: dynamodbbook.com  


Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner   


Top Announcements 

https://aws.amazon.com/blogs/aws/top-announcements-of-aws-reinvent-2022/  


Yan Cui’s Choreography vs Orchestration blog post 

https://theburningmonk.com/2020/08/choreography-vs-orchestration-in-the-land-of-serverless/

Dec 07, 202245:54
Data Lakes, Data Lakehouses and Data Warehouses with Eric Tolotti

Data Lakes, Data Lakehouses and Data Warehouses with Eric Tolotti

Join Eric Tolotti and Bob Haffner for a discussion about Data Lakes, Data Lakehouses and Data Warehouses


#data #dataengineering #datalake #datalakehouse #datawarehouses


Check out Eric's YT channel

https://www.youtube.com/c/nullQueries


Connect with Bob

Twitter - @bobhaffner

LinkedIn - linkedin.com/in/bobhaffner


Show notes

nullQueries’ Intro to the Data LakeHouse

https://www.youtube.com/watch?v=yDXgvsnmUCs


nullQueries’ Do you Need a Data Warehouse

https://www.youtube.com/watch?v=K12lMYE3k0s


nullQueries’  Dealing With Bad Analytics Requirements

https://www.youtube.com/watch?v=8Td0A50cWzw

Nov 28, 202234:34
The Open Data Lakehouse with Alex Merced

The Open Data Lakehouse with Alex Merced

Join Alex Merced and Bob Haffner for a discussion about the Open Data Lakehouse concept  


#data #dataengineering #datalake #datalakehouse   


Connect with Alex 

Twitter - @amdatalakehouse  


Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner   


Show notes  


The DataNation Podcast  

Available on iTunes/Spotify/Stitcher  


The Subsurface Data Lakehouse Community 

dremio.com/subsurface  


Dremio 

dremio.com  


Follow the podcast on Twitter @EngSideOfData

Aug 17, 202232:02
Functional Data Engineering with Sven Balnojan

Functional Data Engineering with Sven Balnojan

Sven Balnojan and Bob Haffner discuss the goals and principles of Functional Data Engineering  

#data #dataengineering #functionaldataengineering  


Connect with Sven 

Twitter - @sbalnojan 

LinkedIn - linkedin.com/in/dr-sven-balnojan-a55b4072 

Blog - https://medium.com/three-data-point-thursday  


Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner  


Follow the podcast on Twitter @EngSideOfData  


Show Notes  

https://github.com/sbalnojan/easy-functional-data-engineering  

http://svenbalnojan.com/  

https://maximebeauchemin.medium.com/functional-data-engineering-a-modern-paradigm-for-batch-data-processing-2327ec32c42a  

Infrastructure as Code by Keif Morris 

https://infrastructure-as-code.com/book/

Jul 25, 202228:41
Data Catalogs with Michael Meyer

Data Catalogs with Michael Meyer

Michael Meyer and Bob Haffner chat about data catalogs  


#data #dataengineering #datacatalogs 


Connect with Mike 

Twitter - @dataguyatheart 

LinkedIn - linkedin.com/in/michael-meyer-6972286  


Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner  


Follow the podcast on Twitter @EngSideOfData

Jul 18, 202239:58
AWS Glue with Johnny Chivers

AWS Glue with Johnny Chivers

Johnny and Bob discuss Glue the serverless ETL tool from AWS  


#dataengineering #etl #aws   


Connect with Johnny  


Youtube Channel 

https://www.youtube.com/c/JohnnyChivers  


The QuestionBank - Completely free Bank of community Questions AWS Certifications developed on AWS Amplify 

https://www.thequestionbank.io/  


Personal Website for Contact, forum and free AWS learning resource 

https://johnnychivers.co.uk/  



Connect with Bob  


Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner


https://twitter.com/EngSideOfData

Jun 22, 202258:26
Data Engineering at the Edge with Mário Pereira

Data Engineering at the Edge with Mário Pereira

Mário Pereira and Bob Haffner chat about Data Engineering at the Edge  


#data #dataengineering #iot #edge #edgecomputing   


Connect with Mário

LinkedIn - linkedin.com/in/xmariopereira/  


Connect with Bob Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner  


Vopak is hiring (https://www.werkenbijvopak.nl/vacatures?lastchecked=discipline&discipline=9)

Apr 26, 202246:17
Careers and Community with Adi Polak

Careers and Community with Adi Polak

Adi Polak and Bob Haffner host a Twitter Spaces conversation on Careers and Community in Data Engineering.  


#data #dataengineering 


Connect with Adi 

Twitter - @AdiPolak  


Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner  


Join the Community - https://lakefs.io/community/

Mar 18, 202253:26
Data Vault Part 2 with Cindi Meyersohn

Data Vault Part 2 with Cindi Meyersohn

Cindi, Mike and Bob are back with the sequel to their 2021 Data Vault conversation.  


Check out the upcoming World Wide Data Vault Consortium! 

https://wwdvc.com/ 


Part 1 of this discussion

https://www.youtube.com/watch?v=4U9HcFS93tM


Connect with Cindi 

Twitter - @Data_Rebels  


Connect with Mike

Twitter - @dataguyatheart 

LinkedIn - linkedin.com/in/michael-meyer-6972286  


Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner  


Show notes  

Learn More: https://www.datavaultalliance.com - THE Standards Board for DV2 and Certification Education  

https://www.datarebels.com - Authorized Training Partner, CDVP2 Certified Instruction, and Consulting Services  

https://www.danLinstedt.com - Extensive Blogs by Daniel Linstedt, Data Vault Inventor and Found  

https://kentgraziano.com/ - Kent Graziano's Data Warrior Blogs  

https://twitter.com/dlinstedt 

Participation and Getting Involved: 

https://www.wwdvc.com - Annual Data Vault Conference 

https://www.meetup.com/data-vault-north-american-user-group - Data Vault North American User' Group 

https://www.linkedin.com/groups/9050802/ - Data Vault North American Users' Group LinkedIn Profile  

Certified DV2 Training and Education: 

https://datarebels.com/get-schooled/ 

https://datavaultalliance.com/certification/authorized-trainers/ 

https://datavaultalliance.com/event-directory/ 

https://www.arihovi.com/en/courses/data-vault-2-0-bootcamp-certification/ 


Other Resources: 

Building a Scalable Data Warehouse with Data Vault 2.0 Published by Elsevier Paperback ISBN: 9780128025109 eBook ISBN: 9780128026489 

Amazon:  https://www.amazon.com/Building-Scalable-Data-Warehouse-Vault/dp/0128025107/ref=sr_1_1?crid=335M6BS7BRLQA&dchild=1&keywords=building+a+scalable+data+warehouse+with+data+vault+2.0&qid=1633796921&sprefix=building+a+scala%2Caps%2C148&sr=8-1


#data. #dataengineering #datavault

Feb 28, 202201:04:12
Data Engineering for Data Discovery with Brian McMillan

Data Engineering for Data Discovery with Brian McMillan

Brian McMillan and Bob Haffner talk about Data Engineering for Data Discovery  

#data #dataengineering #elt #etl #analysticsengineering 

Connect with Brian 

Like to discuss this further? First 5 people to schedule a meeting with me and receive a free copy of the book (https://calendly.com/building-data-products).  

Purchase the book: https://www.minimumviablearchitecture.com   

Brian McMillan – Brian [at] minimumviablearchitecture.com 

Twitter – @brianmcmillan01 

LinkedIn – linkedin.com/in/brianmcmillan01 

GitHub – https://github.com/brianmcmillan/intro_to_data_and_analytics_engineering

Schedule a discussion – https://www.minimumviablearchitecture.com/contact.html 

The Book: Building Data Products: Introduction to Data and Analytics Engineering for non-programmers – https://www.minimumviablearchitecture.com  

Full Length Demo - https://www.youtube.com/watch?v=xLjQhz1KJoQ&t=4s 

Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner   


Links: 3X - Kent Beck – https://www.youtube.com/watch?v=FlJN6_4yI2A

Why Use Make (Mike Bostock) – https://bost.ocks.org/mike/make/

SQLite – https://www.sqlite.org/index.htm, https://blog.wesleyac.com/posts/consider-sqlite

Building Production Applications Using Go & SQLite – https://www.youtube.com/watch?v=XcAYkriuQ1o) 

CSVkit – https://csvkit.readthedocs.io/en/latest/

SQLite-Utils – https://sqlite-utils.datasette.io/en/stable/

Datasette – https://datasette.io

Vega Lite –https://vega.github.io/vega-lite/

Feb 01, 202201:11:02
ETL vs ELT with Dan Silberman

ETL vs ELT with Dan Silberman

Dan Silberman and Bob Haffner discuss the traditional approach of Extract, Transform and Load (ETL) vs Extract, Load and Transform (ELT).   What each one means, why has ELT become so popular and how ELT will evolve.

#data #dataengineering #elt #etl #analysticsengineering 

Check out Mozart Data 

https://www.mozartdata.com/ 


Connect with Bob

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner

Dec 21, 202128:51
2021 re:Invent Data Recap with Alex DeBrie

2021 re:Invent Data Recap with Alex DeBrie

Alex DeBrie joins Bob Haffner to recap the 2021 re:Invent from the data perspective  


#data #dataengineering 


Connect with Alex 

Twitter: @alexbdebrie 

Blog - alexdebrie.com 

Book - dynamodbbook.com   


Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner

Dec 08, 202135:41
The State of Streaming with Arjun Narayan

The State of Streaming with Arjun Narayan

Arjun Narayan and Bob Haffner discuss the state of streaming.  

Connect with Arjun

Twitter - @narayanarjun 

LinkedIn - linkedin.com/in/arjunravinarayan/  


Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner  


Show Notes 

Materialize website - materialize.com 

Materialize Twitter - @MaterializeInc 

Materialize LinkedIn - https://www.linkedin.com/company/materializeinc

Kafka is not a database blog post - https://materialize.com/kafka-is-not-a-database/

Streaming SQL blog post - https://materialize.com/streaming-sql-intro/ 


#data #dataengineering #streaming

Nov 10, 202127:33
Data Vault with Cindi Meyersohn

Data Vault with Cindi Meyersohn

Cindi Meyersohn joins Michael Meyer and Bob Haffner for a discussion about Data Vault.    


Connect with Cindi 

Twitter - @Data_Rebels  

Connect with Mike 

Twitter - @dataguyatheart 

LinkedIn - linkedin.com/in/michael-meyer-6972286  

Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner   

Show notes 

 Learn More: 

https://www.datavaultalliance.com - THE Standards Board for DV2 and Certification Education  

https://www.datarebels.com - Authorized Training Partner, CDVP2 Certified Instruction, and Consulting Services  

https://www.danLinstedt.com - Extensive Blogs by Daniel Linstedt, Data Vault Inventor and Found  

https://kentgraziano.com/ - Kent Graziano's Data Warrior Blogs  

https://twitter.com/dlinstedt  

Participation and Getting Involved: 

https://www.wwdvc.com - Annual Data Vault Conference 

https://www.meetup.com/data-vault-north-american-user-group - Data Vault North American User' Group 

https://www.linkedin.com/groups/9050802/ - Data Vault North American Users' Group LinkedIn Profile  


Certified DV2 Training and Education: 

https://datarebels.com/get-schooled/

https://datavaultalliance.com/certification/authorized-trainers/ 

https://datavaultalliance.com/event-directory/ 

https://www.arihovi.com/en/courses/data-vault-2-0-bootcamp-certification/ 


Other Resources: 

Building a Scalable Data Warehouse with Data Vault 2.0 Published by Elsevier Paperback ISBN: 9780128025109 eBook ISBN: 9780128026489 

Amazon: 

 https://www.amazon.com/Building-Scalable-Data-Warehouse-Vault/dp/0128025107/ref=sr_1_1?crid=335M6BS7BRLQA&dchild=1&keywords=building+a+scalable+data+warehouse+with+data+vault+2.0&qid=1633796921&sprefix=building+a+scala%2Caps%2C148&sr=8-1

Oct 11, 202101:11:53
Geographic Information Systems(GIS) with Caleb Welchans

Geographic Information Systems(GIS) with Caleb Welchans

Caleb Welchans and Bob Haffner discuss GIS.  GIS is a topic that everyone in data should be familiar with.    

Connect with Caleb 

Twitter - @calebwelchans 

LinkedIn - linkedin.com/in/welchans       

Intro/Outro music by John Yasut  

#data #dataengineering #gis 

Connect with Bob 

Twitter - @bobhaffner 

LinkedIn - linkedin.com/in/bobhaffner  

Please Like, Subscribe and Comment!

Sep 26, 202129:36
Building Data Lakes in AWS with Johnny Chivers

Building Data Lakes in AWS with Johnny Chivers

Johnny Chivers and Bob Haffner discuss Data Lakes on AWS.


Connect with Johnny!  

Youtube Channel 

The QuestionBank - Completely free Bank of community Questions AWS Certifications developed on AWS Amplify   

Personal Website - Contact, forum and free AWS learning resource   


Please Like, Subscribe and Comment!

Sep 02, 202157:02
Data Pipeline Testing with Bartosz Mikulski

Data Pipeline Testing with Bartosz Mikulski

I'm joined by Bartosz Mikulski for this discussion involving Data Pipeline Testing.


Check out the video version on YT 

https://www.youtube.com/watch?v=RebyV0lJ_Aw


Connect with Bartosz

LinkedIn | Twitter | Blog


Intro/Outro music by John Yasut

Aug 04, 202126:08
Analytics Engineering with Kelly Burdine

Analytics Engineering with Kelly Burdine

I'm joined by Kelly Burdine for this discussion involving Analytics Engineering.


Kelly Burdine

https://www.linkedin.com/in/kellyburdine/


Intro/Outro music by John Yasut

Jul 15, 202139:12
The Rise of the Cloud Data Warehouse with Michael Meyer
Jun 23, 202131:09
Intro to The Engineering Side of Data

Intro to The Engineering Side of Data

A podcast about Data Engineering and the edges of this critical and ever-changing domain


Intro/Outro by John Yasut

May 09, 202101:02