DataTalks.Club

By DataTalks.Club
DataTalks.Club - the place to talk about data!
Advancing Big Data Analytics: Post-Doctoral Research - Eleni Tzirita Zacharatou
We talked about: Eleni’s background Spatial data analytics Responsibilities of a postdoc Publishing papers Best places for data management papers Differences between postdoc and PhD Helping students become successful Research at the DIMA group Identifying important research directions Reviewing papers Underrated topics in data management Research in data cleaning Collaborating with others Choosing the field for Master’s students Choosing the topic for a Master thesis Should I do a PhD? Promoting computer science to female students Links: https://www.user.tu-berlin.de/tzirita/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:00:45
December 3, 2021
Becoming a Data Product Manager - Sara Menefee
We talked about: Sara’s background Product designer’s responsibilities Data product manager’s responsibilities Planning with the team Design thinking and product design Data PMs vs regular PMs Skill requirements for Data PMs Going from a product designer to a data product manager Case studies Resources for learning about product management Data PM’s biggest challenge Multitasking and context switching Insights from user interviews Using new, unfamiliar tools Documentation Idea generation Do Data PMs need to know ML? Links: Product Management Courses: https://www.lennyrachitsky.com/course and https://www.reforge.com/mastering-product-management Product Management Reading: https://svpg.com/inspired-how-to-create-products-customers-love/ and https://steveblank.com/category/customer-development/ Data Engineering for Noobs: https://www.datacamp.com/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
59:02
November 26, 2021
Data Science Manager vs Data Science Expert - Barbara Sobkowiak
We talked about: Barbara’s background Do you need a manager or an expert? Technical and non-technical requirements for managers Importance of technical skills for managers Responsibilities and skills of a manager Importance of technical background for managers Getting involved in business development and sales Developing the team Checking team’s work Data science expert Hiring experts Who should we hire first? Can an expert build a team? Data science managers in startups Project management Ensuring that projects provide value Questions before starting a project Women in data science Finding Barbara online General advice Link: Barbara's LinkedIn: https://www.linkedin.com/in/barbara-sobkowiak-1a4a9568 Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
59:39
November 19, 2021
Ace Non-Technical Data Science Interviews - Nick Singh
We talked about: Nick’s background Being a career coach Overview of the hiring process Behavioral interviews for data scientists Preparing for behavioral interviews Handling "tricky" questions Project deep dive Business context Pacing, rambling, and honesty “What’s your favorite model?” What if I haven’t worked on a project that brought $1 mln? Different questions for different levels Product-sense interviews Identifying key metrics in unfamiliar domains Tech blogs Cold emailing Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:01:45
November 12, 2021
Becoming a Solopreneur in Data - Noah Gift
We talked about: Noah’s background Solopreneurship A day of a solopreneur Exponential vs linear work Escaping the office work - digging the tunnel Structuring goals Staying motivated Publishing books Planning out books Writing a book is like preparing to run a marathon Distributed income Getting started as a solopreneur Lowering expenses and adding time The right time to quit full-time Building a network Teaching at universities Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
59:19
November 5, 2021
Building Business Acumen for Data Professionals - Thom Ives
Links: https://join.slack.com/t/integratedmlai/shared_invite/zt-r3hpj44k-gfhf1pzIt3jixrATyXCWnQ https://www.linkedin.com/in/thomives/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:05:30
October 29, 2021
Conquering the Last Mile in Data - Caitlin Moorman
We talked about: Caitlin’s background The last mile in data The Pareto Principle Failing to use data Making sure data is used Communicating with decision-makers Working backwards from the last mile Understanding how data drives decisions Sketching and prototyping Showing the benefits of power data Measurability Driving change in data Asking high-leverage questions Resistance from users Understanding domain experts Linear projects vs circular projects Recommendations for data analyst students Finding Caitlin online Links: Emelie's talk https://locallyoptimistic.com/post/linear-and-circular-projects-part-1/ https://locallyoptimistic.com/post/linear-and-circular-projects-part-2/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:02:02
October 22, 2021
Similarities and Differences between ML and Analytics - Rishabh Bhargava
We talked about: Rishabh's background Rishabh’s experience as a sales engineer Prescriptive analytics vs predictive analytics The problem with the term ‘data science’ Is machine learning a part of analytics? Day-to-day of people that work with ML Rule-based systems to machine learning The role of analysts in rule-based systems and in data teams Do data analysts know data better than data scientists? Data analysts’ documentation and recommendations Iterative work - data scientists/ML vs data analysts Analyzing results of experiments Overlaps between machine learning and analytics Using tools to bridge the gap between ML and analytics Do companies overinvest in ML and underinvest in analytics? Do companies hire data scientists while forgetting to hire data analysts? The difficulty of finding senior data analysts Is data science sexier than data analytics? Should ML and data analytics teams work together or independently? Building data teams Rishabh’s newsletter – MLOpsRoundup Links: https://mlopsroundup.substack.com/ https://twitter.com/rish_bhargava Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
59:39
October 15, 2021
Building and Leading Data Teams - Tammy Liang
We talked about: Tammy’s background Being the chief of data First projects as the first data person in a company Initial resistance Expanding the team Role of business analyst Platanomelon’s stack Order for growing the data team Demand forecasting Should analysts know machine learning Qualifications for the first data person in a company Providing accurate results Receiving insights in a timely manner Providing useful insights Giving ownership to the team Starting as the first data person in a company Data For Future podcast Supporting team members that are stuck Finding Tammy online Links:  Tammy's podcast: https://dataforfuture.org/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
59:10
October 8, 2021
What Researchers and Engineers Can Learn from Each Other - Mihail Eric
We talked about: Mihail’s background NLP and self-driving vehicles Transitioning from academia to the industry Machine learning researchers Finding open-ended problems Machine learning engineers Is data science more engineering or research? What can engineers and researchers learn from one another? Bridging the disconnect between researchers and engineers Breaking down silos Fluid roles Full-stack data scientists Advice to machine learning researchers Advice to machine learning engineers Reading papers Choosing between engineering or research if you’re just starting Confetti.ai Links: https://twitter.com/mihail_eric http://confetti.ai/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:01:44
October 1, 2021
Introducing Data Science in Startups - Marianna Diachuk
We talked about: Marianna’s background Being the only data scientist What should already be in the company How much experience do you need Identifying problems Prioritization What should the company already know? First week First month First quarter Managing expectations Solving problems without ML Project timelines Finding the best solution Evaluating performance Getting stuck Communicating with analysts Transitioning from engineering to data science Growing the team Stopping projects Questions for the company From research to production Wrapping up Links: Marianna's LinkedIn: https://www.linkedin.com/in/marianna-diachuk-53ba60116/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
58:33
September 24, 2021
Defining Success: Metrics and KPIs - Adam Sroka
We talked about: Adam’s background Adam’s laser and data experience Metrics and why do we care about them Examples of metrics KPIs KPI examples Derived KPIs Creating metrics — grocery store example Metric efficiency North Star metrics Threshold metrics Health metrics Data team metrics Experiments: treatment and control groups Accelerate metrics and timeboxing Links: Domino's article about measuring value: http://blog.dominodatalab.com/measuring-data-science-business-value Adam's article about skills useful for data scientists: https://towardsdatascience.com/how-to-apply-your-hard-earned-data-science-skillset-812585e3cc06 Adam's article about standing out: https://towardsdatascience.com/how-to-stand-out-as-a-great-data-scientist-in-2021-3b7a732114a9 Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:02:51
September 17, 2021
Making Sense of Data Engineering Acronyms and Buzzwords - Natalie Kwong
We talked about: Natalie’s background Airbyte What is ETL? Why ELT instead of ETL? Transformations How does ELT help analysts be more independent? Data marts and Data warehouses Ingestion DB ETL vs ELT Data lakes Data swamps Data governance Ingestion layer vs Data lake Do you need both a Data warehouse and a Data lake? Airbyte and ELT Modern data stack Reverse ETL Is drag-and-drop killing data engineering jobs? Who is responsible for managing unused data? CDC – Change Data Capture Slowly changing dimension Are there cases where ETL is preferable over ELT? Why is Airbyte open source? The case of Elasticsearch and AWS Links: Natalie's LinkedIn: https://www.linkedin.com/in/nataliekwong/ https://airbyte.io/blog/why-the-future-of-etl-is-not-elt-but-el Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:00:21
September 11, 2021
Mastering Algorithms and Data Structures - Marcello La Rocca
We talked about: Learning algorithms and data structures Resources for learning algorithms and data structures Most important data structures Learning the abstractions Learning algorithms if they aren’t needed at work Common mistakes when using wrong data structures Importance of data structures for data scientists Marcello’s book - Advanced Algorithms and Data Structures Bloom filters Where Bloom filters are useful Approximate nearest neighbours Searching for most similar vectors Knowing frameworks vs knowing internals of data structures Serializing Bloom filters Algorithmic problems in job interviews Important data structures for data scientists and data engineers Learning by doing Importance of compiled languages for data scientists Links: Marcello's book: Advanced Algorithms and Data Structures http://mng.bz/eP79 (promo code for 35% discount: poddatatalks21) MIT, Introduction to Algorithms: https://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-006-introduction-to-algorithms-fall-2011/ Algorithms specialization by Tim Roughgarden: https://www.coursera.org/specializations/algorithms Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:02:11
September 3, 2021
Chief Data Officer - Marco De Sa
We talked about: Marco’s background Role of CDO Keeping track of many things Becoming a CDO Strategy vs tactics VP of Data vs CDO How many VPs of Data could be there? Splitting the work between VP and CDO Difference between CTO, CPO, and CDO Breaking down the goals and working backwards from them Assessing if we’re moving in the right direction Dealing with many meetings Being more effective Building the data-driven culture Challenges of working remotely Does CDO need deep technical skills? Importance of MBA The key skills for becoming a CDO Biggest challenges within OLX so far Demonstrating the CDO skills on a job interview Overcoming resistance Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:01:55
August 27, 2021
Freelancing in Machine Learning - Mikio Braun
We talked about: Mikio’s background What Mikio helps with Moving from a full-time job to freelancing Finding clients and importance of a strong network Building a network Initial meetings with clients Understanding what clients need Template for the offer (Million dollar consulting) Deciding on rate type: hourly, daily, per project Taking vacations (and paying twice for them) Avoiding overworking Specializing: consulting as a product Working full-time as a principal vs being a consultant Is the overhead worth it? Getting a new client when you already have a project After freelancing: what’s next? Output of Mikio’s work Learning new things Lessons learned after finding clients Registering as a freelancer in Germany Personal liability of a freelancer Effect of globalization and remote work on consulting Advice for people who want to start freelancing Working full-time and freelancing at the same time Books: Million Dollar Consulting by Alan Weiss Built to Sell by John Warrillow Links: Mikio's Twitter: https://twitter.com/mikiobraun Mikio's LinkedIn: https://www.linkedin.com/in/mikiobraun/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:02:05
August 20, 2021
Launching a Startup: From Idea to First Hire - Carmine Paolino
We talked about: Carmine’s background Carmine’s startup FreshFlow Doing user research Design thinking Entrepreneur first Finding co-founders: the “expertise edges” framework The structure of the EF program Coming up with the idea How important is going through a startup accelerator? Finding your first client Finding investors Consequences of having a bad investor Splitting responsibilities between co-founders Hiring The importance of delegating Making work attractive to hires Plans for the future Just-in-time supply chain What would you have done differently? Advice for people starting a startup Don’t focus on skills only Getting motivation Am I ready for a startup? Importance of a business school Advice on finding a co-founder Do I need EF if I already have an idea? Having a prototype before the pitch Books: The Mom Test by Rob Fitzpatrick Design Thinking by Robert Curedale Links: FreshFlow: https://freshflow.ai/ Carmine's LinkedIn: https://www.linkedin.com/in/carminepaolino Carmine's Twitter: https://twitter.com/paolino Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:07:28
August 13, 2021
Approach Learning as ML Project - Vladimir Finkelshtein [mini]
We don't have an episode lined up for this week, but we recorded a small chat with Vladimir some time ago. Enjoy it! We talked about: Vladimir's background Learning by answering questions Don't be afraid of being wrong Winning books Learning random things Approach learning as a machine learning project Links: Vladimir on LinkedIn: https://www.linkedin.com/in/vladimir-finkelshtein/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
13:56
August 6, 2021
Humans in the Loop - Lina Weichbrodt
We talked about: Lina’s background What we need to remember when starting a project (checklists) Make sure the problem is formalized and close to the core business Get the buy-in with stakeholders Building trust with stakeholders Don’t just focus on upsides – ask about concerns Turning a concern into a metric What happens when something goes wrong? Post mortem reporting Apply the 5 why’s If a lot of users say it’s a bug – it’s worth investigating Post mortem format Action points Debugging vs explaining the model Are there online versions of checklists? Make sure to log your inputs Talking to end-users and using your own service Your ideas vs Stakeholder ideas Should data practitioners educate the team about data? People skills and ‘dirty’ hacks Where to find Lina Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
57:55
July 30, 2021
Running from Complexity - Ben Wilson
We talked about: Ben’s background Building solutions for customers Why projects don’t make it to production Why do people choose overcomplicated solutions? The dangers of isolating data science from the business unit The importance of being able to explain things Maximizing chances of making it into production The IKEA effect Risks of implementing novel algorithms If it can be done simply – do that first Don’t become the guinea pig for someone’s white paper The importance of stat skills and coding skills Structuring an agile team for ML work Timeboxing research Mentoring Ben’s book ‘Uncool techniques’ at AI-First companies Should managers learn data science? Do data scientists need to specialize to be successful? Links: Ben's book: https://www.manning.com/books/machine-learning-engineering-in-action (get 35% off with code "ctwsummer21") Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:11:43
July 23, 2021
I Want to Build a Machine Learning Startup! - Elena Samuylova
We talked about: Elena’s background Why do a startup instead of being an employee? Where to get ideas for your startup Finding a co-founder What should you consider before starting a startup? Vertical startup vs infrastructure startup ‘AI First’ startups Building tools for engineers What skills do you need to start a startup? Startup risks How to be prepared to fail Work-life balance The part-time startup approach Startup investment models No resources and no technical expertise – what to do? Productionizing your services When to hire an expert Talking to people with a problem before solving the problem Starting Elena’s startup, Evidently Elena’s role at Evidently Why is Evidently open source? “People will just copy my open source code. Should I be concerned?” Bottom-up adoption Creating value so that clients engage with your product Is there a difference between countries when creating a startup? Does open source mean the data is safer? When should you hire engineers? Following the market Startups out of genuine interest vs Just for money and for fun Links: EvidentlyAI: https://evidentlyai.com/ Elena's LinkedIn: https://www.linkedin.com/in/elenasamuylova/ Elena's Twitter: https://twitter.com/elenasamuylova/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
58:26
July 16, 2021
Big Data Engineer vs Data Scientist - Roksolana Diachuk
Links: Twitter: https://twitter.com/dead_flowers22 LinkedIn: https://www.linkedin.com/in/roksolanadiachuk/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:01:30
July 9, 2021
Build Your Own Data Pipeline - Andreas Kretz
We talked about: Andreas’s background Why data engineering is becoming more popular Who to hire first – a data engineer or a data scientist? How can I, as a data scientist, learn to build pipelines? Don’t use too many tools What is a data pipeline and why do we need it? What is ingestion? Can just one person build a data pipeline? Approaches to building data pipelines for data scientists Processing frameworks Common setup for data pipelines — car price prediction Productionizing the model with the help of a data pipeline Scheduling Orchestration Start simple Learning DevOps to implement data pipelines How to choose the right tool Are Hadoop, Docker, Cloud necessary for a first job/internship? Is Hadoop still relevant or necessary? Data engineering academy How to pick up Cloud skills Avoid huge datasets when learning Convincing your employer to do data science How to find Andreas Links: LinkedIn: https://www.linkedin.com/in/andreas-kretz Data engineering cookbook: https://cookbook.learndataengineering.com/ Course: https://learndataengineering.com/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:01:53
July 2, 2021
From Software Engineering to Machine Learning - Santiago Valdarrama
We talked about: Santiago’s background “Transitioning to ML” vs “Adding ML as a skill” Getting over the fear of math for software developers Learning by explaining Seven lessons I learned about starting a career in machine learning Lesson 1 – Take the first step Lesson 2 – Learning is a marathon, not a sprint Lesson 3 – If you want to go quickly, go alone. If you want to go far, go together. Lesson 4 – Do something with the knowledge you gain Lesson 5 – ML is not just math. Math is not scary. Lesson 6 – Your ability to analyze a problem is the most important skill. Coding is secondary. Lesson 7 – You don’t need to know every detail Tools and frameworks needed to transition to machine learning Problem-based learning vs Top-down learning Learning resources Santiago’s favorite books Santiago’s course on transitioning to machine learning Improving coding skills Building solutions without machine learning Becoming a better engineer What is the difference between machine learning and data science? Getting into machine learning - Reiteration Getting past the math Links: Santiago's Twitter: https://twitter.com/svpino Santiago's course: https://gumroad.com/svpino#kBjbC Pinned tweet with a roadmap: https://twitter.com/svpino/status/1400798154732212230 Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
59:43
June 25, 2021
Analytics Engineer: New Role in a Data Team - Victoria Perez Mola
Links: https://www.notion.so/Analytics-Engineer-New-Role-in-a-Data-Team-9decbf33825c4580967cf3173eb77177 https://www.linkedin.com/in/victoriaperezmola/ Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html Conference: https://datatalks.club/conferences/2021-summer-marathon.html
59:55
June 18, 2021
Data Governance - Jessi Ashdown, Uri Gilad
We talked about: Jessi’s background Uri’s background Data governance Implementing data governance: policies and processes Reasons not to have data governance Start with “why” Cataloging and classifying our data Let data work for you The human component Data quality Defining policies Implementing policies Shopping-cart experience for requesting data Proving the value of data catalog Using data catalog Data governance = data catalog? Links: Book: https://www.oreilly.com/library/view/data-governance-the/9781492063483/ Jessi’s LinkedIn: https://www.linkedin.com/in/jashdown/ Uri’s LinkedIn: https://linkedin.com/in/ugilad Uri’s Twitter: https://twitter.com/ugilad Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html Conference: https://datatalks.club/conferences/2021-summer-marathon.html
57:59
June 11, 2021
What Data Scientists Don’t Mention in Their LinkedIn Profiles - Yury Kashnitsky
We talked about: Yury’s background Failing fast: Grammarly for science Not failing fast: Keyword recommender Four steps to epiphany Lesson learned when bringing XGBoost into production When data scientists try to be engineers Joining a fintech startup: Doing NLP with thousands of GPUs Working at a Telco company Having too much freedom The importance of digital presence Work-life balance Quantifying impact of failing projects on our CVs Business trips to Perm: don’t work on the weekend What doesn’t kill you makes you stronger Links: Yury's course: https://mlcourse.ai/ Yury's Twitter: https://twitter.com/ykashnitsky Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
59:56
June 4, 2021
Becoming a Data-led Professional - Arpit Choudhury
We talked about: Data-led academy Arpit’s background Growth marketing Being data-led Data-led vs data-driven Documenting your data: creating a tracking plan Understanding your data Tools for creating a tracking plan Data flow stages Tracking events — examples Collecting the data Storing and analyzing the data Data activation Tools for data collection Data warehouses Reverse ETL tools Customer data platforms Modern data stack for growth Buy vs build People we need in the data flow Data democratization Motivating people to document data Product-led vs data-led Links: https://dataled.academy/ Join our Slack: https://datatalks.club/slack.html
01:00:20
May 28, 2021
How to Market Yourself (without Being a Celebrity) - Shawn Swyx Wang
We talked about: Shawn’s background and his book Marketing ourselves Components of personal marketing Personal brand for an average developer Picking a domain: what to write about? Being too niche Finding a good niche Learning in public Borrowed platforms vs own platform Starting on social media: Picking what they put down Career transitioning: mutual exchange of value Personal marketing for getting a new job Getting hired through the back door Finding content ideas Marketing yourself in public — summary Open-source knowledge Internal marketing: promoting ourselves at work Signature initiative Public speaking Wrapping up Discount for the coding career book 75% of the engineering ladder criteria are not technical Links: Shawn's personal page: https://www.swyx.io/ Twitter: https://twitter.com/swyx Book of the week page: https://datatalks.club/books/20210510-the-coding-career-handbook.html (with a discount for DTC members!) Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html
01:02:57
May 21, 2021
From Physics to Machine Learning - Tatiana Gabruseva
We talked about: Tatiana’s background 12 career hacks and changing career Hack #1: Change your social circle Hack #2: Forget your fears and stereotypes Hack #3: Forget distractions Hack #4: Don’t overestimate others and don’t underestimate yourself Hack #5: Attention genius Hack #6: Make a team Hack #7: Less is more. Forget about perfectionism Hack #8: Initial creation Hack #9: Find mentors Hack #10: Say “no” Hack #11: Look for failures Hack #12: Take care of yourself Kaggle vs internships and pet projects Resources for learning machine learning Starting with Kaggle Improving focus Astroinformatics How background in Physics is helpful for transitioning Leaving academia Preparing for interviews Links: Mock interviews: https://www.pramp.com/ Learning ML: https://www.coursera.org/learn/machine-learning and https://www.coursera.org/specializations/deep-learning Python: https://www.coursera.org/learn/machine-learning-with-python  SQL: https://www.sqlhabit.com/  Practice: https://www.kaggle.com/ MIT 6.006: https://courses.csail.mit.edu/6.006/fall11/notes.shtml Coding: https://leetcode.com/ System design: https://www.educative.io/courses/grokking-the-system-design-interview Ukrainian telegram groups for interview preparation: https://t.me/FaangInterviewChannel,  https://t.me/FaangTechInterview, https://t.me/FloodInterview Join DataTalks.Club: https://datatalks.club/slack.html
01:06:33
May 14, 2021
What I Learned After Interviewing 300 Data Scientists - Oleg Novikov
We talked about: Oleg’s background Standing out in recruitment process NextRound — a service for free mock interviews Why rejections are generic Starting NextRound — preparing a list of situations Steps in the interview process Read the job description! CV is your landing page Take-home assignments Questions about your past experience Hypothetical case questions Technical rounds Handling rejections What to do after receiving an offer? Do recruiters pay attention to age? Getting a job with a PhD — it’s a cold start problem Should I answer rejection emails? Negotiating when my salary is low Should I apply for jobs that require 5 years of experience? Tricking applicant tracking systems What else Oleg learned after interviewing 300 data scientists How a horse's ass determined the design of a space shuttle Links: Oleg's service for interviews: https://nextround.cc/ LinkedIn: https://www.linkedin.com/in/olegnovikov/ Join DataTalks.Club: https://datatalks.club/slack.html
01:08:36
May 7, 2021
Effective Communication with Business for Data Professionals - Lior Barak
We talked about: DataTalks.Club intro Lior’s background Who is a data strategist? Improving communication between business and tech Building trust Putting data and business people together Dealing with pushbacks Building things in the lean way (and growing tomatoes) Starting with ugly code Convincing others to take our code MVP vs development and Hummus Talking to people who can’t code Break down the silos Hummus Hummus places in Berlin Lior’s book: Data is Like a Plate of Hummus Data chaos Links: Book: https://www.amazon.com/-/en/Sarah-Mayor/dp/B086L277LZ (can be found on any Amazon store) Company: https://www.taleaboutdata.com/ Podcast: https://podcast.whatthedatapodcast.com/ LinkedIn: https://www.linkedin.com/in/liorbarak/ Twitter: https://twitter.com/liorb Hummus places in Berlin: Azzam: https://goo.gl/maps/uCkb3ATc5CVKapDa6 Akkawy: https://g.page/akkawy The Eatery Berlin: https://g.page/theeateryberlin Join DataTalks.Club: https://datatalks.club/slack.html
57:23
April 30, 2021
Data Observability - Barr Moses
We covered: Barr’s background Market gaps in data reliability Observability in engineering Data downtime Data quality problems and the five pillars of data observability Example: job failing because of a schema change Three pillars of observability (good pipelines and bad data) Observability vs monitoring Finding the root cause Who is accountable for data quality? (the RACI framework) Service level agreements Inferring the SLAs from the historical data Implementing data observability Data downtime maturity curve Monte Carlo: data observability solution Open source tools Test-driven development for data Is data observability cloud agnostic? Centralizing data observability Detecting downstream and upstream data usage Getting bad data vs getting unusual data Links: Learn more about Monte Carlo: https://www.montecarlodata.com/ The Data Engineer's Guide to Root Cause Analysis: https://www.montecarlodata.com/the-data-engineers-guide-to-root-cause-analysis/ Why You Need to Set SLAs for Your Data Pipelines: https://www.montecarlodata.com/how-to-make-your-data-pipelines-more-reliable-with-slas/ Data Observability: The Next Frontier of Data Engineering: https://www.montecarlodata.com/data-observability-the-next-frontier-of-data-engineering/ To get in touch with Barr, ping her in the DataTalks.Club group or use barr@montecarlodata.com Join DataTalks.Club: https://datatalks.club/slack.html
01:01:44
April 23, 2021
Shifting Career from Analytics to Data Science - Andrada Olteanu
We talked about: Andrada’s background Recommended courses Kaggle and StackOverflow Doing notebooks on Kaggle Projects for learning data science Finding a job and a mentor with Kaggle’s help The process for looking for a job Main difficulties of getting a job Project portfolio and Kaggle Helpful analytical skills for transitioning into data science Becoming better at coding Learning by imitating Is doing masters helpful? Getting into data science without a masters Kaggle is not just about competitions The last tip: use social media Links: https://www.kaggle.com/andradaolteanu  https://twitter.com/andradaolteanuu https://www.linkedin.com/in/andrada-olteanu-3806a2132/ Join DataTalks.Club: https://datatalks.club/slack.html
01:02:34
April 16, 2021
Transitioning from Project Management to Data Science - Ksenia Legostay
We talked about: Ksenia’s background Data analytics vs data science Skills needed for data analytics and data science Benefits of getting a master's degree Useful online courses How project management background can be helpful for the career transition Which skills do PMs need to become data analysts? Going from working with spreadsheets to working with Python Kaggle Productionizing machine learning models Getting experience while studying Looking for a job Gap between theory and practice Learning plan for transitioning Last tips and getting involved in projects Links: Notes prepared by Ksenia with all the info: https://www.notion.so/ksenialeg/DataTalks-Club-7597e55f476040a5921db58d48cf718f Join DataTalks.Club: https://datatalks.club/slack.html
01:03:32
April 9, 2021
Building Online Tech Communities - Demetrios Brinkmann
We talked about: Demetrios’ background and starting the MLOps community Growing MLOps community Community moderation and dealing with problems Becoming a community and connecting with people Feeling of belonging Managing a community as an introvert Keeping communities active Doing custdev and talking to users Random coffee and meeting with community members Organizing community activities Is community a business? Five steps for starting a community in 2021 Shameless plug from Demetrios Links: https://mlops.community/ Join DataTalks.Club: https://datatalks.club/slack.html
01:13:52
April 2, 2021
DataOps 101 - Lars Albertsson
We talked about: Lars’ career Doing DataOps before it existed What is DataOps Data platform Main components of the data platform and tools to implement it Books about functional programming principles Batch vs Streaming Maturity levels Building self-service tools MLOps vs DataOps Data Mesh Keeping track of transformations Lakehouse Links: https://www.scling.com/reading-list/ https://www.scling.com/presentations/ Join DataTalks.Club: https://datatalks.club/slack.html
01:09:26
March 26, 2021
The Essentials of Public Speaking for Career in Data Science - Ben Taylor
We talked about: Ben’s background AI evangelism Ben’s first experiences speaking in public Becoming a great speaker Key Takeaways and Call to Action Making a good introduction Being Remembered Writing a talk proposal for conferences Landing a keynote Good topics to start talks on Pitching a solution talk to meetup organizers Top public speaking skill to acquire Book recommendations Join DataTalks.Club: https://datatalks.club/slack.html
01:08:48
March 19, 2021
New Roles and Key Skills to Monetize Machine Learning - Vin Vashishta
We discussed monetization roles and the capabilities people need to move into those roles. The key roles are ML Researcher, ML Architect, and ML Product Manager. We talked about: Vin's career journey What does it mean to "monetize machine learning" Important monetization metrics Who should we have on the team to make a project successful Machine Learning Researcher (applied and scientist) - background, responsibilities, and needed skills Developing new categories The best recipe for a startup: angry users + data scientists What research actually is ML Product Manager - background, responsibilities, and needed skills How product managers can actually manage all their responsibilities (and they have a lot of them!) ML Architect - background, responsibilities, and needed skills Path to becoming an architect How should we change education to make it more effective Important product metrics And more! Links: https://twitter.com/v_vashishta https://linkedin.com/in/vineetvashishta https://databyvsquared.com/ Join DataTalks.Club: https://datatalks.club/slack.html
01:19:52
March 12, 2021
Personal Branding - Admond Lee Kin Lim
We talked about: Admond's career journey What is a personal brand How Admond started being active online Publishing on Medium and LinkedIn Idea generation process and tools Other platforms Podcasts Offline presence 1x1 meetings Speaking at conferences Having confidence to publish Selling online courses Personal values Admond's course And many other things Links: https://twitter.com/admond1994 https://linkedin.com/in/admond1994 https://buzzsumo.com https://feedly.com/ https://lunchclub.com/ https://thelead.io/data-scientist-personal-brand-toolkit?utm_medium=instructor&utm_source=admond Join DataTalks.Club: https://datatalks.club/slack.html
01:13:14
March 5, 2021
The ABC’s of Data Science - Danny Ma
Did you know that there are 3 different types of data scientists? A for analyst, B for builder, and C for consultant - we discuss the key differences between each one and some learning strategies you can use to become A, B, or C. We talked about: Inspirations for memes Danny's background and career journey The ABCs of data science - the story behind the idea Data scientist type A - Analyst Skills, responsibilities, and background for type A Transitioning from data analytics to type A data scientist (that's the path Danny took) How can we become more curious? Data scientist type B - Builder Responsibilities and background for type B Transitioning from type A to type B Most important skills for type B Why you have to learn more about cloud Data scientist type C - Consultant Skills, responsibilities, and background for type C Growing into the C type Ideal data science team Important business metrics Getting a job - easier as type A or type B? Looking for a job without experience Two approaches for job search: "apply everywhere" and "apply nowhere" Are bootcamps useful? Learning path to becoming a data scientist Danny's data apprenticeship program and "Serious SQL" course Why SQL is the most important skill R vs Python Importance of Masters and PhD Links: Danny's profile on LinkedIn: https://linkedin.com/in/datawithdanny Danny's course: https://datawithdanny.com/ Trailer: https://www.linkedin.com/posts/datawithdanny_datascientist-data-activity-6767988552811847680-GzUK/ Technical debt paper: https://proceedings.neurips.cc/paper/2015/hash/86df7dcfd896fcaf2674f757a2463eba-Abstract.html Join DataTalks.Club: https://datatalks.club/slack.html
01:25:49
February 26, 2021
Translating ML Predictions Into Better Real-World Results with Decision Optimization - Dan Becker
We talked about: How we make decisions with machine learning What is decision optimization Specifying the decision function Emulation for making the best decisions Decision optimization and reinforcement learning Getting started with decision optimization Trends in the industry Links: https://datatalks.club/people/danbecker.html https://www.decision.ai/ Join DataTalks.Club: https://datatalks.club/slack.html
55:44
February 19, 2021
Feature Stores: Cutting through the Hype - Willem Pienaar
We covered: What is a feature store Problems it solves When to use a feature store When not to use a feature store The main components When a team should start using a feature store Links: Feast: https://feast.dev/ https://www.tecton.ai/blog/what-is-a-feature-store/ https://docs.greatexpectations.io/en/latest/reference/core_concepts.html Join DataTalks.Club: https://datatalks.club
01:01:06
February 12, 2021
The Rise of MLOps - Theofilos Papapanagiotou
We covered: What is MLOps The difference between MLOps and ML Engineering Getting into MLOps Kubeflow and its components, ML Platforms Learning Kubeflow DataOps And other things Links: Microsoft MLOps maturity model: https://docs.microsoft.com/en-us/azure/architecture/example-scenario/mlops/mlops-maturity-model Google MLOps maturity levels: https://cloud.google.com/solutions/machine-learning/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning MLOps roadmap 2020-2025: https://github.com/cdfoundation/sig-mlops/blob/master/roadmap/2020/MLOpsRoadmap2020.md Kubeflow website: https://www.kubeflow.org/ TFX Paper: https://research.google/pubs/pub46484/ Join DataTalks.Club: https://datatalks.club
01:02:51
February 5, 2021
Getting Started with Open Source - Vincent Warmerdam
We talked about: open source, getting started with open source, convincing your employer to contribute to open source, public speaking, the checklist for open source projects, the role of research advocate. And many more things! Links from Vincent: https://www.youtube.com/watch?v=68ABAU_V8qI&t=975s&ab_channel=PyData https://www.youtube.com/watch?v=kYMfE9u-lMo&t=958s&ab_channel=PyData https://koaning.io/projects.html https://calmcode.io/ https://makenames.io/ https://koaning.github.io/clumper/api/clumper.html Join DataTalks.Club: https://datatalks.club
01:02:47
January 29, 2021
Developer Advocacy for Data Science - Elle O'Brien
We talked about developer advocacy for data science. We covered: The role of a developer advocate The skills needed for the job and the responsibilities How to become a developer advocate You can find Elle on: Twitter: https://twitter.com/DrElleOBrien LinkedIn: https://linkedin.com/in/drelleobrien DVC's YouTube channel: https://www.youtube.com/channel/UC37rp97Go-xIX3aNFVHhXfQ Join DataTalks.Club: https://datatalks.club
55:36
January 23, 2021
The Importance of Writing in a Tech Career - Eugene Yan
We talk about blogging and technical writing. We cover: Why should we write online? What should we write about? Writing at work: Design documents, wikis, etc. The writing process (also at work) Eugene's website: eugeneyan.com Follow Eugene on Twitter: https://twitter.com/eugeneyan Suggest topics: https://eugeneyan.com/topic-poll/ Join DataTalks.Club: https://datatalks.club
57:24
January 15, 2021
Mentoring - Rahul Jain
We talked about: The role of mentoring in a career Looking for mentors and preparing for mentoring sessions as a mentee Becoming a mentor And many other things! Links: Rahul's profile on the mentoring club: https://www.mentoring-club.com/the-mentors/rahul-jain Rahul's article about mentoring: https://rahulj51.github.io/career/coaching/mentoring/2020/06/22/career-coaching.html Join DataTalks.Club: https://datatalks.club
56:12
December 25, 2020
Standing out as a Data Scientist - Luke Whipps
We covered: Getting the recruiter's attention Making your CV look great Tailoring your application to the position And many other things! Luke's LinkedIn profile: https://www.linkedin.com/in/lukewhipps/ Join DataTalks.Club: https://datatalks.club
01:09:26
December 18, 2020
Building a Data Science Team - Dat Tran
We talked about: Dat's career so far and the startup he co-founded (Priceloop) Who to hire first in a data team How to hire the first data scientist And many other things! You can find Dat on LinkedIn: https://www.linkedin.com/in/dat-tran-a1602320/ Join DataTalks.Club: https://datatalks.club
58:45
December 11, 2020
Processes in a Data Science Project - Alexey Grigorev
In this podcast, we talk about CRISP-DM - a methodology for organizing data science projects. DataTalks.Club is the place to talk about data. Join our community: https://datatalks.club Read more about CRISP-DM here: https://mlbookcamp.com/article/crisp-dm
31:33
December 4, 2020
Roles in a data team - Alexey Grigorev
We talked about:
- different roles in a data team: product managers, data analysts, data engineers, data scientists, ML engineers, MLOps engineers
- their responsibilities
- the skills they need
DataTalks.Club is the place to talk about data. Join our community: https://datatalks.club
42:45
November 21, 2020