Chai Time Data Science

By Sanyam Bhutani
The Chai Time Data Science show is a series in which Sanyam Bhutani interviews his data science heroes (practitioners, Kagglers, and researchers) about all things data science.
Where to listen: Apple Podcasts, Breaker, Google Podcasts, Overcast, Pocket Casts, RadioPublic, Spotify, Stitcher

Arsha Nagrani: Multi-Modal Research, Speaker Diarisation, VoxCeleb #123
Video Version: https://youtu.be/2tLNfR2whio Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Dr. Arsha Nagrani. Arsha is a researcher working at the intersection of audio and video, or audio and computer vision. They try to uncover what the field is really about. She has also co-organised challenges for problems in this domain. They talk about all of this work along with her approach to research. Links: Interview with Dima: https://youtu.be/GXqq_hj-UuY Follow: Arsha Nagrani: https://twitter.com/nagraniarsha https://www.linkedin.com/in/arsha-nagrani-601a726b Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for interviews with practitioners, Kagglers & researchers and all things data science, hosted by Sanyam Bhutani. You can expect weekly episodes, available as video, podcast, and blog posts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
41:32
December 3, 2020
Ekaterina Kochmar: Automated Language Teaching & Assessment, NLP, Korbit.ai #122
Video Version: https://youtu.be/2MT7bYZsiV4 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Ekaterina Kochmar: Lecturer and Researcher at Cambridge University, Co-Founder & CSO at Korbit.ai. As you might know, Sanyam is a fan of learning to learn, and they bring that topic back in this episode. Ekaterina has been working on automated language teaching and assessment: using machine learning and other tools to augment teaching, and building intelligent tutoring systems that teach language concepts, specifically for English and beyond. They take a deeper dive and discuss: - What does building a system like this take? - What research goes into it? - What are the interesting trends here? They also dive into Ekaterina's approach to research and what a research pipeline looks like for her. Links: https://www.manning.com/books/getting-started-with-natural-language-processing Follow: Ekaterina Kochmar: https://www.linkedin.com/in/ekaterina-kochmar-0a655b14/ https://www.cl.cam.ac.uk/~ek358/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
1:19:06
November 29, 2020
Richard Craib: The Numerai Story, Building the World's last Hedge Fund #121
Video Version: https://youtu.be/H0NfDIDcu84 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews a founder from South Africa, in the ranks of Elon Musk's genius, the Founder & CEO of Numerai: Richard Craib. They talk about Richard's journey into the world of finance, trading, and data science. They also discuss the Numerai story, the hedge fund, the competition, and the recently announced Signals, along with the "Master Plan". Links: Signals: https://signals.numer.ai https://medium.com/numerai/numerais-master-plan-1a00f133dba9 Follow: Richard Craib: https://twitter.com/richardcraib https://www.linkedin.com/in/richardcraib/ http://twitter.com/numerai Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
54:21
November 26, 2020
Michael Kennedy: Talk Python and Python Bytes Podcast, Creating Pythonic Content #120
Video Version: https://youtu.be/pgzEqhuGBd0 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani talks Python with the amazing podcaster, content creator, and educator Michael Kennedy, the host of the Talk Python and Python Bytes podcasts. They talk about Michael's journey in programming and with Python. Michael has been hosting the podcast for five years and has been in the programming world for even longer. They dive into what he has learned along the way and how his perspective has evolved, from creating content to creating courses and eventually building a business around it. Links: https://talkpython.fm/home Automating the saw: https://www.youtube.com/watch?v=JEImn7s7x1o The ML leads to 50 exoplanet discovery: https://www.techrepublic.com/article/machine-learning-algorithm-confirms-50-new-exoplanets-in-historic-first/ Follow: Michael Kennedy: https://twitter.com/mkennedy https://www.linkedin.com/in/mkennedy/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
1:05:33
November 22, 2020
Charlie Boyle: Pushing the Envelope of Super Computing, NVIDIA DGX A100, Hardware Engg at NVIDIA #119
Video Version: https://youtu.be/SiUnKGD90uI Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Charlie Boyle, VP and GM of DGX Systems at NVIDIA. NVIDIA recently unveiled its new DGX A100 systems and A100 GPUs, which are really pushing the envelope of supercomputing. They discuss the engineering that goes on behind the scenes, how the systems are designed and where they originated, how they have evolved over the years, and NVIDIA's design philosophy as a proxy for understanding how these technologies might bleed into consumer GPUs over time. Links: https://nvda.ws/32HaMCm https://nvidianews.nvidia.com/news/nvidia-dgx-station-a100-offers-researchers-ai-data-center-in-a-box https://www.youtube.com/watch?v=TKtN04z7Q5Q Follow: Charlie Boyle: https://www.linkedin.com/in/charlie-boyle-0201a8/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
55:02
November 19, 2020
Tim Dettmers: Personal Side of Academia, How to pick your Grad School, RTX 3000 FAQ #118
Video Version: https://youtu.be/RvwynqDUoQE Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews PhD student Tim Dettmers for the second time on the show! They talk a lot about research in general and about the personal aspect of research, which isn't covered as much: how you should pick a grad school and whether creativity is important in academia. They talk a lot about the personal side of things while going through the process of exploration in research, or in life in general. They also talk about the RTX 3000 series and discuss a few FAQs about it. Links: https://timdettmers.com/2020/03/10/how-to-pick-your-grad-school/ https://timdettmers.com/2020/09/07/which-gpu-for-deep-learning/ Follow: Tim Dettmers: https://twitter.com/tim_dettmers https://timdettmers.com Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com #GoogleAI #Research #MultiModal
1:12:31
November 15, 2020
"Beluga", Gabor Fodor: Cornell Birdcall 10 Pos Solo Gold Sol #117
Video Version: https://youtu.be/1O15z3Bv3UE In this episode, Sanyam Bhutani interviews the Beluga of the Kaggle world, Competitions GM Gabor Fodor. They talk about Gabor's journey into data science and Kaggle, and his 10th-position solo gold finish in the Cornell Birdcall competition. Links: Comp Link: https://www.kaggle.com/c/birdsong-recognition Writeup: https://www.kaggle.com/c/birdsong-recognition/discussion/183407 Follow: Gabor Fodor: https://twitter.com/BelugaFodor https://www.linkedin.com/in/gábor-fodor-6a081548/ https://www.kaggle.com/gaborfodor Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
48:05
November 12, 2020
Luis Serrano: AI Content Creation, Teaching ML, Grokking ML #116
Video Version: https://youtu.be/n2bUu9AcGnE Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews one of his teachers, content creator and Quantum AI Research Scientist: Luis Serrano. They talk about Luis' process of creating the amazing videos on his YouTube channel and his journey of teaching thousands, now millions, of people through his content. They also discuss his upcoming book, Grokking Machine Learning, which, as you might expect from him, is for anyone interested in machine learning; there are no prerequisites. This interview is filled with the kind of golden advice you would expect from someone with Luis' experience of teaching so many people. Links: Grokking Machine Learning: https://www.google.com/search?client=safari&rls=en&q=Grokking+Machine+Learning&ie=UTF-8&oe=UTF-8 Follow: Luis Serrano: https://twitter.com/luis_likes_math https://www.linkedin.com/in/luisgserrano/ https://serrano.academy Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com #GoogleAI #Research #MultiModal
52:09
November 8, 2020
Jacqueline Nolis & Emily Robinson: Building a Career in Data Science #115
Video Version: https://youtu.be/-OSWCeo18vs Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews two amazing data scientists who've quite literally written the book on building a career in data science: Jacqueline Nolis and Emily Robinson. Both guests have worked across different industries and have written amazing blog posts. Their book covers topics that are rarely discussed elsewhere, such as the soft-skills side of things and the lessons you learn on the job over the years. Links: https://www.manning.com/books/build-your-career-in-data-science?a_aid=buildcareer&a_bid=76784b6a http://podcast.bestbook.cool Follow: Jacqueline Nolis: http://twitter.com/skyetetra Emily Robinson: https://twitter.com/robinson_es https://www.linkedin.com/in/robinsones/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
1:16:17
November 5, 2020
Catherine Nelson & Hannes Hapke: Building ML Pipelines #114
Video Version: https://youtu.be/wxVGu8vKvG8 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the authors of the book "Building ML Pipelines": Catherine Nelson and Hannes Hapke. Catherine is a data scientist at SAP Concur Labs, and Hannes is an ML Engineer at the same company. They talk about their journey into machine learning and into the world of building machine learning pipelines, and they go into depth about the book, covering: - Who's the book really for? - What does it cover? - What can you expect out of it? Links: https://www.buildingmlpipelines.com Follow: Catherine Nelson: https://twitter.com/drcatnelson Hannes Hapke: https://twitter.com/hanneshapke hanneshapke.github.io Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
45:08
November 1, 2020
Jeff Heaton: Creating AI Content, Kaggle and Teaching #113
Video Version: https://youtu.be/nShflQj5RXY Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews a content creator, educator, data scientist, and Kaggler: Jeff Heaton. Jeff is currently VP at RGA and an amazing content creator on YouTube, where he puts out videos almost every week in the realm of machine learning, broadly speaking. He's also an adjunct professor at Washington University. They talk about his journey into machine learning, data science, Kaggle, and content creation, and how he got started across all of these different things. Follow: Jeff Heaton: https://twitter.com/jeffheaton https://github.com/jeffheaton Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
1:13:52
October 29, 2020
Ken Jee: Creating AI Content, Sports Analytics, Data Science Community #112
Video Version: https://youtu.be/UmEFK4RinUU Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews data scientist and content creator Ken Jee. They talk about his journey into data science, creating content, and sports analytics. They discuss what Ken has learned from creating content and how that experience ties in with everything else he has been doing. Follow: Ken Jee: http://twitter.com/kenjee_ds https://kennethjee.com https://www.linkedin.com/in/kenjee/ https://www.youtube.com/channel/UCiT9RITQ9PW6BhXK0y2jaeg Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
52:08
October 25, 2020
Philipp Singer & Christof Henkel: Google Landmark Recognition 1st Pos Sol #111
Video Version: https://youtu.be/NRl3lMlixPc Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews two Kaggle legends, Philipp Singer and Christof Henkel, about their team's winning solution to the Google Landmark Recognition 2020 Kaggle competition. This interview is a continued conversation with both legends; see the older interviews to learn more about their journeys, and watch this one for their winning solution and Christof's wisdom :D Links: Winning Writeup: https://www.kaggle.com/c/landmark-recognition-2020/discussion/187821 Github Link: https://github.com/psinger/kaggle-landmark-recognition-2020-1st-place Paper Link: https://arxiv.org/abs/2010.01650 Prev Interviews: Psi: NFL Data Bowl Win Sol: https://www.youtube.com/watch?v=_Srv0bKmfjY IEEE-CIS Comp 6th Pos Sol: https://www.youtube.com/watch?v=7sh5QrUIAHI Jigsaw & Tweet Sentiment: https://youtu.be/_X6wl0CX8xA Christof Henkel: Google Quest Q&A Labelling Comp 2nd Pos Sol: https://www.youtube.com/watch?v=Q0_Xajic_9U Follow: Philipp Singer: https://twitter.com/ph_singer https://www.kaggle.com/philippsinger Christof Henkel: https://twitter.com/kagglingdieter https://www.kaggle.com/christofhenkel Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
42:11
October 22, 2020
Thomas Wolf: NLP at Hugging Face, Transformers #110
Video Version: https://youtu.be/invR7r7c_pU Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews another NLP hero from 🤗, the CSO: Thomas Wolf. They talk about Thomas's journey into the field. Thomas has worked across many different fields, and they connect the dots of how following his passions eventually led him to NLP and the world of transformers. They talk about his research and his work at Hugging Face. Thomas gives an overview of the pipelines he works on at Hugging Face, the resources he contributes to, and the project he was involved in at the time of recording. There is also a lot of great advice on how to get started in NLP and in machine learning more broadly. Links: https://huggingface.co https://twitter.com/huggingface Follow: Thomas Wolf: https://twitter.com/Thom_Wolf https://www.linkedin.com/in/thomas-wolf-a056857 https://thomwolf.io Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
1:01:32
October 18, 2020
Ian Ozsvald: High Performance Python, Teaching & Consulting in ML #109
Video Version: https://youtu.be/I7376Wl8JQ4 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the Chief Data Scientist at Mor Consulting and an author of High Performance Python (O'Reilly): Ian Ozsvald. They talk all about his rich experience in Python and in data science, broadly speaking. Ian shares a lot of insights that he has picked up along his journey while working on many projects across different consulting engagements. They also touch upon his book: who should read it and what they will get out of it. Ian also shares plenty of advice that he has gathered over all of these years. Links: https://www.oreilly.com/library/view/high-performance-python/9781492055013/ Follow: Ian Ozsvald: https://twitter.com/ianozsvald https://www.linkedin.com/in/ianozsvald/ https://ianozsvald.com https://morconsulting.com Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
1:11:11
October 15, 2020
Tim Dettmers: Which RTX 3000 GPU to get for DL? 3090 FAQ | CTDS.Show #108
Video Version: https://youtu.be/CaoQLrSBk0o Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani This episode is an excerpt from Sanyam Bhutani's complete interview with Tim Dettmers. In this short excerpt, they discuss the RTX 3000 FAQs: - Should you buy these? - Which ones should you buy? - How can you make two of these work together? - Are they slower than the expected Quadro Cards? Links: https://timdettmers.com/2020/09/07/which-gpu-for-deep-learning/ Follow: Tim Dettmers: https://twitter.com/tim_dettmers https://timdettmers.com Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
26:43
October 11, 2020
Kun Hao Yeh: SIIM ISIC Melanoma 14 Pos Sol | Journey to Becoming Kaggle Master | CTDS.Show #107
Video Version: https://youtu.be/XoHYqL_LlWs Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Kaggle Master and Sr. Software Engineer: Kun Hao Yeh. They talk about their team's 14th-position gold solution to the SIIM ISIC Melanoma Classification competition. They also discuss Kun Hao's journey from taking one of the courses by Marios to teaming up with him, along with many Kaggle tips and tricks. Links: Winning Sol: https://www.kaggle.com/c/siim-isic-melanoma-classification/discussion/175403 Interview with Marios: https://www.youtube.com/watch?v=A3GvuHqGGZI Follow: Kun Hao Yeh: https://www.linkedin.com/in/michaelkhyeh/ https://www.kaggle.com/khyeh0719 Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com #Fastai #Learning #PyTorch
25:36
October 8, 2020
Jose Fernandez Portal & Francisco Ingham | Approaching Fast.ai | ML Community | CTDS.Show #106
Video Version: https://youtu.be/ZrXaY3tdXuc Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews two of his mentors and peers from fast.ai, DL Engineers at Mercado Libre: Jose Fernandez Portal and Francisco Ingham. This episode is largely about making a transition into the field: learning via fast.ai, and how and why to foster a community similar to fast.ai in your local region. Jose and Francisco have both taken the course and have had similar journeys, and they share a lot of interesting advice on how to approach the material and make the most of it, how to plan your time around it, and how to find projects to pursue. Links: Combining Tabular data & speech: https://forums.fast.ai/t/share-your-work-here/27676/526 SGD Animator: https://forums.fast.ai/t/share-your-work-here/27676/300 Learning AI: My Journey by Francisco: https://medium.com/@fpingham/learning-ai-my-journey-d99f47ba79f Follow: Jose Fernandez Portal: https://twitter.com/joshfp https://www.linkedin.com/in/josefp/ Francisco Ingham: https://twitter.com/fpingham https://medium.com/@fpingham Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience
1:09:56
October 4, 2020
Habib Bukhari & Igor Zubarev | Kaggle PANDA Challenge: 2nd Pos Sol | Gold Medalling on Kaggle | CTDS.Show #105
Video Version: https://youtu.be/UVZm2gEZhtA Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews two amazing Kagglers and Kaggle Masters, "DrHB" & "CatEek": Habib Bukhari & Igor Zubarev. They discuss their team's second-place finish in the PANDA competition (PANDA stands for Prostate cANcer graDe Assessment). Their team, "Save the prostate", ranked second on the private leaderboard. They discuss their journey into machine learning and Kaggle, connecting the dots of how both of them got interested in ML and how they got addicted to the platform. They also discuss their approach to competitions, how they team up, and an overview of their solution. Links: https://www.kaggle.com/c/prostate-cancer-grade-assessment https://www.kaggle.com/c/prostate-cancer-grade-assessment/discussion/169108 Follow: Habib Bukhari: https://twitter.com/dr_hb_ai https://www.kaggle.com/drhabib Igor Zubarev: https://www.kaggle.com/cateek Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
1:16:47
October 1, 2020
Marie-Anne Lachaux | Unsupervised Translation of Programming Languages | FAIR | CTDS.Show #104
Video Version: https://youtu.be/dIejo08k9mI Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Research Engineer at FAIR: Marie-Anne Lachaux. They discuss the joint work "Unsupervised Translation of Programming Languages", which Marie-Anne co-authored. If you're unaware: it's an amazing, mind-blowing work demonstrating how to translate code from one language to another. They dive into her research and her journey into the field, along with how research projects are approached at FAIR. Links: Unsupervised Translation of Programming Languages: https://arxiv.org/abs/2006.03511 Follow: Marie-Anne Lachaux: https://twitter.com/malachaux https://www.linkedin.com/in/marieannelachaux/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
35:07
September 27, 2020
Lavanya Shukla | Journey from coding websites at age 10 to Applying ML | CTDS.Show #103
Video Version: https://youtu.be/fOEG6EpAsLI Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the Head of Growth at Weights & Biases: Lavanya Shukla. Lavanya has had a very interesting journey into machine learning and programming. They talk all about how she started programming at the age of 10, built amazing websites over a dial-up connection in India, improved her skills over the years, joined a computer science degree programme, dropped out of it, and then found her passion for machine learning. They also talk a lot about general advice on how to progress in your programming and machine learning journey. Links: Weights and Biases: https://www.wandb.com Follow: Lavanya Shukla: https://twitter.com/lavanyaai https://lavanya.ai/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com
46:13
September 24, 2020
Alexey Grigorev | Machine Learning BookCamp | ML at Scale | Data Science at OLX | CTDS.Show
Audio (Podcast Version) available here: https://anchor.fm/chaitimedatascience Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Kaggle Competitions Master and Lead Data Scientist at OLX Group: Alexey Grigorev. They talk about his experience of transitioning into ML: how he did it and how Kaggle helped. Alexey shares a very honest overview of how helpful his university was (or wasn't) and how helpful Kaggle was. They also discuss Alexey's upcoming book: Machine Learning Bookcamp. Links: https://www.manning.com/books/machine-learning-bookcamp Follow: Alexey Grigorev: https://twitter.com/al_grigor https://www.linkedin.com/in/agrigorev/?originalSubdomain=de https://alexeygrigorev.com Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com #Scale #OLX #MachineLearning
50:02
September 20, 2020
Bryan Catanzaro | Research at NVIDIA | RTX 3000 | Deep Learning | CTDS.Show #101
GTC Conference tickets giveaway details: https://twitter.com/bhutanisanyam1/status/1306637737592840193?s=20 Video Version: https://youtu.be/guJT5GOiNjA Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the VP of Applied Research at NVIDIA: Bryan Catanzaro. They talk about Bryan's journey into ML and his journey at NVIDIA as a proxy for understanding the evolution of the research being done at NVIDIA. They discuss how Bryan's work led to the creation of cuDNN and what he is currently involved in. Towards the end, they also briefly discuss the recently launched RTX 3000 series. Follow: Bryan Catanzaro: https://twitter.com/ctnzr https://ctnzr.io Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com #NVIDIA #RTX3000 #Research
1:05:42
September 17, 2020
Sudalai Rajkumar, SRK | Journey to becoming 3x Kaggle GM | Datasets | CTDS.Show #100
Video Version: https://youtu.be/VPdmamFR6y8 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the SRK of Kaggle, 3x Kaggle Grandmaster and Data Scientist at H2O.ai: Sudalai Rajkumar. This interview is a Part-2 of the previous blog interview, they talk about SRK's journey to becoming 3x Kaggle GM along with his journey in Data Science and Kaggle. SRK shares many solid suggestions for people getting started in Data Science and Kaggle as well as for people with a "Non-Traditional" Background Links: Prev Interview: https://hackernoon.com/interview-with-twice-kaggle-grandmaster-and-data-scientist-at-h20-ai-sudalai-rajkumar-cd952ef0c522 Follow: SRK: https://twitter.com/sudalairajkumar https://www.kaggle.com/sudalairajkumar http://linkedin.com/in/sudalairajkumar Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:00:27
September 13, 2020
Sara Hooker | Research at Google AI | Interpretability and Model Compression | CTDS.Show #99
Video Version: https://youtu.be/O7JwKAv99_M Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Research Scholar at Google Brain: Sara Hooker. Sara has an amazing background and a "non-traditional" path into research. They talk all about her journey into machine learning research. Sara is still very much connected to her roots: she's from Africa, and she shares her amazing perspective on how different problems and different constraints can contribute to the field, research, or otherwise in interesting ways. Towards the end they also briefly dive into her recent works. Links: Underrated ML: https://www.underratedml.com Selective Brain Damage: Measuring the Disparate Impact of Model Pruning: https://arxiv.org/pdf/1911.05248 Follow: Sara Hooker: https://twitter.com/sarahookr https://www.sarahooker.me Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:29:41
September 10, 2020
Adrian Rosebrock | The PyImageSearch Story | OpenCV, Deep Learning & Optical Character Recognition
Video Version: https://youtu.be/wyfROwJUW-Y Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the Computer Vision Guru, Chief at PyImageSearch: Dr. Adrian Rosebrock. This interview is part-2 of Sanyam's blog interview with Adrian. They talk about Adrian's journey into CV and ML. They also discuss the secrets of PyImageSearch HQ and how the amazing tutorials are created. They also talk about Adrian's upcoming OCR Book and the importance of OCR. Links: OCR Book (Indiegogo link): https://www.indiegogo.com/projects/ocr-with-opencv-tesseract-and-python/ PyImageSearch: https://www.pyimagesearch.com Courses and Books by Adrian: https://www.pyimagesearch.com/books-and-courses/ Previous Interview: https://hackernoon.com/interview-with-the-author-of-pyimagesearch-and-computer-vision-practitioner-dr-adrian-rosebrock-e00583a225a0 Follow: Adrian Rosebrock: Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd #OpenCV #PyImageSearch #ComputerVision
1:29:50
September 6, 2020
Anastasiia Mishchuk | Applying Kaggle to Research | Learning ML | CTDS.Show #97
Video Version: https://youtu.be/o-2NJWKWwdA Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Computer Vision Research Engineer at EPFL: Anastasiia Mishchuk They talk about her journey into research, into engineering, and also on kaggle. Anastasiia is a kaggle competition master: They talk about her perspective on kaggle and also dive into her process of approaching research problems and discuss one of her recent works. Follow: Anastasiia Mishchuk: https://twitter.com/Ana_Geneva Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
24:12
September 3, 2020
Dima Damen | Action & Video Recognition | Epic-Kitchens-100 | CTDS.Show #96
Personal Note: This is one of the best interviews that I've had the privilege of conducting. Every single answer by Dima, every single quote is simply golden. Video Link: https://youtu.be/GXqq_hj-UuY Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews an associate professor at the University of Bristol and CV researcher with interests in action recognition and video recognition: Dr. Dima Damen. They talk about her journey into the field and her research, what she's learned from advising so many PhD, Masters and undergrad students, and how she teaches computer vision. They discuss her recent works on action recognition, along with the Epic-Kitchens-100 dataset that she's recently created. Links: Epic-Kitchens-100 Webpage: https://epic-kitchens.github.io/2020-100 Epic-Kitchens-100 Trailer: https://www.youtube.com/watch?v=8IzkrWAfAGg Follow: Dima Damen: https://twitter.com/dimadamen https://www.linkedin.com/in/dimadamen/ http://people.cs.bris.ac.uk/~damen/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:13:50
August 30, 2020
Andrey Kurenkov | Lessons from Grad School | TheGradient.pub | Stanford Research | CTDS.Show #95
Video Version: https://youtu.be/NrkfNqz5w7M Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews PhD Student at Stanford, AI and vision lab: Andrey Kurenkov This interview is a more critical take on research and different aspects of Andrey's research. They talk about what he's learned through his mistakes at grad school. They also discuss about TheGradient.pub, and Andrey's process of writing. Links: "All my failures in Grad School": https://www.youtube.com/watch?v=uxYpJ5mMKx0 Follow: Andrey Kurenkov: https://twitter.com/andrey_kurenkov https://www.andreykurenkov.com https://www.linkedin.com/in/andreykurenkov/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
28:59
August 27, 2020
Fatma Guney | Unsupervised Learning of Multi-Frame Optical Flow w Occlusions | 3D CV | CTDS.Show #94
Video Version: https://youtu.be/Fc6IEqEEpZw Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Computer Vision Researcher: Fatma Guney They talk all about Computer Vision, her research interests: optical flow and even beyond, self driving cars. Fatma has also taught a few courses based on computer vision and they dive into her take on what should students focus on in this field. Links: Unsupervised Learning of Multi-Frame Optical Flow with Occlusions: http://www.cvlibs.net/publications/Janai2018ECCV.pdf Slow Flow: Exploiting High-Speed Cameras for Accurate and Diverse Optical Flow Reference Data: http://www.cvlibs.net/publications/Janai2017CVPR.pdf Computer Vision for Autonomous Vehicles: Problems, Datasets and State-of the-Art: https://arxiv.org/abs/1704.05519 Follow: Fatma Guney: https://twitter.com/ftmguney https://mysite.ku.edu.tr/fguney/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
52:00
August 23, 2020
Ahmet Erdem | Trends Neuroimaging 7th place solo gold | Rapids.ai | CTDS.Show #93
Video Version: https://youtu.be/G1ZvjJYRxH8 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews a Kaggle Grandmaster from Turkey, Sr. Software Engineer at Rapids.ai: Ahmet Erdem They talk all about Ahmet's Kaggle journey, and his seventh place solo gold solution to the Trends Neuroimaging competition. Ahmet has been competing for a few years now, they discuss his approach to Kaggle which has led him to achieving 50% medals solo versus 50% in a team. Links: Interview with Even Oldridge: https://www.youtube.com/watch?v=-WzXIV8P_Jk 7th Place Solution writeup: https://www.kaggle.com/c/trends-assessment-prediction/discussion/162787 Follow: Ahmet Erdem: https://twitter.com/a_erdem4 https://www.linkedin.com/in/aerdem4/ https://www.kaggle.com/aerdem4 Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
31:17
August 20, 2020
Abhishek Thakur | Approaching (Almost) any ML Problem | CTDS.Show #92
Video Version: https://youtu.be/zgC8fjF0Now Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Abhishek Thakur for the 2nd time! They talk about his book: Approaching (Almost) any ML Problem, Abhishek's process to creating Videos and his process to writing the book. They also discuss on how to effectively approach Kaggle and Abhishek shares many advices that are applicable to Kaggle. Links: Previous Interview: https://www.youtube.com/watch?v=Ezbo57Z33N8 Follow: Abhishek Thakur Twitter: https://twitter.com/abhi1thakur YouTube: https://www.youtube.com/channel/UCBPRJjIWfyNG4X-CRbnv78A Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
45:40
August 16, 2020
Guanshuo Xu | Achieving #1 Rank in Kaggle Competitions Tier | Alaska2 Winning Sol | CTDS.Show #91
Video version: https://youtu.be/lkUhibNLMNk Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Data Scientist at H2O.ai, the new King of Kaggle Competitions, new Rank 1: Dr. Guanshuo Xu Guanshuo has just secured the title with his win in Alaska2 Image Steganalysis competition, in his journey he has achieved 15 gold medals, 15 silver medals, out of these 10 are solo gold 13 are solo Silver! They talk about his journey on Kaggle, he shares a few advices around how you should approach Kaggle, and how his approach has changed, how he approaches competition and how he learned over the period of competing on Kaggle. They also briefly discuss his solution to the Alaska2 competition as a proxy to understanding his approach to competitions. They also talk about his role at H2O.ai and the projects that he is contributing to. Links: Alaska2 Comp Page: https://www.kaggle.com/c/alaska2-image-steganalysis/discussion Solution Writeup: https://www.kaggle.com/c/alaska2-image-steganalysis/discussion/168548 Follow: Dr. Guanshuo Xu: https://www.linkedin.com/in/guanshuo-xu/ https://www.kaggle.com/wowfattie Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
47:07
August 13, 2020
Aakash Nain | Tensorflow 2.0 | TF Add-Ons | Good API Designs | CTDS.Show #90
Audio (Podcast Version) available here: https://anchor.fm/chaitimedatascience Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews one of his personal mentors and a guru to the Indian machine learning community, Aakash Nain: Deep Learning engineer at Ola, GoogleDev Expert and TensorFlow Add-Ons maintainer. This interview is really part two of the earlier blog interview with Aakash. In this interview, they expand on the things from earlier and discuss Aakash's journey into open source, and how his appreciation for open source, TensorFlow and good API design has evolved. He also shares a lot of amazing advice on how someone who's interested in becoming a better practitioner, better Kaggler, or better contributor can learn, and learn from Aakash's mistakes as well. Previous Interview: https://medium.com/dsnet/interview-with-kaggle-kernels-expert-aakash-nain-73209223bbd0 Follow: Aakash Nain: https://twitter.com/A_K_Nain https://t.co/BuGVTun1TH?amp=1 Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
30:15
August 9, 2020
Rachael Tatman | Rasa, Kaggle | Conversational AI | CTDS.Show #89
Video Version: https://youtu.be/WyVWt3Jr6O8 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Rachael Tatman for the second time! This interview is a continuation of the previous one. Rachael was earlier at Kaggle, and they start the interview by talking about her work at Kaggle, and her involvement with the community and its contribution towards her journey. Later, they shift gears into her role today, which is developer advocate at Rasa: What exactly does a Developer Advocate do? Rachael also shares many interesting opinions on conversational AI, her opinion of the state of conversational AI in 2020, and many great project ideas for anyone who's interested in building projects using Rasa, or otherwise seeking ideas. They talk about her process of content creation, and if you watch any of our streams, they also discuss how she brings so much positivity and so much energy to all of our streams. Links: Previous Interview: https://medium.com/dsnet/interview-with-data-scientist-at-kaggle-dr-rachael-tatman-8bc61f9efdb9 https://rasa.com Follow: Rachael Tatman: https://twitter.com/rctatman http://www.rctatman.com https://www.kaggle.com/rtatman https://www.linkedin.com/in/rachael-tatman-500a323a/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd #Chatbots #Rasa #Kaggle
40:43
August 6, 2020
Philipp Singer | Two Solo Gold Solutions: Jigsaw Toxic Comment, Tweet Sentiment | CTDS.Show #88
Video Version: https://youtu.be/_X6wl0CX8xA Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the relentless Solo Lion from The Zoo: Philipp Singer for the 3rd time! Philipp has just become a competition Grandmaster after winning solo gold in two competitions, bringing 11 gold medals to his profile! They talk about his journey on Kaggle: his reflections on this journey to becoming a grandmaster in one and a half years, his secret to winning and his constant success on Kaggle. They also discuss his recent role: he's just joined the team at H2O.ai, and they discuss what tasks he is taking on there. Timestamps: 00:00: Intro 03:24: Grandmaster Fight Club & Favourite Kaggle Memories 08:50: Advice to yourself from the past 12:23: New role and H2O.ai 17:12: Competing in 2 comps 19:23: Jigsaw Comp 40:12: Twitter Sentiment Comp 58:42: What's next for you? Links: Jigsaw Multilingual Toxic Comment Classification 8th Pos Sol writeup by Philipp: https://www.kaggle.com/c/jigsaw-multilingual-toxic-comment-classification/discussion/160937 Tweet Sentiment Extraction, 11th Pos Sol Writeup by Philipp: https://www.kaggle.com/c/tweet-sentiment-extraction/discussion/159440 Previous Interviews: NFL Data Bowl Win Sol: https://www.youtube.com/watch?v=_Srv0bKmfjY IEEE-CIS Comp 6th Pos Sol: https://www.youtube.com/watch?v=7sh5QrUIAHI Follow: Dr. Philipp Singer: https://twitter.com/ph_singer https://www.linkedin.com/in/philippsinger/ https://www.kaggle.com/philippsinger Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:01:30
August 2, 2020
Rob Mulla | Trends Neuroimaging: 15th Pos & Ion Switching: 11th Pos Sol | CTDS.Show #87
Personal Note: In my opinion, this interview really connects the dots well for anyone who's looking to make a transition or just starting the journey in data science via Kaggle. Video Version: https://youtu.be/uBr1HXHWjOQ Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Dual Grandmaster and Competitions Master: Rob Mulla. They talk about Rob's journey into data: he started as an electrical engineer and transitioned into analytics and data science afterwards. Rob has been sharing a few interesting kernels which are called race to the name of the competition, and many other amazing kernels as well; they discuss his process of creating them. They also discuss two of his recent amazing finishes in different competitions: the first is a solo gold finish in the University of Liverpool: Ion Switching competition, and the second is a 15th position silver finish in the Trends NeuroImaging Competition. Links: Ion Switching Solution Writeup: https://www.kaggle.com/c/liverpool-ion-switching/discussion/153694#861209 Trends NeuroImaging Solution Writeup: https://www.kaggle.com/c/trends-assessment-prediction/discussion/162749 Follow: Rob Mulla: https://twitter.com/rob_mulla https://www.linkedin.com/in/rob-mulla/ https://www.kaggle.com/robikscube Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd #Grandmaster #Solution #Kaggle
1:06:54
July 30, 2020
Shaikat Galib | Tweet Sentiment Classification 13th Pos Gold Sol | CTDS.Show #86
Video Version: https://youtu.be/C0O6QfiWc5k Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the first Kaggle Grandmaster from Bangladesh: Shaikat Galib. Shaikat has done a PhD in nuclear engineering and has just become a competition Grandmaster. As you'd expect, in this episode they talk all about his journey on and off Kaggle. He's also a data scientist at H2O.ai, and they discuss his work along with his 13th position gold-winning solution to the Tweet Sentiment Extraction Kaggle Competition. Links: Tweet Sentiment Extraction, 13th Pos Sol Writeup by Shaikat: https://www.kaggle.com/c/tweet-sentiment-extraction/discussion/159505 Follow: Shaikat Galib: https://www.linkedin.com/in/smg478/ https://www.kaggle.com/sgalib Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:00:39
July 26, 2020
Anniversary Episode: Celebrating 1 Year of CTDS.Show | CTDS.Show Kaggle Contest Results | Episode #85
Note 1: This episode is a reflection on the past 1 year of running CTDS.Show and the results of the Kaggle contest; interviews resume this Sunday. The complete playlist up to the end of September has been announced on the YouTube page. Note 2: The show notes will be updated in a few hours from the release to sync with the kernel leaderboard announcement. Kaggle Contest Link: https://www.kaggle.com/rohanrao/chai-time-data-science/discussion/156137 Follow: Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:24:55
July 23, 2020
Chris Deotte | Secrets to Becoming 4x Kaggle Grandmaster | Discussions and Notebooks #1 | CTDS.Show #84
Audio (Podcast Version) available here: https://anchor.fm/chaitimedatascience Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews 4x Kaggle Grandmaster and the new King of Discussions and Notebooks: Chris Deotte. Chris has an amazing, very diverse and very rich background, they connect the dots of his journey his professional journey with data science, they talk about his previous life: How did he transition into data science on kaggle and his journey on kaggle. Chris at the time of recording has just become a 4x Grandmaster: He's ranked 32 on the This interview is really a complete overview of Chris's journey on and off Kaggle, his overview of Kaggle. Links: More about rapids.ai: https://youtu.be/-WzXIV8P_Jk Follow: Chris Deotte: https://twitter.com/ChrisDeotte https://www.kaggle.com/cdeotte Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:06:52
July 19, 2020
Aman Madaan | Politeness Transfer: A tag and Generate Approach | fast.ai | CTDS.Show #83
Video Version: https://youtu.be/_5XDjUoQvp4 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Graduate student and researcher at Carnegie Mellon University: Aman Madaan. Aman has worked across different industries on different machine learning related use cases, which they talk all about in this interview. He's worked on use cases involving applications at large scale and has recently transitioned into academia. They discuss his perspective and how it changed with that transition. Aman has also taken the fast.ai course, and one of his posts from three years ago just became a reality: his idea led to a research project about "Politeifying text", that is, making your text more polite. They discuss all about this project, and how the last three years have helped make this happen. Links: Politeness Transfer: A tag and Generate Approach: https://arxiv.org/pdf/2004.14257 Follow: Aman Madaan: https://twitter.com/aman_madaan https://madaan.github.io Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd #fastai #StyleTransfer #DeepLearning
48:56
July 16, 2020
Yifan Xie | Freelancing in Data Science | Kaggle DFDC Comp | CTDS.Show #82
Video Version: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Data Scientist at Arion.ai, Kaggle Comp Master: Yifan Xie They talk about Yifan’s journey into Data Science, kaggle and how his approach to competing and freelancing changed over the years. They also discuss their team’s Solution to DFDC Comp. Links: https://www.kaggle.com/c/deepfake-detection-challenge/discussion/157983 Follow: Yifan Xie: Twitter: https://twitter.com/YifanX Linkedin: https://www.linkedin.com/in/yifanxie/ Kaggle: https://www.kaggle.com/yifanxie Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:48:23
July 12, 2020
Yuki Asano | Self-Supervision | Self-Labelling | Labelling Unlabelled videos from scratch w multi-modal self-supervision | CV | CTDS.Show #81
Video Link: https://youtu.be/LPdbnasJ9wI Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Computer Vision Researcher at the Oxford VGG Group: Yuki Asano. They talk all about Yuki's journey into Computer Vision and his research interests, how Yuki approaches research. They also discuss three of Yuki's recent works and Yuki walks us through the process of how he approached the projects, along with an interesting overview of the same. Links: A critical analysis of self-supervision, or what we can learn from a single image: https://arxiv.org/abs/1904.13132 Self-labelling via simultaneous clustering and representation learning: https://arxiv.org/abs/1911.05371 Labelling unlabelled videos from scratch with multi-modal self-supervision: https://arxiv.org/abs/2006.13662 Follow: Yuki Asano: https://twitter.com/y_m_asano https://www.linkedin.com/in/yuki-m-asano/ https://yukimasano.github.io Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
47:45
July 9, 2020
Parul Pandey | Journey to becoming Kaggle Grandmaster | CTDS.Show #80
Video: https://youtu.be/_--6uAOPEP4 Newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Parul Pandey for the second time on the series. They talk all about her journey to becoming Kaggle Kernels Grandmaster, her process of writing the amazing kernels along with a lot of ethical issues around Kaggle. Previous Interview: https://youtu.be/DjBgB_fNXl0 Follow: Parul Pandey: https://www.linkedin.com/in/parulpandeyindia/ https://twitter.com/pandeyparul https://www.kaggle.com/parulpandey Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. Flow by LiQWYD https://soundcloud.com/liqwyd
59:02
July 5, 2020
Addison Howard, Maggie Demkin, Phil Culliton | Kaggle Team | CTDS.Show #79
Video Version: https://youtu.be/-eU2z8xWpEs Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews three people from the kaggle team: Phil Culliton, who's on the data science team at Kaggle, Maggie Demkin: who's on the customer success and business development team, and Addison Howard, who's a programme manager at Kaggle. They talk a lot about what happens behind the scenes at kaggle, Including a very interesting Gladiator fight. Please stay tuned! ;) They talk mainly about what happens from the start to the ending of the competition. How does a competition really come to its life? What happens before that, what happens after that, what happens during the duration, on the other side of the team? The team also shares many amazing stories, their favourite memories from hosting these competition, their overview of kaggle and the community. Follow: Addison Howard: https://www.linkedin.com/in/addison-howard-3ab25a25/ https://www.kaggle.com/addisonhoward Maggie Demkin: https://twitter.com/mdemkin https://www.linkedin.com/in/mmdemkin/ Phil Culliton: https://twitter.com/PhilCulliton https://www.linkedin.com/in/philculliton/ https://www.kaggle.com/philculliton Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd #Kaggle #CTDS #Competition
1:25:46
July 2, 2020
Kaggle Legend, "Raddar": Darius Barušauskas | Becoming Comp Grandmaster in 1 year | AI in Medicine
Video Link: https://youtu.be/qLcBbbYTsSU Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Kaggle legend, "Raddar": Darius Barušauskas. They talk about Raddar's journey to becoming a Comp Master in ~6 weeks and a Comp GM in 1 year. Raddar shares his secrets and advice on how to approach Kaggle problems. They also discuss his favourite memories and AI applied to medicine. Follow: Darius Barušauskas: https://www.kaggle.com/raddar https://twitter.com/tweet_raddar https://www.linkedin.com/in/raddar-in/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
48:21
June 28, 2020
Philipp Singer, Kaggle GM, Ranked #4 | Pos #8 Sol: Jigsaw Multilingual Toxic Comment Classification | CTDS.Show #77
Video: https://youtu.be/Z788bRuemsI Newsletter: https://tinyletter.com/sanyambhutani This Episode is an excerpt from Sanyam Bhutani's 3rd interview with Dr. Philipp Singer, Sr. Data Scientist at H2O.ai, Kaggle Comp and Discussions Grandmaster ranked 4 in both tiers. In this short version, they talk about Philipp's solution to Jigsaw Multilingual Toxic Comment Classification Comp where he ranked 8th and won a Solo-Gold Medal, his second one in the same week! They talk about the problem statement, TPUs and Philipp's solution. The complete interview will go out next week, so please stay tuned for that. Comp Link: https://www.kaggle.com/c/jigsaw-multilingual-toxic-comment-classification Solution Overview: https://www.kaggle.com/c/jigsaw-multilingual-toxic-comment-classification/discussion/160937 Previous Interviews: Interview 1: https://www.youtube.com/watch?v=7sh5QrUIAHI Interview 2: https://www.youtube.com/watch?v=_Srv0bKmfjY Follow: Philipp Singer https://twitter.com/ph_singer https://www.philippsinger.info/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every Sunday, Thursday available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
23:41
June 25, 2020
Connor Shorten | Creating AI Content, Videos, | Henry AI Labs, Research & GANs | CTDS.Show #76
Note: We are hosting a 3 week comp to celebrate the 1 year anniversary of CTDS. Details: https://www.kaggle.com/rohanrao/chai-time-data-science/tasks?taskId=1183 Video Version: https://youtu.be/Cn97ynuWAiQ Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Connor Shorten, an amazing content creator at Henry AI Labs. Connor is also the host of ML Street Talk Podcast. They talk about his process of creating content, his research, and how he summarises research papers and creates very accessible YouTube videos. Follow: Connor Shorten: https://twitter.com/CShorten30 https://www.youtube.com/channel/UCHB9VepY6kYvZjj0Bgxnpbw?sub_confirmation=1 https://www.youtube.com/channel/UCMLtBahI5DMrt0NPvDSoIRQ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
37:26
June 21, 2020
Rachel Thomas | Fast.ai | Applied Ethics | Top Down Learning | CTDS.Show #75
Video Version: https://youtu.be/tq_XcFubgKo Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani Personal Note: A huge thanks to the community for the support with the interview series. This is my 100th interview; it's an honour, and the 100th interview had to be with my guru, Dr. Rachel Thomas. In this episode, Sanyam Bhutani interviews Dr. Rachel Thomas, co-founder of fast.ai and Director of the USF Center for Applied Data Ethics. This episode covers three broad themes: top down learning and what it takes to create a course or material that follows the top down teaching approach; ethics and biases, where Rachel answers the question of how we can do better and address this theme better; and project building and blogging. This episode addresses a few very important topics and we hope you all get to learn about Ethics, especially Ethics applied in AI. Links: http://course.fast.ai http://forums.fast.ai http://fast.ai Follow: Rachel Thomas: https://twitter.com/math_rachel Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd #fastai #Ethics #CTDS.Show
36:54
June 18, 2020
Dmitry Larko | H2O.ai | Kaggle, Applying Kaggle to Real world | AutoML | CTDS.Show #74
Video Version: https://youtu.be/aC9t9D7HpYE In this episode, Sanyam Bhutani interviews Kaggle Grandmaster and Chief Data Scientist at H2O.ai: Dmitry Larko. Dmitry has been active on Kaggle for over 6 years, and they discuss his views on Kaggle and his journey with it. They also discuss Dmitry's work at H2O.ai, H2O's products and the challenges that Dmitry is working on outside of Kaggle. Follow: Dmitry Larko: https://twitter.com/DmitryLarko https://www.kaggle.com/dmitrylarko https://www.linkedin.com/in/dlarko/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. Flow by LiQWYD https://soundcloud.com/liqwyd
1:07:11
June 14, 2020
Maximilian Jeblick | Physics, Math and Data Science | Kaggle and H2O.ai | CTDS.Show #73
Video Version: https://youtu.be/VeM1T7UaYTk In this episode, Sanyam Bhutani interviews another amazing Maker and Kaggler from H2O.ai: Dr. Maximilian Jeblick. Max is a Senior Data Scientist at H2O.ai, a Kaggle Comp Master and holds a PhD in Mathematical Physics. They talk about his transition from Physics into Data Science, drawing many parallels between the two. They discuss his experience on Kaggle, his advice on how you can improve on Kaggle and learn from it, and his work at H2O.ai. Follow: Maximilian Jeblick: Linkedin: https://www.linkedin.com/in/maximilian-jeblick-170477173/ Kaggle: https://www.kaggle.com/maxjeblick Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. Flow by LiQWYD https://soundcloud.com/liqwyd
39:32
June 11, 2020
Andreas Mueller | Scikit-Learn | ML and Open Source | CTDS.Show #72
Video Version: https://youtu.be/iNZd_5T8tCI Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani Personal Note: It was really a privilege to talk to Andreas on the show, thanks to Abhishek Thakur for helping make this happen. In this episode, Sanyam Bhutani interviews one of the core developers of Scikit-Learn: Andreas Mueller. This episode is all about open source and machine learning. They talk about how Andreas' view of open source, machine learning, and Scikit-Learn itself has evolved over the years that he's been active in the open source community, along with his approach to creating open source APIs. There's a lot of discussion around Scikit-Learn, learning through materials, and Andreas' take on the recent developments in deep learning. We also discuss Andreas' move back to industry: he'll be joining Microsoft, and this interview talks about what's next for him as well. Follow: Andreas: https://twitter.com/amuellerml https://amueller.github.io Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:06:37
June 7, 2020
Martin Henze, "Heads Or Tails", First Kaggle Kernel GM | Astronomy | Story-Telling with Data
Video Version: https://youtu.be/2dpaSTWdhSk Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani Note: Correction: At 7:35: Heads or Tails' Kaggle avatar is the spiral galaxy M101, which has the nickname 'Pinwheel galaxy', not 'Whirlpool galaxy'. In this episode, Sanyam Bhutani interviews the First Kernel #GrandMaster on Kaggle: "Heads or Tails": Martin Henze. In this interview, they talk about his journey into data science and his transition from astronomy. Astronomy involves a lot of looking up at the skies and trying to figure out through data what's out there and, as you might imagine, many parallels are drawn between astronomy and data science and Martin's approach to creating his amazing kernels. They also discuss his transition into doing data science during the day. Personal Note: There is a lot of rich, hidden, golden advice for people who are working towards sharing their kernels on Kaggle. Follow: Martin Henze: https://twitter.com/heads0rtai1s https://www.linkedin.com/in/martin-henze/ http://kaggle.com/headsortails Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd Note: The giveaway isn't sponsored, I really enjoyed the book and want to share it with another learner who might. #MachineLearning #Production #Applications
1:09:36
June 4, 2020
Interview with Yauhen Babakhin | Kaggle, Computer Vision and AutoML | CTDS.Show
Video Version: https://youtu.be/n_IUOeiKwnE In this episode, Sanyam Bhutani interviews Kaggle Grandmaster and Data Scientist at H2O.ai: Yauhen Babakhin. They talk about Yauhen's roots in Belarus, some of his secrets, and his approaches to Kaggle competitions and to AutoML. They discuss a lot about how Yauhen approaches these problems, how he finds ideas and how he iterates upon them. They also cover an interesting area that Yauhen has contributed to via Kaggle: academia. Links: Interview by Parul Pandey: https://www.h2o.ai/blog/meet-yauhen-the-first-and-the-only-kaggle-grandmaster-from-belarus/ Paper: https://arxiv.org/pdf/1904.04445.pdf Yauhen's talk from H2O World: https://youtu.be/gRraEabTX3o Blog version: https://www.h2o.ai/blog/image-tasks-on-h2o-driverless-ai/ Follow: Yauhen Babakhin: Linkedin: https://www.linkedin.com/in/yauhenbabakhin/ Kaggle: https://www.kaggle.com/ybabakhin Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. Flow by LiQWYD https://soundcloud.com/liqwyd
1:05:52
May 31, 2020
Birthday Special AMA: Answering Questions from my ML Heroes | CTDS.News Launch
Check out the CTDS.News Teaser, live now: http://ctds.news Blog for this video and news launch will go live here: https://medium.com/@init_27 This is a special birthday AMA episode, hope you enjoy this one. Feel free to AMA in the comments below. I'm really thankful to everyone that sent their questions and wishes and agreed to this idea. Thank you! Follow: Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd Note: The giveaway isn't sponsored, I really enjoyed the book and want to share it with another learner who might. #Birthday #AMA #ML
1:06:24
May 27, 2020
Emmanuel Ameisen | Building Machine Learning Powered Apps
Link to tweet for details on Giveaway: https://twitter.com/bhutanisanyam1/status/1264589958146174976?s=20 Video Version: https://youtu.be/ctss0hcD9SE Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Emmanuel Ameisen, ML engineer at Stripe and the author of the O'Reilly book "Building Machine Learning Powered Applications: Going from idea to product". In this interview they talk about Emmanuel's journey into machine learning, and how his journey through different roles led him to write the book. The book is one of the greatest resources for building ML apps, and also one of the greatest books that follow the top down teaching approach. They talk about who the book is really for, what you can expect out of it, Emmanuel's journey of writing the book, where to go after you've read it, and how to find the idea to turn into your passion project or your million dollar app. Links: Book: https://www.amazon.com/Building-Machine-Learning-Powered-Applications/dp/149204511X/ http://shop.oreilly.com/product/0636920215912.do Radek's blog: https://medium.com/@radekosmulski Follow: Emmanuel Ameisen: https://twitter.com/mlpowered https://mlpowered.com Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd Note: The giveaway isn't sponsored, I really enjoyed the book and want to share it with another learner who might.
58:07
May 24, 2020
Eli Stevens, Luca Antiga, and Thomas Viehmann | Deep Learning with PyTorch
Link to tweet for details on Giveaway: https://twitter.com/bhutanisanyam1/status/1263500914427494400?s=20 Discount code: "PodChai20" Video Version: https://youtu.be/f5Qv3eSZpug Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews three great contributors to PyTorch: Eli Stevens, Luca Antiga, and Thomas Viehmann. The three are also authors of an upcoming book by Manning Publications: Deep Learning with PyTorch. In this interview, they talk about the authors' journeys into machine learning and with PyTorch, leading up to writing the book. They also discuss the book writing effort itself: the authors share what you can expect from the book, what you are expected to know before learning from it, and what you can take away from it. There is also a lot of great advice around project building, which is essential to getting your break into the field. Links: Link to the book: https://www.manning.com/books/deep-learning-with-pytorch PyTorch: https://pytorch.org Follow: Eli Stevens: https://www.linkedin.com/in/eli0stevens/ Luca Antiga: https://twitter.com/lantiga?lang=en https://lantiga.github.io Thomas Viehmann: https://twitter.com/thomasviehmann http://thomas.viehmann.net Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts.
If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd Note: Thanks to Manning Publications for doing the giveaway. Note that Manning hasn't sponsored the video in any way; I'm a huge fan of their book and I'm grateful that they're doing a special giveaway + discount for the CTDS.Show audience. #MachineLearning #PyTorch #Book
1:17:42
May 22, 2020
Goku Mohandas | MadeWithML | AI Research | Healthcare | Education
Video Version: https://youtu.be/VqysJmIqko8 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Goku Mohandas. They dive into the three aspects of Goku's personal profile: AI research, healthcare, and education. They start by talking about Goku's journey into research and how he got interested in machine learning. He shares a lot of great advice for aspiring researchers, followed by healthcare and machine learning. Education: Goku has recently started madewithmachinelearning.com, his latest contribution to machine learning education. Made with ML is a platform for discovering, organising, and building projects and sharing them with the community. We dive deep into what the platform is, the philosophy behind it, its future, and many upcoming features. Most of these will have been integrated by the time this interview goes out, so please do sign up if you get a chance. Personal Note: I really enjoyed talking with Goku; the discussion on every topic had really interesting golden nuggets. Follow: Goku Mohandas: https://twitter.com/GokuMohandas https://madewithml.com https://goku.me Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:35:28
May 22, 2020
Dmitry Danevskiy | Google Quest Q&A Labelling Comp: Winning Sol | Becoming Kaggle Grandmaster
Video version: https://youtu.be/pQL892iT-dM Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Kaggle Grandmaster Dmitry Danevskiy. At the time of recording, Dmitry's team had just finished first in the Google Quest Q&A Labelling Kaggle competition. With that gold medal, Dmitry achieved the title of Grandmaster in the competition tier. They speak about his journey on Kaggle and into data science, along with the team's first place solution to the Google Quest competition. Dmitry is currently working as a machine learning specialist, or machine learning research engineer; they also discuss his work, how Kaggle has impacted it, and his take on where Kaggle helps you in data science. Links: Winning sol writeup: https://www.kaggle.com/c/google-quest-challenge/discussion/129840 Comp: https://www.kaggle.com/c/google-quest-challenge/overview Interview with Yury about mlcourse: https://www.youtube.com/watch?v=guvFOjxdeeA Follow: Dmytro Danevskyi: https://twitter.com/DanevskiyD https://www.linkedin.com/in/dmitry-danevskiy/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
33:36
May 14, 2020
Hamel Husain | Fastpages, Open Source | ML at Github | fastai
Video Version: https://youtu.be/-pYMXSThpvc Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Hamel Husain, who's currently a staff ML engineer at GitHub. They talk all about the open source world and his life behind it at GitHub, his journey into the field of machine learning, his research ideas, and the projects that he's been working on at GitHub. They also talk about fast.ai, his journey with fast.ai, and his best advice to students, along with his recent project, fastpages, which is one of the best ways to get started with blogging. This interview has a lot of hidden advice around open source, fast.ai, and data science. Links: fast.ai: https://course.fast.ai Fastpages: https://github.com/fastai/fastpages http://hamel.io Semantic Code Search: https://medium.com/@hamelhusain/semantic-code-search-3cd6d244a39c Follow: Hamel Husain: https://twitter.com/HamelHusain http://hamel.io Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
49:33
May 10, 2020
Robert Bracco | Learning to Learn | Approaching Fast.ai Materials, Kaggle & Blogging
Video version: https://youtu.be/CYYvQ-5V3xA Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Robert Bracco and Sanyam Bhutani discuss their approach to learning to learn. As you might imagine, they talk a lot about the fast.ai courses and how their approach to learning has evolved as they have approached the materials. Robert has been working on the unofficial fast.ai audio module, and they discuss his journey in that direction and how his learnings have evolved. Links: Previous Interview: https://www.youtube.com/watch?v=k-gZAyg5ib8 fast.ai courses: https://course.fast.ai Follow: Robert Bracco: https://twitter.com/madeupmasters Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:58:15
May 7, 2020
Pablo Samuel Castro | ML Research, Google Brain & Creative AI | Learning ML w the community | LatinX
Video Version: https://youtu.be/muiM5SQxTIA Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Dr. Pablo Samuel Castro, who's currently a staff research software developer at Google and is working with the Google Brain team in Montreal with a focus on reinforcement learning and machine learning applied to music and creativity. They talk all about these themes in this interview: reinforcement learning and using AI in creative domains. Pablo is a musician himself, and they discuss his view on using AI to replace or enhance human creativity in domains outside of technical focus. They also discuss how Pablo approaches research problems, his take on research, and his pipeline for working on new problems. Personal Note: This interview definitely has a lot of interesting pieces of advice around approaching research problems IMO & I hope you find it fascinating. Follow: PSC: https://twitter.com/pcastr https://www.linkedin.com/in/pablo-samuel-castro-2113641b Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
59:17
May 3, 2020
Daniel Bourke | Learning to Learn | Creating AI Content | Fitness & Machine Learning
Video Version: https://youtu.be/r5_SuLF5UWY Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Daniel Bourke, who has had a very interesting journey into the field: he created his own AI curriculum and followed it rigorously and with a lot of discipline. Daniel has also been running a YouTube channel where he's been posting about his journey of doing online courses. We talk all about his wonderful journey, how he kept up the discipline that he required throughout this period, and why he decided to create content. We also talk about one very important thing: nutrition, fitness and health. Daniel is an expert on this, if I may, and he shares his advice on how to balance the lifestyle of being ambitious in machine learning, pushing your goals, and becoming a better machine learning practitioner or data scientist while also taking care of your health. Follow: Daniel Bourke: https://twitter.com/mrdbourke Blog: https://www.mrdbourke.com YT: https://www.youtube.com/channel/UCr8O8l5cCX85Oem1d18EezQ https://www.mrdbourke.com/mlcourse/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:33:11
April 30, 2020
Interview w Ines Montani | Spacy, NLP & Open Source Frameworks | Explosion.ai, Thinc.ai & Prodi.gy
Video Version: https://youtu.be/C5DGFSDlMBM Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews a machine learning hero who's been inspiring and empowering many, many future heroes through their open source work: Ines Montani, co-founder of Explosion, the company behind spaCy, Prodigy and thinc.ai. They talk about all of the amazing work that Ines and the team at Explosion have been doing, and her journey into programming, into NLP, and at Explosion, along with open source development and the NLP industry. The work covered includes spaCy, Prodigy, and thinc.ai, the latest framework that Explosion has put out. Personal Note: Yes, that is a lot of ground and, luckily for us, all of it is covered in this single podcast! This has been a treat for me & I hope you enjoy it as much as I did. Links: https://explosion.ai/about https://spacy.io https://prodi.gy https://thinc.ai Follow: Ines Montani: https://twitter.com/_inesmontani https://www.linkedin.com/in/inesmontani/ https://ines.io Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
51:56
April 26, 2020
Suzana Ilić | Democratising AI w Communities | Machine Learning Tokyo | Inclusivity in AI
Video Version: https://youtu.be/TzgHNJN8D3I Subscribe to the newsletter for updates: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Suzana Ilić, who's currently a PhD student in the domain of NLP and also the co-founder and director of Machine Learning Tokyo. In this interview they discuss Suzana's journey into machine learning and how she discovered her passion for machine learning and NLP. They also discuss her efforts to democratize AI through her awesome community efforts and the Machine Learning Tokyo community. Suzana shares her journey as the co-founder of Machine Learning Tokyo and how community building can help you develop many skill sets and grow personally. They discuss an important topic: inclusivity in AI communities, and fostering an environment that encourages beginners while also being inclusive to a huge extent. Machine Learning Tokyo has been a platform for open science, open education and open project development; they discuss all three aspects, including the projects, research, and education that have come out of it. Follow: Suzana Ilić: https://twitter.com/suzatweet https://www.linkedin.com/in/suzanailic/ https://github.com/suzana-ilic Machine Learning Tokyo: https://www.meetup.com/Machine-Learning-Tokyo/ https://mltokyo.ai https://github.com/Machine-Learning-Tokyo Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
36:30
April 23, 2020
CEO of Decision.ai: Dan Becker | What does it take to become a Data Scientist? | Kaggle Learn & Data Science Portfolio
Video Version: https://youtu.be/eEYvgsUeEgw Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Dan Becker, founder and CEO at decision.ai. You might know Dan as the creator of Kaggle Learn, where he democratised machine learning education through the courses he created on kaggle.com/learn. In this episode, they talk all about Dan's journey into the field and his work at Kaggle, where he created Kaggle Learn. They talk a lot about MOOCs (massive open online courses): What does it take to become a data scientist? How can MOOCs come into the picture for you? How can you think of ideas or projects that can help you build a portfolio? They also talk about Dan's new venture, decision.ai, which he has recently started. Dan has been a contributor to Keras and TensorFlow and has done consulting projects for six companies in the Fortune 100. They, of course, talk about how that was helpful in Dan's new journey. Links: https://decision.ai/example https://decision.ai https://www.kaggle.com/learn/overview Follow: Dan Becker: dan@decision.ai https://www.linkedin.com/in/dansbecker/ https://twitter.com/dan_s_becker https://www.kaggle.com/dansbecker Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes, available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
56:12
April 19, 2020
Interview w Mark Landry | Data Science, Kaggle, H2O.ai | AutoML
Chai Time Data Science Playlist: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x Audio (Podcast Version) available here: https://anchor.fm/chaitimedatascience In this episode, Sanyam Bhutani interviews legendary Kaggle Grandmaster: Mark Landry. They talk all about Mark's journey into machine learning: how he got interested in machine learning, how he discovered Kaggle, his addiction to Kaggle, and how his approach on Kaggle has evolved over the years he's been active on the platform. They talk about his journey at h2o.ai, where he's been for a few years now. They also talk about AutoML and reveal an interesting story: AutoML at h2o.ai was originally called AutoMarkLandry. There's also a lot of discussion around data science on and off Kaggle and how to approach Kaggle or data science problems. True to its title, this one is all about data science, Kaggle, and h2o.ai, and Mark's journey in all three. Follow: Mark Landry: https://twitter.com/mark_a_landry https://www.linkedin.com/in/mark-landry-78b863a/ https://www.kaggle.com/mlandry Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:03:49
April 16, 2020
Interview with Dmytro Mishkin | Computer Vision Research | Kaggle, ML & Education
Video Version: https://youtu.be/lWwkbiufwNE Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Dmytro Mishkin, who's currently a computer vision researcher and PhD student at the Czech Technical University in Prague. They talk all about Dmytro's journey into deep learning and into research. Dmytro has put out some amazing research work over the past few years, and they talk about how he approaches research in general, his take on research and his suggestions for aspiring researchers. Dmytro is an active Kaggler and has achieved amazing results in many competitions. This interview covers a lot of ideas around education, research, and practice. Follow: Dmytro Mishkin: https://twitter.com/ducha_aiki http://cmp.felk.cvut.cz/~mishkdmy/ https://www.kaggle.com/oldufo Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
52:40
April 12, 2020
SharpestMinds Team on Learning to Learn | Data Science, Startups & Hiring
Video available here: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this special version of the podcast, Sanyam Bhutani is in conversation with the complete SharpestMinds team: Edouard Harris, Russell Pollari and Jeremie Harris. They talk a lot about learning to learn. The theme, broadly speaking, is learning to learn applied to business, startups and the data science world. They discuss multiple ideas about how to break into the data science field, how you should go about learning, which projects to put on your resume, and how to know when you're really ready for a job. Links: Interview with Ed: https://www.youtube.com/watch?v=69urmSt34Ac SharpestMinds: https://www.sharpestminds.com https://twitter.com/sharpestmindsai?lang=en https://www.linkedin.com/company/sharpestminds/ Follow: Edouard Harris: https://twitter.com/neutronsNeurons https://medium.com/@edouard_harris Russell Pollari: https://twitter.com/russ_poll https://www.linkedin.com/in/russell-pollari-b555895a https://russellpollari.com Jeremie Harris: https://twitter.com/jeremiecharris https://www.linkedin.com/in/jeremieharris https://medium.com/@jeremie_sharpestminds Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:07:08
April 9, 2020
Interview w Sylvain Gugger | fast.ai: The new Framework & course | FastBook & Research at fast.ai
Previous Interview: https://hackernoon.com/interview-with-deep-learning-researcher-at-fast-ai-sylvain-gugger-7cb08fe2ff53 Video available here: https://youtu.be/-3fw9hxiop0 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews another hero from the fast.ai family: Sylvain Gugger, who's a research scientist at fast.ai. In this interview they talk about a lot of exciting upcoming developments from the fast.ai research lab: the framework, the book and the upcoming course. The upcoming course will be released along with the book and the library as a MOOC later, around the middle of this year. They discuss what we can expect from the library, the course and the book, the effort that goes into them, and which topics the new versions of the book and the course will cover. Links: Fast.ai Course: https://course.fast.ai https://twitter.com/fastdotai Book: http://shop.oreilly.com/product/0636920216391.do https://www.amazon.com/Deep-Learning-Coders-fastai-PyTorch/dp/1492045527 Interview with Jeremy Howard: https://www.youtube.com/watch?v=205j37G1cxw Follow: Sylvain Gugger: https://twitter.com/guggersylvain https://sgugger.github.io/pages/about-me.html Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
34:01
April 5, 2020
Interview w Erin LeDell | H2O-AutoML & H2O.ai | Open Source | RLadies & WiMLDS Community
Note: Correction from the intro: Erin is the *co-founder* of R-Ladies Global & Founder of WiMLDS. Chai Time Data Science Playlist: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x In this episode, Sanyam Bhutani interviews one of the inspirations for the complete Machine Learning Community: Dr. Erin LeDell, chief machine learning scientist at H2O.ai. Erin holds a PhD from UC Berkeley, where her research was focused on machine learning and computational stats, and she worked in the software industry before joining H2O. They talk all about her journey into stats and machine learning and her journey at H2O, along with H2O's open source product, H2O AutoML. Erin clarifies the difference between H2O, H2O-3 and H2O AutoML, so please stay tuned if you're curious to find out the answer to that. They discuss her work at H2O.ai, and Erin shares a lot of great advice and opinions about AutoML and software tools for data scientists, or humans generally speaking. Erin is the co-founder of R-Ladies Global and the founder of Women in Machine Learning and Data Science (WiMLDS). They talk about her contributions to the community and how outsiders or newcomers can contribute to it. Links: http://wimlds.org https://rladies.org H2O: https://www.h2o.ai H2O Tutorials: http://docs.h2o.ai/h2o-tutorials/latest-stable/ https://github.com/h2oai/h2o-3 Follow: Erin LeDell: https://twitter.com/ledell https://www.linkedin.com/in/erin-ledell/ https://github.com/ledell Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
57:04
April 2, 2020
Interview with Russ Wolfinger | Statistics, Data Science & Kaggle | NFL Big Data Bowl #14 Pos Sol
Video available here: https://youtu.be/akYeBUTXmT4 Subscribe to the newsletter for updates: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Dr. Russ Wolfinger, Director of scientific discovery and genomics at SAS, a department which he started. In this interview, they talk all about his journey into the field and how his stats and software development work evolved over the years. Russ is also a Kaggle Grandmaster in the competitions tier. They talk a lot about Kaggle: his approach to Kaggle, his viewpoints on Kaggle, and his best advice for new joiners and beginners. This interview is an amazing intersection of statistics, data science, and data science applied to Kaggle and the real world. They also talk a lot about Kaggle as a platform for learning data science and applying your skills, and how to get better at Kaggle and data science broadly speaking. So thank you to Russ for sharing all of that amazing advice! Follow: Dr. Russ Wolfinger: https://www.linkedin.com/in/russ-wolfinger-5ab7999/ https://www.kaggle.com/sasrdw Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:14:17
March 29, 2020
Interview with Sergey Kolesnikov | Catalyst: PyTorch Framework for DL & RL | Open Source, Soft. Engg & Community
Audio (Podcast Version) available here: https://anchor.fm/chaitimedatascience Subscribe to the newsletter for updates: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews a researcher, practitioner and open source contributor: Sergey Kolesnikov, creator of Catalyst, a deep learning and reinforcement learning framework based on PyTorch. In this interview, they talk all about Sergey's journey into the field of machine learning and reinforcement learning, his thoughts about open source development, and the story of Catalyst: how its development started, its current features, its ecosystem, and what it can support. They also talk a lot about community in machine learning, including the Open Data Science community. Sergey shares a lot of great advice about software engineering and open source development as well. Follow: Sergey Kolesnikov https://twitter.com/scitator?lang=en https://t.co/ahXByAInFI?amp=1 Catalyst: https://twitter.com/catalyst_core https://github.com/catalyst-team/catalyst Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:14:44
March 26, 2020
Interview with "Inversion": Walter Reade | Data Science at Kaggle | Becoming a Data Scientist & Kaggle Grandmaster
Video available here: https://youtu.be/OoB_LQpgDCk Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews a data scientist from the other side of the Kaggle world, a data scientist at Kaggle: Walter Reade, famously known as "Inversion" on Kaggle. Walter was the first Kaggle Discussions Grandmaster. In this interview, they talk all about his journey into data science and how it started via Kaggle. Walter holds a PhD in chemical engineering. This interview also talks a lot about the science in data science, how Walter has approached problems over the years, and what data science looks like on the other side of Kaggle. Personal Note: I feel this is a very special interview in the sense that this time it's a Kaggle Grandmaster who is part of Kaggle's team. Follow: Walter Reade: https://twitter.com/walterreade https://www.linkedin.com/in/reade/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
45:43
March 22, 2020
Interview with Parul Pandey | Getting Started with Data Science & Blogging | Women in Data Science
Chai Time Data Science Playlist: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x In this episode, Sanyam Bhutani interviews Parul Pandey, data evangelist at h2o.ai, and they talk all about what her role as a data evangelist means and her work at h2o. Personal Note: I've been a fan of her blog posts, and this podcast is all about her secrets to blogging along with all of the amazing things she's been contributing to, including meetups, evangelising, and sharing amazing things on LinkedIn and Twitter. They discuss her journey into the field and her advice to future aspirants. Parul is a great contributor to the Women in Data Science community, and they discuss WiDS and how others who recognize the community can contribute to it. Follow: Parul Pandey: https://www.linkedin.com/in/parulpandeyindia/?originalSubdomain=in https://twitter.com/pandeyparul https://t.co/qyR0tpZVnA?amp=1 WiDS Hyderabad: https://twitter.com/WiMLDS_HYD Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
59:26
March 19, 2020
Interview with Christof Henkel | Google Quest Q&A Labelling Comp 2nd Pos Sol | Rapids.ai & Kaggle
Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Kaggle Grandmaster "Dieter": Christof Henkel. They talk a lot about his previous life working as a construction worker, how he got interested in Kaggle, his Kaggle journey, his Kaggle pipeline, and his recent second position solution to the Google QUEST competition. They also discuss Christof's new job at rapids.ai, where he has just joined as a senior deep learning data scientist. Links: https://rapids.ai https://github.com/rapidsai https://www.kaggle.com/c/google-quest-challenge/discussion/129978 Referenced Interviews: The Zoo: https://youtu.be/_Srv0bKmfjY Even Oldridge: https://youtu.be/-WzXIV8P_Jk Follow: Christof Henkel: https://twitter.com/kagglingdieter https://www.linkedin.com/in/dr-christof-henkel-766a54ba/ https://www.kaggle.com/christofhenkel Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: https://sanyambhutani.com/tag/chaitimedatascience/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
52:57
March 15, 2020
Interview w John Miller | NFL 1st and Future: Analytics Winning Sol | "Real World Data Sci" & Kaggle
Chai Time Data Science Playlist: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x In this episode, Sanyam Bhutani interviews Kaggle Master & Senior Data Scientist: John Miller. John is one of the guests on the show who started working in data science even before it was called data science. In this interview, they talk about his journey and how data science and his work have evolved. Over the years, John has worked in different roles, and they talk about all of them, including his current work at h2o.ai. They also discuss his winning solution to the NFL 1st and Future Analytics Kaggle competition. John created real-world impact via a Kaggle competition: he won the competition in the previous year too, and his findings led to the change of a rule. This year John has won the competition again, and they talk about his findings about the NFL and his discoveries while working on the competition. Links: https://www.kaggle.com/jpmiller/nfl-1standfuture-report https://www.linkedin.com/pulse/would-your-machine-learning-model-hold-up-court-john-miller/ https://www.kaggle.com/c/nfl-playing-surface-analytics/discussion/126164 https://www.kaggle.com/c/nfl-playing-surface-analytics/discussion/125977 Follow: John Miller: https://twitter.com/johnmillertx https://www.linkedin.com/in/johnmiller/ https://www.kaggle.com/jpmiller Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
48:53
March 12, 2020
Interview with Tarin Clanuwat | Classical Japanese Literature & ML | Kuzushiji recognition kaggle comp
Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Dr. Tarin Clanuwat, who's currently an assistant professor at the National Institute of Informatics in Japan. Tarin holds a PhD in Japanese literature and has followed a traditional path in classical Japanese literature. Tarin has been doing some amazing research around Japanese literature, including her work on Kuzushiji. They talk all about her background, her journey in classical literature, her journey in machine learning, and her current research, which is at the intersection of both. Links: https://www.youtube.com/watch?v=maR9ibJ2r7g https://www.kaggle.com/c/kuzushiji-recognition https://scholar.google.com/citations?user=oGpFVUUAAAAJ&hl=en Follow: Tarin: https://twitter.com/tkasasagi/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every Sunday, Thursday available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
36:02
March 8, 2020
08: Where to go from here, General fast.ai advice
This episode reviews Lesson 8 from fast.ai Part 1, 2019 and the Things Jeremy says to do. About: The motivation behind the 3-4 min video/audio summaries is to allow our fellow fast.ai family members to review the lectures from Part 1, 2019 and "Things Jeremy Says to do" in a 3 min format. Jeremy Howard mentions many pearls of wisdom that, many thanks to Robert Bracco, author of "Things Jeremy Howard says to do", are now also available in this format. Reminder Note: This series is not a replacement in any format for the fast.ai lectures. It's supposed to act as supplementary material for the course. Links: Take the course here: https://course.fast.ai Things Jeremy Says to do thread: https://forums.fast.ai/t/things-jeremy-says-to-do/36682 Follow: fast.ai: http://twitter.com/fastdotai Jeremy Howard: http://twitter.com/jeremyphoward Robert Bracco: https://twitter.com/MadeUpMasters Sanyam Bhutani: http://twitter.com/bhutanisanyam1
10:04
March 7, 2020
07: fast.ai Lesson-7 ResNet; U-Net; GANs | fast.ai 2019 & Things Jeremy Howard says to do
This episode reviews Lesson 7 from fast.ai Part 1, 2019 and the Things Jeremy says to do. About: The motivation behind the 3-4 min video/audio summaries is to allow our fellow fast.ai family members to review the lectures from Part 1, 2019 and "Things Jeremy Says to do" in a 3 min format. Jeremy Howard mentions many pearls of wisdom that, many thanks to Robert Bracco, author of "Things Jeremy Howard says to do", are now also available in this format. Reminder Note: This series is not a replacement in any format for the fast.ai lectures. It's supposed to act as supplementary material for the course. Links: Take the course here: https://course.fast.ai Things Jeremy Says to do thread: https://forums.fast.ai/t/things-jeremy-says-to-do/36682 Follow: fast.ai: http://twitter.com/fastdotai Jeremy Howard: http://twitter.com/jeremyphoward Robert Bracco: https://twitter.com/MadeUpMasters Sanyam Bhutani: http://twitter.com/bhutanisanyam1
07:47
March 7, 2020
06: fast.ai Lesson-6 CNN Deep Dive; Ethics | fast.ai 2019 & Things Jeremy Howard says to do
This episode reviews Lesson 6 from fast.ai Part 1, 2019 and the Things Jeremy says to do. About: The motivation behind the 3-4 min video/audio summaries is to allow our fellow fast.ai family members to review the lectures from Part 1, 2019 and "Things Jeremy Says to do" in a 3 min format. Jeremy Howard mentions many pearls of wisdom that, many thanks to Robert Bracco, author of "Things Jeremy Howard says to do", are now also available in this format. Reminder Note: This series is not a replacement in any format for the fast.ai lectures. It's supposed to act as supplementary material for the course. Links: Take the course here: https://course.fast.ai Things Jeremy Says to do thread: https://forums.fast.ai/t/things-jeremy-says-to-do/36682 Follow: fast.ai: http://twitter.com/fastdotai Jeremy Howard: http://twitter.com/jeremyphoward Robert Bracco: https://twitter.com/MadeUpMasters Sanyam Bhutani: http://twitter.com/bhutanisanyam1
06:52
March 7, 2020
05: fast.ai Lesson 5: Backprop; Neural Nets from scratch | fast.ai 2019 & Things Jeremy Howard says to do
This episode reviews Lesson 5 from fast.ai Part 1, 2019 and the Things Jeremy says to do. About: The motivation behind the 3-4 min video/audio summaries is to allow our fellow fast.ai family members to review the lectures from Part 1, 2019 and "Things Jeremy Says to do" in a 3 min format. Jeremy Howard mentions many pearls of wisdom that, many thanks to Robert Bracco, author of "Things Jeremy Howard says to do", are now also available in this format. Reminder Note: This series is not a replacement in any format for the fast.ai lectures. It's supposed to act as supplementary material for the course. Links: Take the course here: https://course.fast.ai Things Jeremy Says to do thread: https://forums.fast.ai/t/things-jeremy-says-to-do/36682 Follow: fast.ai: http://twitter.com/fastdotai Jeremy Howard: http://twitter.com/jeremyphoward Robert Bracco: https://twitter.com/MadeUpMasters Sanyam Bhutani: http://twitter.com/bhutanisanyam1
05:11
March 7, 2020
04: fast.ai Lesson-4 NLP:Tabular Data; Recsys | fast.ai 2019 & Things Jeremy Howard says to do
This episode reviews Lesson 4 from fast.ai Part 1, 2019 and the Things Jeremy says to do. About: The motivation behind the 3-4 min video/audio summaries is to allow our fellow fast.ai family members to review the lectures from Part 1, 2019 and "Things Jeremy Says to do" in a 3 min format. Jeremy Howard mentions many pearls of wisdom that, many thanks to Robert Bracco, author of "Things Jeremy Howard says to do", are now also available in this format. Reminder Note: This series is not a replacement in any format for the fast.ai lectures. It's supposed to act as supplementary material for the course. Links: Take the course here: https://course.fast.ai Things Jeremy Says to do thread: https://forums.fast.ai/t/things-jeremy-says-to-do/36682 Follow: fast.ai: http://twitter.com/fastdotai Jeremy Howard: http://twitter.com/jeremyphoward Robert Bracco: https://twitter.com/MadeUpMasters Sanyam Bhutani: http://twitter.com/bhutanisanyam1
04:41
March 7, 2020
02: fast.ai Lesson-2 Production & SGD From Scratch | fast.ai 2019 & Things Jeremy Howard says to do
This episode reviews Lesson 2 from fast.ai Part 1, 2019 and the Things Jeremy says to do. About: The motivation behind the 3-4 min video/audio summaries is to allow our fellow fast.ai family members to review the lectures from Part 1, 2019 and "Things Jeremy Says to do" in a 3 min format. Jeremy Howard mentions many pearls of wisdom that, many thanks to Robert Bracco, author of "Things Jeremy Howard says to do", are now also available in this format. Reminder Note: This series is not a replacement in any format for the fast.ai lectures. It's supposed to act as supplementary material for the course. Links: Take the course here: https://course.fast.ai Things Jeremy Says to do thread: https://forums.fast.ai/t/things-jeremy-says-to-do/36682 Follow: fast.ai: http://twitter.com/fastdotai Jeremy Howard: http://twitter.com/jeremyphoward Robert Bracco: https://twitter.com/MadeUpMasters Sanyam Bhutani: http://twitter.com/bhutanisanyam1
05:16
March 7, 2020
03: fast.ai Lesson-3 Multi-label; SGD from scratch | fast.ai 2019 & Things Jeremy Howard says to do
This episode reviews Lesson 3 from fast.ai Part 1, 2019 and the Things Jeremy says to do. About: The motivation behind the 3-4 min video/audio summaries is to allow our fellow fast.ai family members to review the lectures from Part 1, 2019 and "Things Jeremy Says to do" in a 3 min format. Jeremy Howard mentions many pearls of wisdom that, many thanks to Robert Bracco, author of "Things Jeremy Howard says to do", are now also available in this format. Reminder Note: This series is not a replacement in any format for the fast.ai lectures. It's supposed to act as supplementary material for the course. Links: Take the course here: https://course.fast.ai Things Jeremy Says to do thread: https://forums.fast.ai/t/things-jeremy-says-to-do/36682 Follow: fast.ai: http://twitter.com/fastdotai Jeremy Howard: http://twitter.com/jeremyphoward Robert Bracco: https://twitter.com/MadeUpMasters Sanyam Bhutani: http://twitter.com/bhutanisanyam1
05:32
March 7, 2020
00 fast.ai 2019 Summaries & Things Jeremy Howard says to do
This episode is an introduction to the Mini-Chai Time Data Science series, about fast.ai summaries from Part 1, 2019 and a collection of things Jeremy Howard says to do. About: The motivation behind the 3-4 min video/audio summaries is to allow our fellow fast.ai family members to review the lectures from Part 1, 2019 and "Things Jeremy Says to do" in a 3 min format. Jeremy Howard mentions many pearls of wisdom that, many thanks to Robert Bracco, author of "Things Jeremy Howard says to do", are now also available in this format. Reminder Note: This series is not a replacement in any format for the fast.ai lectures. It's supposed to act as supplementary material for the course. Links: Take the course here: https://course.fast.ai Things Jeremy Says to do thread: https://forums.fast.ai/t/things-jeremy-says-to-do/36682 Follow: fast.ai: http://twitter.com/fastdotai Jeremy Howard: http://twitter.com/jeremyphoward Robert Bracco: https://twitter.com/MadeUpMasters Sanyam Bhutani: http://twitter.com/bhutanisanyam1
10:24
March 7, 2020
01 fast.ai Lesson-1 Image Classification | fast.ai 2019 & Things Jeremy Howard says to do
This episode summarises Lesson 1: Image Classification from fast.ai Part 1, 2019 along with the things Jeremy says to do. About: The motivation behind the 3-4 min video/audio summaries is to allow our fellow fast.ai family members to review the lectures from Part 1, 2019 and "Things Jeremy Says to do" in a 3 min format. Jeremy Howard mentions many pearls of wisdom that, many thanks to Robert Bracco, author of "Things Jeremy Howard says to do", are now also available in this format. Reminder Note: This series is not a replacement in any format for the fast.ai lectures. It's supposed to act as supplementary material for the course. Links: Take the course here: https://course.fast.ai Things Jeremy Says to do thread: https://forums.fast.ai/t/things-jeremy-says-to-do/36682 Follow: fast.ai: http://twitter.com/fastdotai Jeremy Howard: http://twitter.com/jeremyphoward Robert Bracco: https://twitter.com/MadeUpMasters Sanyam Bhutani: http://twitter.com/bhutanisanyam1
05:40
March 7, 2020
Interview with Marios Michailidis | What does it take to become #1 on Kaggle | DSB 2019, 14th Pos Sol
Previous Interview Link: https://sanyambhutani.com/interview-with-kaggle-competitions-grandmaster--kazanova--rank--3---dr--marios-michailidis/ Chai Time Data Science Playlist: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x In this episode, Sanyam Bhutani interviews Kaggle Legend and GM Marios Michailidis. They continue talking a lot about Kaggle, how Kaggle has helped Marios in his data science journey, his data science work at H2O.ai, and the projects he is contributing to. They also discuss his recent gold-winning solution to the Data Science Bowl 2019 competition; the approaches shared by Marios apply generally, even outside of that competition. They also touch upon a very important topic that isn't discussed as much on this podcast: the personal side of things, and the personal sacrifices it takes to really become the best in the world, like the sacrifices it took Marios to become number one on Kaggle in both competitions and discussions. Links: Solution by Marios' team: https://www.kaggle.com/c/data-science-bowl-2019/discussion/127221 DSB 2019 Competition: https://www.kaggle.com/c/data-science-bowl-2019/overview H2O blog: https://www.h2o.ai/blog Driverless AI: https://www.h2o.ai/products/h2o-driverless-ai/ Follow: Marios Michailidis: https://twitter.com/stacknet_?lang=en https://www.linkedin.com/in/mariosmichailidis/ https://www.kaggle.com/kazanova Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
54:32
March 5, 2020
Interview with fast.ai hero: Radek Osmulski | Fast.ai, Learning to Learn | Machine Learning, Kaggle & Blogging
Video version available here: https://youtu.be/4h41v07bYYI Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews a machine learning hero to the complete fast.ai community: Radek Osmulski. Personal Note: I think this is one of the most honest and best interviews in terms of transparent advice on Kaggle, learning to learn, and how to approach machine learning problems or materials. Radek shares his journey with complete honesty: how he went about learning the fast.ai materials, how he went on to smash the Kaggle competitions he participated in, and how he learned to learn across fast.ai and beyond in the machine learning world. They also talk a lot about learning to learn, fast.ai, and Kaggle, all three together and individually as well. Follow: Radek: https://twitter.com/radekosmulski https://medium.com/@radekosmulski https://www.kaggle.com/radek1 https://www.linkedin.com/in/radek-osmulski-6b935794/?originalSubdomain=pl Sanyam Bhutani: https://twitter.com/bhutanisanyam1 About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every Sunday, Thursday available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:05:31
March 1, 2020
Interview with Dr. Ashrith Barthur | Cyber-Security & Anti-Money Laundering | Applied AI & H2O AI
Chai Time Data Science Playlist: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x In this episode, Sanyam Bhutani interviews Dr. Ashrith Barthur, Chief Security Scientist at H2O.ai. As you can guess, they talk all about cybersecurity and, more broadly, AI in this episode. Ashrith has a background in cybersecurity and has done a lot of interesting research in the field; he is also currently doing applied research at H2O.ai. This is a first on this podcast series: they discuss cybersecurity in general and its applications in AI, including anti-money laundering and the applications that H2O is working on in the cybersecurity domain. Links: Webinars: https://www.h2o.ai/webinars/ H2O blog: https://www.h2o.ai/blog Driverless AI: https://www.h2o.ai/products/h2o-driverless-ai/ Follow: Ashrith Barthur: https://twitter.com/cyberbaggage https://www.linkedin.com/in/abarthur/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
44:06
February 27, 2020
Interview with Sebastian Raschka | Statistics, Open Source & ML Research | Python for ML Book
Video Version available here: https://youtu.be/beSLA-wO2T4 Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Dr. Sebastian Raschka, currently an assistant professor of Statistics at the University of Wisconsin, Madison and the author of the Python Machine Learning book. Sebastian has a background in biology and holds a PhD in quantitative biology, biochemistry, and molecular biology. In this interview, they talk all about his journey into the intersection of biology, machine learning, statistics, open source, and machine learning research. Yes, these are all topics that Sebastian is currently involved in. They also talk about his journey with writing the book, his current research interests and efforts, and another area that Sebastian is active in: open source. Sebastian is an assistant professor at UW, but he shares advice that applies to any student, at university or otherwise, in the field of machine learning who is looking to learn anything. Links: Python for ML Book: https://sebastianraschka.com/books.html#python-machine-learning-3rd-edition Research: https://scholar.google.com/citations?user=X4RCC0IAAAAJ&hl=en Follow: Sebastian Raschka: https://twitter.com/rasbt https://github.com/rasbt https://www.linkedin.com/in/sebastianraschka/ https://sebastianraschka.com Sanyam Bhutani: https://twitter.com/bhutanisanyam1 About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every Sunday, Thursday available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:08:16
February 23, 2020
Navdeep Gill | Software Engineering & Data Science | Machine Learning Interpretability | Open Source
Chai Time Data Science Playlist: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x In this episode, Sanyam Bhutani interviews Navdeep Gill: Senior Data Scientist and Software Engineer at H2O.ai. In this interview, they talk all about the intersection of two fields, data science and software engineering, and best practices for both. They also discuss the question of how much software engineering data scientists should really know. They talk about Navdeep's journey into machine learning and machine learning interpretability, and his journey at H2O.ai. They also talk a lot about machine learning interpretability (MLI): Navdeep's thoughts on it, as well as MLI inside H2O's products. Links: An Intro to MLI Book: https://www.h2o.ai/wp-content/uploads/2019/08/An-Introduction-to-Machine-Learning-Interpretability-Second-Edition.pdf Driverless AI: https://www.h2o.ai/products/h2o-driverless-ai/ H2O-3: http://docs.h2o.ai/h2o/latest-stable/h2o-docs/welcome.html Follow: Navdeep Gill: https://twitter.com/Navdeep_Gill_ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
59:19
February 20, 2020
Interview with Zachary Mueller | Fast.ai: The course and New Library | SGs and Top Down Learning
Video Version available here: https://youtu.be/AXr8pzXXUDQ Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani Correction, with apologies: Zach is a student at the University of West Florida, not the University of Washington. In this episode, Sanyam Bhutani interviews his peer from the fast.ai community: Zachary Mueller, a student at the University of West Florida, currently pursuing his bachelor's degree in software design and development. In this interview they talk all about fast.ai the course and fast.ai V2, the upcoming library, along with Zach's experience with fast.ai, his journey into deep learning, and the projects that he has built while going through the course. They also talk about a study group that Zach has been running for a few weeks, which builds on top of fast.ai V2, the new library for which the course is yet to come out. They also talk a lot about the top-down learning approach and how you can go about learning fast.ai. Links: Study Group by Zach: https://forums.fast.ai/t/a-walk-with-fastai2-study-group-and-online-lectures-megathread/59929 Practical-Deep-Learning-for-Coders-2.0: https://github.com/muellerzr/Practical-Deep-Learning-for-Coders-2.0 Zach's YouTube Channel: https://www.youtube.com/channel/UCmKoQOD8uBqsRS8XDdSgrlQ Follow: Zachary Mueller: https://twitter.com/TheZachMueller Sanyam Bhutani: https://twitter.com/bhutanisanyam1 About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every Sunday, Thursday available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:02:34
February 16, 2020
Interview with Sr. Director of Product at H2O.ai: Patrick Hall | Machine Learning, H2O.ai & Machine Learning Interpretability
Chai Time Data Science Playlist: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x In this episode, Sanyam Bhutani interviews Patrick Hall, Sr. Director of Product at H2O.ai. Patrick has a background in math and has completed an MS in Analytics. In this interview they talk all about Patrick's journey into ML and ML interpretability, his journey at H2O.ai, and how his work has evolved over the years. They talk a lot about MLI, ML explainability, and model debugging. They also talk about how these ideas are implemented inside H2O.ai and how someone can bring these ideas to their own pipelines. Links: "Real-World Strategies for Model Debugging": https://medium.com/@jphall_22520/strategies-for-model-debugging-aa822f1097ce An Intro to MLI Book: https://www.h2o.ai/wp-content/uploads/2019/08/An-Introduction-to-Machine-Learning-Interpretability-Second-Edition.pdf "Why you should care about debugging machine learning models": https://www.oreilly.com/radar/why-you-should-care-about-debugging-machine-learning-models/ "Proposed Guidelines for the Responsible Use of Explainable Machine Learning": https://arxiv.org/pdf/1906.03533.pdf Follow: Patrick Hall: https://twitter.com/jpatrickhall https://www.linkedin.com/in/jpatrickhall/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
58:06
February 13, 2020
"Anokas": Mikel Bober-Irizar | Becoming The Youngest Kaggle Grandmaster | ML For Japanese Literature | Kaggle
Video version here: https://youtu.be/maR9ibJ2r7g Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the Youngest Kaggle Grandmaster, "Anokas": Mikel Bober-Irizar, for the second time. In this interview, they talk all about Mikel's journey into Kaggle and ML, and his current life as a CS student. Note: He's 18 years old at the time of publishing and is still the youngest person to have achieved the GM title in the Competitions tier, which he did at the age of 17, as well as the youngest person to achieve the triple master title. They discuss his Kaggle journey, his path into ML, and his ML research, where he has published very interesting work and even organised a Kaggle competition at the intersection of ML and Japanese literature. Links: https://hackernoon.com/interview-with-the-youngest-kaggle-grandmaster-mikel-bober-irizar-anokas-17dfd2461070 https://www.kaggle.com/anokas https://www.kaggle.com/c/landmark-retrieval-challenge/discussion/57855 https://www.kaggle.com/c/kuzushiji-recognition https://scholar.google.com/citations?user=UTgURoAAAAAJ Follow: Mikel Bober-Irizar: https://twitter.com/mikb0b?lang=en Sanyam Bhutani: https://twitter.com/bhutanisanyam1 About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every Sunday, Thursday available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
59:15
February 9, 2020
The Story of Kaggle & Kaggle's Evolution | Interview with the CEO of Kaggle: Anthony Goldbloom
Video Version available here: https://youtu.be/jw2Z-IMyFYw Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the CEO of Kaggle: Anthony Goldbloom. Anthony has a background in economics and has been working on the other side of Kaggle, as its CEO and co-founder, for about a decade now. They talk about his journey as the co-founder of Kaggle, and Kaggle's own journey over the past few years: how Kaggle has evolved, and how the team and its perspective on Kaggle have evolved from the start through to after its acquisition by Google. This interview shares a lot of insights into the behind-the-scenes work that goes into hosting a competition on Kaggle, or even putting out a new course on Kaggle Learn. They also discuss Kaggle's future plans and what's next and exciting for us as frequent Kagglers. Follow: Anthony Goldbloom: https://twitter.com/antgoldbloom Sanyam Bhutani: https://twitter.com/bhutanisanyam1 About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every Sunday, Thursday available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:05:05
February 6, 2020
Interview with Jason Antic | DeOldify | Fast.ai & NoGAN | Machine Learning & Software Engineering
In this episode, Sanyam Bhutani interviews Jason Antic, the creator of and researcher behind DeOldify, for the second time. In this interview, they talk all about Jason's software engineering experience and how he applies it to deep learning, his journey with DeOldify, how he came up with the idea, and how he kept improving it. They also talk more broadly about software engineering practices and how those helped Jason while working on DeOldify, as well as fast.ai and the top-down way of learning. Reminder: This interview and all future ones will have fixed subtitles for the non-native English speaking (also lazy) audience, along with blog posts to be released at: sanyambhutani.com Video Version available here: https://youtu.be/A5Cq8SWudts Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani Links: https://sanyambhutani.com/interview-with-the-creator-of-deoldify--fast-ai-student--jason-antic/ https://github.com/jantic/DeOldify https://www.fast.ai/2019/05/03/decrappify/ https://www.instagram.com/america_in_color/ Follow: Jason Antic: https://twitter.com/citnaj Sanyam Bhutani: https://twitter.com/bhutanisanyam1 About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every Sunday, Thursday available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:05:12
February 2, 2020
Rohan Rao | Numbers, Data Science & Kaggle | ASHRAE - Great Energy Predictor 2nd Pos Sol
Chai Time Data Science Playlist: https://www.youtube.com/playlist?list=PLLvvXm0q8zUbiNdoIazGzlENMXvZ9bd3x Audio (Podcast Version) available here: https://anchor.fm/chaitimedatascience Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews Kaggle Grand Master and Data Scientist at H2O.ai: Rohan Rao. Rohan is a Kaggle Grand Master in the competition tier and a data scientist. He has represented India not just in data science, but also in Sudoku and puzzles. He's a seven-time and current national Sudoku champion, and the first Indian to be ranked in the top 10 at the World Championship, in 2012, where he secured the eighth position. Currently, his world rank is 17 and he has secured two podium finishes at the Asian Sudoku Championship. He's also a five-time national puzzle champion, with his current rank being 44th. In this interview they talk all about his journey into data science, data, and numbers, and his journey into the world of competitive sports, both on and off Kaggle. For the off-Kaggle journey, they discuss his data science career; for the on-Kaggle journey, they discuss how his approach to Kaggle has changed and evolved over the years, his current thoughts on the platform, and his tips and advice. They also discuss his recent eighth gold medal on Kaggle and his team's second-position finish in the ASHRAE Great Energy Predictor competition.
Links: Interview with CPMP: https://www.youtube.com/watch?v=wqHlAOFSFuQ 2nd Pos Solution: https://www.kaggle.com/c/ashrae-energy-prediction/discussion/123481 KaggleDaysMeetup: https://sanyambhutani.com/kaggledaysmeetup-bangalore--recap-by-dsnet-team/ Follow: Rohan Rao: https://en.wikipedia.org/wiki/Rohan_Rao https://twitter.com/vopani?lang=en https://www.kaggle.com/rohanrao https://www.linkedin.com/in/vopani/?originalSubdomain=in Sanyam Bhutani: https://twitter.com/bhutanisanyam1 Blog: sanyambhutani.com About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes available as Video, Podcast, and blogposts. Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:42:25
January 30, 2020
Dmitry Gordeev & Philipp Singer | What does it take to win a Kaggle Comp? | NFL Data Bowl Win Sol
Reminder: This interview and all future ones will have fixed subtitles (on YouTube) for the non-native English speaking (also lazy) audience, along with blog posts to be released at: sanyambhutani.com Video posted here: https://youtu.be/_Srv0bKmfjY Subscribe here to the newsletter: https://tinyletter.com/sanyambhutani In this episode, Sanyam Bhutani interviews the two lions from the Kaggle team "The Zoo": Dmitry Gordeev, also known as "dott" on Kaggle, and Philipp Singer, "Psi" on Kaggle. In this interview, they first talk about Dmitry's journey into data science and on Kaggle. In the latter half, they talk about "The Zoo": the team's journey, both of their journeys on Kaggle, their approach to teaming up and to winning a competition, and what it really takes to win a Kaggle competition. They also talk about their recent win in the NFL Big Data Bowl competition. Links: Interview with Psi: https://youtu.be/7sh5QrUIAHI Interview with Giba: https://youtu.be/MpYeDKw8EOg Interview with CPMP: https://youtu.be/wqHlAOFSFuQ NFL Big Data Bowl Winning Sol Writeup: https://www.kaggle.com/c/nfl-big-data-bowl-2020/discussion/119400 Follow: Dmitry Gordeev: http://twitter.com/dott1718 https://www.kaggle.com/dott1718 Philipp Singer: https://twitter.com/ph_singer https://www.philippsinger.info/ Sanyam Bhutani: https://twitter.com/bhutanisanyam1 About: http://chaitimedatascience.com/ A show for Interviews with Practitioners, Kagglers & Researchers and all things Data Science hosted by Sanyam Bhutani. You can expect weekly episodes every Sunday, Thursday available as Video, Podcast, and blogposts. If you'd like to support the podcast: https://www.patreon.com/chaitimedatascience Intro track: Flow by LiQWYD https://soundcloud.com/liqwyd
1:17:32