PyDataMCR
By Joseph Allen
Mar 29, 2019
Episode 18 - Functional Programming and Data Engineering with James Fielder
Welcome to PyDataMCR Episode 18. James Fielder, Jennifer and John talk about functional programming, typing and data engineering.
Sponsors
Cathcart Associates - cathcartassociates.com/
Horsefly Analytics - horseflyanalytics.com/
Our Collaborators
HER+data - meetup.com/HER-Data-MCR/
Pyladies - twitter.com/pyladiesnwuk
Django Girls - djangogirls.org/
Python NW - meetup.com/Python-North-West-Meetup/
Open Data Manchester - opendatamanchester.org.uk/
Lambda Lounge - http://lambdalounge.org.uk/
Resources
James Fielder
LinkedIn - https://www.linkedin.com/in/jcfielder/
Cox Automotive - https://www.coxauto.co.uk/our-brands/data-solutions/
Lambda Lounge - http://www.lambdalounge.org.uk/
mypy - http://mypy-lang.org/
Dagster - https://github.com/dagster-io/dagster
Marquez - https://marquezproject.github.io/marquez/
Waimak - https://github.com/CoxAutomotiveDataSolutions/waimak
Mentions
Claire McDonald - https://twitter.com/squarejazz
Gemma Cameron - https://twitter.com/ruby_gem
Li Haoyi - https://twitter.com/li_haoyi / https://www.lihaoyi.com/
Lambda Lounge - https://twitter.com/lambdamcr
Social
Meetup - meetup.com/PyData-Manchester/
Slack - http://bit.ly/35KGOgR
Twitter - @PyDataMCR
Episode 17 - Data Engineering and Game Theory with Eslene Bikoumou
Welcome to PyDataMCR Episode 17. Eslene Bikoumou, Jennifer and John talk about Game Theory and what Data Engineering looks like at a Manchester-based start-up.
Sponsors
Cathcart Associates - cathcartassociates.com/
Horsefly Analytics - horseflyanalytics.com/
Our Collaborators
HER+data - meetup.com/HER-Data-MCR/
Pyladies - twitter.com/pyladiesnwuk
Django Girls - djangogirls.org/
Python NW - meetup.com/Python-North-West-Meetup/
Open Data Manchester - opendatamanchester.org.uk/
Lambda Lounge - http://lambdalounge.org.uk/
Resources
Eslene Bikoumou
LinkedIn - https://www.linkedin.com/in/eslene-bikoumou/
Agent Software - https://www.agentsoftware.net/
Mentions
Phillip Bates - https://www.linkedin.com/in/phillip-bates-1a33a94b/
Dr James Burridge - https://www.port.ac.uk/about-us/structure-and-governance/our-people/our-staff/james-burridge
Social
Meetup - meetup.com/PyData-Manchester/
Slack - http://bit.ly/35KGOgR
Twitter - @PyDataMCR
Episode 16 - Academia and back again with Adam Fletcher
Welcome to PyDataMCR Episode 16. Adam Fletcher, Jennifer and John talk about Adam's journey from industry to academia and back again, with some insight into the world of data science consulting.
Sponsors:
Cathcart Associates - cathcartassociates.com/
Horsefly Analytics - horseflyanalytics.com/
Our Collaborators:
HER+data - meetup.com/HER-Data-MCR/
Pyladies - twitter.com/pyladiesnwuk
Django Girls - djangogirls.org/
Python NW - meetup.com/Python-North-West-Meetup/
Open Data Manchester - opendatamanchester.org.uk/
Lambda Lounge - http://lambdalounge.org.uk/
Resources:
Adam Fletcher
LinkedIn - https://www.linkedin.com/in/adamfletcheruk/
Equal Experts - https://www.equalexperts.com/
Social:
Meetup - meetup.com/PyData-Manchester/
Slack - http://bit.ly/35KGOgR
Twitter - @PyDataMCR
Episode 15 - Open Contracts and Volunteering in Tech with Hera
Welcome to PyDataMCR Episode 15. Hera Hussain, Jennifer and John are talking about Open Contracting - what it is and why it is important, as well as Volunteering in Tech - how to get into it and how to get it right.
Sponsors
Cathcart Associates - cathcartassociates.com/
Horsefly Analytics - horseflyanalytics.com/
Our Collaborators:
HER+data - meetup.com/HER-Data-MCR/
Pyladies - twitter.com/pyladiesnwuk
Django Girls - djangogirls.org/
Python NW - meetup.com/Python-North-West-Meetup/
Open Data Manchester - opendatamanchester.org.uk/
Lambda Lounge - http://lambdalounge.org.uk/
Resources:
Hera: https://twitter.com/herahussain ; https://www.linkedin.com/in/herahussain/
Contracts Finder https://www.gov.uk/contracts-finder
Chayn https://chayn.co/
Tech for Good Live https://www.techforgood.live/
Global Shapers https://www.globalshapers.org/hubs/manchester-hub-57f7919c6b-hub
Nighat Dad https://twitter.com/nighatdad @ Digital Rights PK
Mor https://twitter.com/Morchickit @ Open Heroines
https://twitter.com/wethecatalysts @ The Catalyst Project
Cassie https://twitter.com/cassierobinson @ National Lottery
Dan https://twitter.com/dansutch @ Cast
Bex https://twitter.com/rebeccawho @ Tech for Good Live
Julian https://twitter.com/Julianlstar // Sam https://twitter.com/milsomsam @ Open Data Manchester
Emer https://twitter.com/emercoleman @ Federation Community
Dama https://twitter.com/Dama_Yanthy @ Bethnal Green Ventures
Social
Meetup - meetup.com/PyData-Manchester/
Slack - http://bit.ly/35KGOgR
Twitter - @PyDataMCR
Episode 14 - John's Favourite Topic: Recommender Systems
Welcome to PyDataMCR Episode 14. Today Jennifer and John are talking about Recommender Systems, where you can find them, and why they are still so difficult.
Sponsors
Cathcart Associates - cathcartassociates.com/
Horsefly Analytics - horseflyanalytics.com/
Our Collaborators:
HER+data - meetup.com/HER-Data-MCR/
Pyladies - twitter.com/pyladiesnwuk
Django Girls - djangogirls.org/
Python NW - meetup.com/Python-North-West-Meetup/
Open Data Manchester - opendatamanchester.org.uk/
Lambda Lounge - http://lambdalounge.org.uk/
Resources:
Netflix Prize https://en.wikipedia.org/wiki/Netflix_Prize
YouTube Recommendation System https://arxiv.org/abs/1607.07326
Google Recommendation System Course https://developers.google.com/machine-learning/recommendation
Social
Meetup - meetup.com/PyData-Manchester/
Slack - http://bit.ly/35KGOgR
Twitter - @PyDataMCR
Episode 13 - The Joys of Open Data
Welcome to PyDataMCR Episode 13, today we are talking to our friends at Open Data Manchester about how it came to be, and what the past decade has held in store.
Guests
Julian - @Julianlstar
Sam - @milsomsam
http://www.opendatamanchester.org.uk
Sponsors
Cathcart Associates - cathcartassociates.com/
Horsefly Analytics - horseflyanalytics.com/
Our Collaborators:
HER+data - meetup.com/HER-Data-MCR/
Pyladies - twitter.com/pyladiesnwuk
Django Girls - djangogirls.org/
Python NW - meetup.com/Python-North-West-Meetup/
Open Data Manchester - opendatamanchester.org.uk/
Lambda Lounge - http://lambdalounge.org.uk/
Resources:
How can open data support public policy?
https://twitter.com/thomasforth/status/1233103022970081281?s=20
Admiration:
InnovateHer - https://www.innovateher.co.uk/
Tech for Good Live - https://www.techforgood.live
Open Heroines - https://openheroines.org/
Social
Meetup - meetup.com/PyData-Manchester/
Slack - http://bit.ly/35KGOgR
Twitter - @PyDataMCR
Episode 12 - Sentiment Analysis and NLP
Welcome to PyDataMCR Episode 12. This episode is all about Natural Language Processing, specifically Sentiment Analysis. Listen for our take on the approach to the complexity of text data, how we prepare the data, and the tools we can use to tease out sentiment.
Sponsors
LadBible - ladbible.com/
Cathcart Associates - cathcartassociates.com/
Horsefly Analytics - horseflyanalytics.com/
Resources
Tutorials
Natural Language Processing
https://www.kaggle.com/learn/natural-language-processing
Word embeddings for sentiment analysis https://towardsdatascience.com/word-embeddings-for-sentiment-analysis-65f42ea5d26e
Sentiment Analysis in R
https://www.kaggle.com/rtatman/tutorial-sentiment-analysis-in-r
Tools
NLTK - https://www.nltk.org/
SpaCy - https://spacy.io/
TextBlob - https://textblob.readthedocs.io/en/dev/
VADER - https://github.com/cjhutto/vaderSentiment
Google Natural Language API - https://cloud.google.com/natural-language
Azure Text Analytics API https://azure.microsoft.com/en-gb/services/cognitive-services/text-analytics/
Social
Meetup - meetup.com/PyData-Manchester/
Slack - http://bit.ly/35KGOgR
Twitter - @PyDataMCR
Episode 11 - A Year in Review
This month we thought we would take a break from our usual episode format and have ourselves as the guests. Listen to us reflect on our year volunteering with PyDataMCR. We talk about what we did this year, including Google Next! We also talk a little about learning to rank. We realise this is an insider view, so this month's meetup will be an open retro; have a think about what you want from next year.
Request: If you know about recording events for YouTube, and can help us out, feel free to DM us (see our social channels below).
Sponsors
LadBible - ladbible.com/
Cathcart Associates - cathcartassociates.com/
Horsefly Analytics - horseflyanalytics.com/
Our Collaborators:
HER+data - meetup.com/HER-Data-MCR/
Pyladies - twitter.com/pyladiesnwuk
Django Girls - djangogirls.org/
Python NW - meetup.com/Python-North-West-Meetup/
Open Data Manchester - opendatamanchester.org.uk/
Lambda Lounge - http://lambdalounge.org.uk/
What we’ve done this year…
Hacktoberfest - hacktoberfest.digitalocean.com
blog posts - tinyurl.com/tnkzafr
tinyurl.com/urn8twp
tinyurl.com/too86wj
Learning to rank resources
- Reinforcement Learning to Rank with Markov Decision Process - http://bigdatalab.ac.cn/~junxu/publications/SIGIR2017_RL_L2R.pdf
- Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application - arxiv.org/abs/1803.00710 / medium.com/@alitech_2017/unlocking-insights-from-multi-round-searches-with-reinforcement-learning-74f7143acf08 / youtube.com/watch?v=AXa3CW68xks
Matt Crooks Medium: Louvain clustering - medium.com/@DrMattCrooks
Google NEXT
Google NEXT - cloud.withgoogle.com/next/uk/
Cloud Build - cloud.google.com/cloud-build/
Cloud Run - cloud.google.com/run/
Trifacta - trifacta.com
Cloud Dataprep by Trifacta - cloud.google.com/run/
Tool Great Expectations - github.com/great-expectations/great_expectations
Social
Meetup - meetup.com/PyData-Manchester/
Slack - http://bit.ly/35KGOgR
Twitter - @PyDataMCR
Episode 10 - Hacktoberfest Special
Welcome to PyDataMCR episode 10. This episode was recorded throughout our weekend Hacktoberfest event. We ran this event in tandem with PythonNW, DjangoGirls, PyLadiesNW, RLadies and HER+Data, and as such this podcast dips into multiple different communities.
Listen on as we talk to maintainers of open source libraries, contributors and more.
Sponsors
Cathcart Associates - https://www.cathcartassociates.com/
Horsefly Analytics - https://horseflyanalytics.com/
CodeNation - https://wearecodenation.com/
Show Notes - Pick N Mix
Cheuk Ting Ho - twitter.com/cheukting_ho
Pick N Mix Repo - github.com/picknmix/picknmix
Python Sprints - twitter.com/py_sprints
AI Club Gender & Minority - twitter.com/AIClubGenderMin
Show Notes - libreML
Adam - twitter.com/adds68
Chris - twitter.com/cphang909
LibreML - gitlab.com/libreml/libreml
LibreML Twitter - twitter.com/LibreMl
Show Notes - PythonNW
Lucy Bridge - twitter.com/LinuxLucy
Adam Shackleton - twitter.com/Adamshackleton
Python NW - twitter.com/pythonnw
Episode 9 - Smart Meters, more like Dumb Meters Ft. Ellen Talbot
Welcome to PyDataMCR episode 9. Today we are talking to Ellen Talbot, a PhD candidate at the University of Liverpool in the Geographic Data Science Lab. Ellen talks with us about smart people in academia and smart meters in energy usage.
Sponsors
LadBible - https://www.ladbible.com/
Cathcart Associates - cathcartassociates.com/
RLadies Manchester - https://www.meetup.com/rladies-manchester/
RStudio - https://rstudio.com/
Jupyter - https://jupyter.org/
Spyder - https://www.spyder-ide.org/
JupyterLab - https://jupyterlab.readthedocs.io/en/stable/
LockeData - https://itsalocke.com/
Honeycomb Analytics - https://honeycomb-analytics.com/
Admires
Hannah Fry - https://twitter.com/FryRsquared
DeepMind podcast - https://deepmind.com/blog/article/welcome-to-the-deepmind-podcast
Hello World book - https://www.amazon.co.uk/Hello-World-How-Human-Machine/dp/0857525247
Episode 8 - Mucking around and making things work Ft. Tom Liptrot
Welcome to PyDataMCR episode 8. Today we are talking to Tom Liptrot, who is a Consultant Data Scientist at Ortom, a Manchester-based Data Science Consultancy.
We talk about how flexibility at work can lead to great data products, various meetups Tom will be attending and even some stand-up comedy.
Show Notes
Sponsors
Arctic Shores - arcticshores.com/
Cathcart Associates - cathcartassociates.com/
Meetups
PyDataMCR - https://www.meetup.com/PyData-Manchester/
MancML - https://www.mancml.io/
Chester Data Insights - https://www.meetup.com/Chester-Data-Insights/
AiFrenzy - labs.uk.barclays/ai
CRAP Talks - https://www.meetup.com/CRAP-Talks-CRO-Analytics-Product-Manchester/
Bright festival - http://www.brightclub.org/
Tom
Ortom Data Science Consultancy - https://ortom.co.uk/
DSF blog post - https://ortom.co.uk/2019/03/22/data-fest-2019.html
Packages
Tidyverse - https://www.tidyverse.org/
Pandas - https://pandas.pydata.org/
dplyr - https://dplyr.tidyverse.org/
brms - https://cran.r-project.org/web/packages/brms/index.html
pymc3 - https://docs.pymc.io/
XGBoost - https://cran.r-project.org/web/packages/xgboost/index.html
glmnet - https://cran.r-project.org/web/packages/glmnet/index.html
keras - https://keras.io/
Book recommendation
Andrew Gelman - https://www.amazon.com/Red-State-Blue-Rich-Poor/dp/0691143935
Who you admire
Demis Hassabis - twitter.com/demishassabis?lang=en
Jeff Dean - en.m.wikipedia.org/wiki/Jeff_Dean_(computer_scientist)
Hadley Wickham - http://hadley.nz/
Hannah Fry - http://www.hannahfry.co.uk/
Andy Clark - https://twitter.com/fluffycyborg?lang=en
Karl J. Friston - en.m.wikipedia.org/wiki/Karl_J._Friston
Martin Eastwood - http://www.pena.lt/y/blog.html
Peak.ai - https://peak.ai/
Kaylea Haynes - https://twitter.com/KayleaHaynes
Chris Boddington - https://www.linkedin.com/in/christopher-boddington-449555112/
Episode 7 - Open Science and Imposters syndrome Ft. Rachael Ainsworth
Welcome to PyDataMCR episode 7. Today we are talking to Rachael Ainsworth, who is a Research Associate in Radio Astronomy and Open Science Champion at the Jodrell Bank Centre for Astrophysics, which is part of the University of Manchester, UK. Rachael talks about Open Science, Imposter Syndrome and construction of the Square Kilometre Array.
Show Notes
Square kilometre array - https://en.wikipedia.org/wiki/Square_Kilometre_Array
Astropy - https://www.astropy.org/
Docker - https://www.docker.com/
Singularity - https://singularity.lbl.gov/
The Carpentries - https://carpentries.org/
Rachael’s TEDx Macclesfield talk - https://www.youtube.com/watch?v=c-bemNZ-IqA
Bluedot - https://www.discoverthebluedot.com/
HER+Data - https://www.meetup.com/HER-Data-MCR/
FOSTER - https://www.fosteropenscience.eu/
Rachael’s GitHub - https://rainsworth.github.io/
Figshare - https://figshare.com/
Zenodo - https://zenodo.org/
Shout outs
Beckie Taylor https://twitter.com/RTaylor81
Professor Anna Scaife https://twitter.com/radastrat
Kirsty Devlin https://twitter.com/Kirstydevlin1
Faye Gabe - https://twitter.com/FeyAgape
Sponsors
Cathcart Associates - https://www.cathcartassociates.com/
Episode 6 - Developer Advocacy and dealing with burnout Ft. Tania Allard
Welcome to PyDataMCR episode 6. Today we are talking to Tania Allard, who is a Developer Advocate for Microsoft here in Manchester. Find out what a developer advocate does. We also discuss advice for dealing with burnout in the modern world.
Show Notes
Tania - twitter.com/@ixek
Jupyter - https://jupyter.org/
Azure - https://azure.microsoft.com/en-gb/
QuantStack - https://quantstack.net/
NIPS - https://nips.cc/
Binder - https://gke.mybinder.org/
Shout outs
- Carol Willing - twitter.com/@WillingCarol
- Katharine Jarmul - twitter.com/@kjam
- Satya Nadella - twitter.com/@satyanadella
- Wes McKinney - twitter.com/@wesmckinn
- Limor Fried - https://www.linkedin.com/in/ladyada/
Sponsors
Cathcart Associates - https://www.cathcartassociates.com/
Episode 5 - Data in Dating Ft. Ian Forrester
Today we talk about data in dating (I am sure there is an obvious pun there). Are you aware of the white label sites that share your data? How can a dating site's interests align with yours if it’s in their interests to keep you single? We talk with self-professed dating expert Ian Forrester to find out more.
Ian Forrester is a well-known character on the digital scene in the UK and Europe. Living in Manchester, UK, he works for the BBC's R&D Future Experiences team. He specialises in open innovation and new disruptive opportunities, creating tangible value through open engagement and collaborations with start-ups, universities and early adopters.
His achievements were recently noticed by the Inclusive board, landing him in the top 100 diverse leaders in the UK. He was previously a founder of the dataportability.org group and of social geek events including London Geekdinners, BarCampLondon, London Hackday, Edinburgh TV Un-Festival and Over the Air.
Show Notes
https://clickclickclick.click - spooky tracking website
Match Group profits - https://tcrn.ch/2ZksBTI
Dataclysm - https://amzn.to/2wVk9hG
HDI - https://bbc.in/2MExaqA
Jeni Tennison - https://twitter.com/jenit?lang=en
Episode 4 - Data Science in Production Ft. Leanne Fitzpatrick
Welcome to PyDataMCR episode 4. Today we are talking with Leanne Fitzpatrick, Head of Data at Hello Soda. Leanne is a passionate data leader with experience developing, implementing and growing a data analytics and data science function within (what was) a start-up business, and is an advocate of getting data science straight into production. We gave some fan service by talking about American football for our American audience, and some beer chat for our Manchester audience too!
Show Notes
Docker - https://www.docker.com/
Jupyter - https://jupyter.org/
Anaconda - https://www.anaconda.com/
Martin Fowler on refactoring - https://www.martinfowler.com/books/refactoring.html
Lime - R package - https://cran.r-project.org/web/packages/lime/vignettes/Understanding_lime.html
Airflow - https://airflow.apache.org/
Git LFS - https://git-lfs.github.com/
Kubernetes - https://kubernetes.io/
GoCD - https://www.gocd.org/
Hypothesis - https://hypothesis.readthedocs.io/en/latest/
Brewery recommendations
Zilker Brewery
Hops & Grain
Blue Owl
Lazarus
Jester King
Special mention to cool, welcoming bars: Latchkey, Hi Hat
Edited by Jack Ridley
Episode 3 - Forecasting Ft. Kaylea Haynes
Welcome to PyDataMCR episode 3. Today we are talking about forecasting with Kaylea Haynes, data science team leader at Peak here in Manchester. Kaylea is involved in RLadies Manchester, has a great presence in the data community here in Manchester, and seems to have the impressive skill of being everywhere at once.
Show Notes
ARIMA Python - bit.ly/2Z6VMKB
ARIMA R - bit.ly/2IpErqs
FB Prophet - bit.ly/2X6U9uS
Forecasting book - amzn.to/2KsLuBr
Croston method - bit.ly/2Ueupuy
Ts intermittent - bit.ly/2Z6iMtn
Forecast package r - bit.ly/2UfmTzE
Tidyverse - bit.ly/2KuX139
Rob Hyndman forecasting book - bit.ly/2v1wq3a
Hyndsight blog - bit.ly/2KsT4Mk
Nikolaos Kourentzes and One number forecast - bit.ly/2GgWeyw
Hadley Wickham - bit.ly/2Z7w2Oo
Great women in tech - https://bit.ly/2GeeGH8
Peak - peak.ai/
Episode 2 - Getting a Job in Data Ft. Liam Wilson of Cathcart Associates
Welcome to PyDataMCR episode 2. Today we are talking to Liam Wilson from Cathcart Associates, an independent technology recruitment company. If you’re in Data in Manchester or Edinburgh, chances are you already know Liam as one of the organisers of MancML, and also the sponsor of PyDataMCR and PyDataEdinburgh. From what we have seen, Liam and Cathcart Associates have a better understanding of what data science actually consists of, and they give back to the communities they are a part of.
We discuss the elusive “trophy data scientist”, as well as some tips for anybody in data and a discussion of expected salaries.
Show Notes
Cathcart Associates - https://www.cathcartassociates.com/
Speakers Form - https://buff.ly/2MZtdJw
Slack link - https://bit.ly/2v60ieu
Episode 1 - Geospatial data with Rebecca Davey and David Mulcahy
Welcome to the official PyDataMCR podcast. In this episode we are joined by Rebecca and David from INRIX. We zip through topics including GPS tracking, Hidden Markov Models and open source tools.
Show notes:
Libraries:
Geopandas - http://geopandas.org/
Dockerfile for geopandas - https://hub.docker.com/r/jaspajjr/geopandas
Folium - https://python-visualization.github.io/folium/
HMM-learn - https://hmmlearn.readthedocs.io/en/latest/
Valhalla - https://wiki.openstreetmap.org/wiki/Valhalla
Graphhopper - https://www.graphhopper.com/
Openstreetmap - https://www.openstreetmap.org
Openstreetmap wiki - https://wiki.openstreetmap.org
Mapping Mobility in Stockport - https://bit.ly/2TZFal5
HMM for mapmatching - https://bit.ly/2Nqb5ZL
Tech heroes:
Professor Anna Scaife: https://twitter.com/radastrat
Cathy O'Neil - https://twitter.com/mathbabedotorg
Safiya Noble - https://twitter.com/safiyanoble
Tom MacWright - https://macwright.org/
Our Sponsor
Cathcart Associates is a technology recruitment company with offices in Leeds and Manchester covering all things tech, but with an experienced team focusing on Data Science in the North West. We’re good at what we do. We understand what our candidates do, and what our clients need, and we really care about making sure you both get what you want. We’ve been sponsoring PyDataMCR since its inception because we’re nice guys and we like pizza. Check out our website to get in touch – cathcartassociates.com
Contact
Twitter - twitter.com/pydatamcr
Slack - bit.ly/2v60ieu
Meetup - meetup.com/PyData-Manchester/