Do you want to boost your career as a data scientist? Our podcast helps you achieve this by teaching you relevant knowledge about every aspect of becoming a more effective data scientist.
Interviews with data scientists, data engineers, machine learning engineers and professionals in other data-related areas. We chat about how they ended up where they are now and what kind of projects they work on.
A deep dive into data scientists' day-to-day work, the tools and models they use, how they tackle problems, and their career journeys. This podcast helps you grow a successful career in data science. Listening to an episode is like having lunch with an experienced mentor. Guests are data science practitioners from various industries, AI researchers, economists, and CTOs of AI companies. Host: Daliana Liu, an ex-Amazon senior data scientist with 180k followers on LinkedIn. Join 20k subscribers at ...
Why data scientists are tired, six real data scientists' frustrations - The Data Scientist Show #089
42:22
Daliana interviewed 6 data scientists from her meetup in New York City. It's a unique episode where you get to hear the real frustrations of data scientists. We talked about struggles working in healthcare, finance, data quality and AI, how to advocate for yourself, and align with your managers. Subscribe to Daliana's newsletter on www.dalianaliu.…
Why 80% of A/B tests fail, how to 10X your experimentation velocity - Kristi Angel - The Data Scientist Show #088
43:46
Most experiments fail. Kristi Angel shares her expertise on scaling experimentation and avoiding common A/B testing pitfalls. Learn five things that can help boost test velocity, design impactful experiments, and leverage knowledge repos. (Chapters below) Kristi Angel’s LinkedIn: https://www.linkedin.com/in/kristiangel/ Subscribe to Dali…
From physics PhD to data science leader, unexpected challenges in survey data, Python vs R, EDA best practices, building MLOps toolkit - Julia Silge - The Data Scientist Show #087
46:18
Julia Silge is an engineering manager at Posit PBC, formerly known as RStudio, where she leads a team of developers building open source software for MLOps. Before Posit, she finished a PhD in astrophysics, worked for several years in the nonprofit space, and was a data scientist at Stack Overflow where some of her most public work involved the annual …
Why he created Pandas, the future of data systems, why he left his CTO role to become a chief architect - Wes McKinney - The Data Scientist Show #086
52:24
Wes McKinney is the co-creator of the pandas library and the cofounder of Voltron Data. Currently he is a Principal Architect at Posit and an investor in data systems. Daliana's Twitter: https://twitter.com/DalianaLiu Daliana’s LinkedIn: https://www.linkedin.com/in/dalianaliu/ Wes' LinkedIn: https://www.linkedin.com/in/wesmckinn/ (00:00:00) I…
From financial analyst to director of analytics, how to get promoted quickly, 7 elements of influence - Christopher Fricker - The Data Scientist Show #085
1:13:51
Christopher Fricker is a senior director in analytics and BI at Renaissance Learning. He started his career in finance and later became a data science consultant working with Meta, Netflix, and pre-IPO tech companies doing analytics. We talked about the mental models that helped him grow from a finance analyst to an analytics leader. Subscribe to D…
Adapters: the game changer for fine-tuning - Geoffrey Angus - The Data Scientist Show #084
52:45
I interviewed Geoffrey Angus, ML team lead at Predibase, to talk about why adapter-based training is a game changer. We started with an overview of fine-tuning and then discussed five reasons why adapters are the future of LLMs. Later we also shared a demo and answered questions from the live audience. Try fine-tuning for free: https://pbase.ai/GetSta…
Landing a job by analyzing Seattle's crime data, from data scientist to founder of interview query, building a lifestyle business - Jay Feng - The Data Scientist Show #083
35:41
Jay Feng created a viral project using Seattle crime data and later got into data science. He later founded "Interview Query" helping data scientists get jobs. We'll talk about how he landed his data science job through his blog, and his journey from data scientist to founder. Subscribe to Daliana's newsletter on www.dalianaliu.com for more on da…
Case studies from the GenAI frontier, scaling ML teams, from biologist to machine learning consultant - Erik Gafni - The Data Scientist Show #082
1:03:40
Erik Gafni builds AI systems and teams. He founded Eventum AI (https://bit.ly/eventum-ai), an ML consulting company working with high-growth startups. We talked about GenAI projects he worked on, how he built production ML systems, how to scale ML teams, and his journey from biologist to ML researcher. Interested in working with Erik: https://bit.l…
Data science job market in 2024, softskills for interviews, AI engineering - Jay Feng - The Data Scientist Show #081
1:07:14
Jay Feng is the CEO of Interview Query, a service that helps data scientists get jobs. Previously he worked as a data scientist at Nextdoor and Monster. We talked about the data science job market, the rise of AI engineering, and the soft skills people overlook during interviews. Subscribe to Daliana's newsletter on www.dalianaliu.com for more on data scien…
How to handle being laid off (as data scientists), severance negotiation, full-time employment vs independent consultant - The Data Scientist Show #080
1:06:33
We are joined by two data scientists who have firsthand experience with layoffs. We’ll talk about how to negotiate severance packages, how to handle stress, strategies for job hunting post-layoff, and how to reduce risks in full-time employment. Working with Daliana on personal branding: https://forms.gle/heNuZzaHjaAMQwLu6 Her email: daliana@dalian…
From data analyst to sales engineer, personality-based career design, sales skills for data people - Jenny Wu - The Data Scientist Show #079
57:26
Jenny Wu is a data analyst turned sales engineer for data products at Hex. We talked about sales engineer vs data analyst, how to design a career based on your personality, and how to transition into a customer-facing role. Jenny’s LinkedIn: https://www.linkedin.com/in/jenny-wu-... Daliana's Twitter: https://twitter.com/DalianaLiu Daliana’s LinkedIn:…
As a Statistician - Do You Take the Back Seat or Do You Drive Yourself? (Episode 24)
21:37
Discussion with Alexander Schacht and Benjamin Piske about how this relates to your goals, what it takes to think strategically, what role innovation plays, what practical steps to take to drive teams forward, what knowledge to acquire to lead teams successfully, how this relates to influencing, and how your attitude plays a big role in this.…
The future of data science teams, integrating AI into data science workflows, building data apps for stakeholders - Barry McCardel - The Data Scientist Show #078
1:04:56
Barry McCardel is the cofounder and CEO of Hex (free trial: hex.tech/dsshow), a collaborative data workspace. Their customers include Fivetran, Notion, and Anthropic. We talked about what the future of data teams looks like, how to tackle the challenges of data team collaboration, and how to leverage AI in data science workflows. 60-day Free Trial: …
Product data science for Microsoft AI, data scientist's role of GenAI, how to deal with burn out - Sid Sharan - The Data Scientist Show #077
58:57
Siddhartha Sharan is a Senior Data and Applied Scientist at Microsoft, helping product teams make data-driven decisions. Currently he is working on an AI product built with OpenAI APIs for sentiment analysis. We talked about how he evaluates AI products built with large language models at Microsoft, product data science, and how he went from a busi…
On Building Your Own Company (Episode 23)
44:32
Interview with Shafi Chowdhury. In this episode, we’ll cover an amazing story by one of the best programmers and mentors I have ever worked with - Shafi Chowdhury (www.shaficonsultancy.com). We’ll explore how he went from being only a freelance programmer to building his company on the side. He had a great vision in mind that drove him forward. You’ll a…
How she doubled her salary in a year as a data analyst, SQL in the real world, is job hopping bad? - Jess Ramos - The Data Scientist Show #076
1:07:48
Jess Ramos is a Senior Data Analyst at Crunchbase, a LinkedIn Learning Instructor, and a content creator in the data space. She has a bachelor's degree in Math, Spanish, and Business from Berry University and a master's in Business Analytics from University of Georgia. Today we’ll talk about SQL in the real world, data analyst vs data scientist, is…
Discussion with Alexander Schacht and Paolo Eusebi. By Alexander Schacht and Paolo Eusebi
Dimension Reduction with PCA (Episode 21)
21:31
Discussion with Alexander Schacht and Paolo Eusebi. By Alexander Schacht and Paolo Eusebi
How he got into machine learning and Gen AI at Amazon, how we went from "enemies" to allies - Mehdi Noori - The Data Scientist Show #075
1:32:22
Mehdi Noori is an applied science manager at the Generative AI Innovation Center at Amazon. I used to work with Mehdi while we were at the Machine Learning Solutions Lab at AWS. Before Amazon, Mehdi was a data scientist working on marketing intelligence. Mehdi has a PhD in civil engineering and sustainability from the University of Central Florida. …
Why she quit her finance job to become a farmer, exploring a different path from the modern life - Misty Arnold - The Data Scientist Show #074
1:10:28
My friend Misty moved to a farm in Portugal after a 20-year career in finance. We talked about her experience moving from busy corporate life to farm life, where she does a lot of manual work. We covered whether it was challenging, how her finances work, and her advice to other people who also want to explore a different path outside of the m…
Why he left his MLE job for product data science at Meta, data science at Uber, Linkedin, and Truecar - Pan Wu - The Data Scientist Show #073
1:13:01
Pan Wu is a senior manager of data science at Meta. We talked about why he moved from machine learning to product data science, projects he worked on at Uber, LinkedIn, and Meta, and how he transitioned from IC to manager. Subscribe to Daliana's newsletter on www.dalianaliu.com for more on data science and career. Pan’s LinkedIn: https://www.linked…
How to Write Impactful and Effective Emails While Avoiding Common Mistakes (Episode 20)
34:27
Do you have a lot of email ping-pong, where emails go back and forth many times – too many times? Are you aware of the personal brand that you communicate with your email style? Is email your default communication tool? Then this episode is for you. We have researched various articles on good email writing and distilled the best for you in th…
Machine learning in cybersecurity, computer vision in sports, from business analyst to ML engineer - Betty Zhang - The Data Scientist Show #072
55:12
Betty Zhang is a data scientist currently working at a cloud security company; previously she was a data scientist at Amazon Web Services. Today we’ll talk about her computer vision projects in sports, data science use cases in cybersecurity, her path from business major to data scientist, and her experience working in startups vs big tech companies. Su…
Stop abusing A/B testing, toxic experimentation culture, how to run A/B tests with rigor - Che Sharma - The Data Scientist Show #071
1:03:42
Che Sharma came back to discuss toxic behaviors in experimentation culture and provide actionable advice on how to handle those situations and how to maintain rigor and integrity when designing and analyzing A/B tests. Che was the 4th data scientist at Airbnb; later he joined Webflow as an early employee. In 2021 he founded Eppo, a next-gen A/B experiment…
Tips and Tricks to Reduce Your Email Burden Including the Option of Last Resort (Episode 19)
30:00
Discussion with Alexander Schacht and Benjamin Piske. By listening to this episode, you’ll learn about these topics: helpful mindsets about emails; a five-step approach to managing emails; good habits to establish, like replying in a timely manner and sending and responding less to receive less; tips on how to set up filters; smartphone vs desktop email check…
Academia vs. Industry for Machine Learning, Research at Uber AI Labs, ML for Wind Farms - Jason Yosinski - The Data Scientist Show #070
1:16:09
Jason Yosinski was a founding member of Uber AI Labs. He is also a co-founder of WinscapeAI, a company dedicated to using custom sensor networks and machine learning to increase the efficiency and sustainability of wind farms. Jason holds a PhD in computer science from Cornell University. We talked about his experience at Uber AI, his research in de…
Data Visualization Part 2 (Episode 18)
23:10
Discussion with Alexander Schacht and Paolo Eusebi. In this episode, we share our ideas and experiences about which mindset sets up statisticians for success. We cover topics around:
By Alexander Schacht and Benjamin Piske, biometricians, statisticians and leaders in the pharma industry
Data Visualization Part 1 (Episode 17)
24:18
By Alexander Schacht and Benjamin Piske, biometricians, statisticians and leaders in the pharma industry
Success Starts in Your Head - Thoughts About the Mindset of Being Successful (Episode 16)
23:34
Discussion with Alexander Schacht and Benjamin Piske. In this episode, we share our ideas and experiences about which mindset sets you up for success. We cover topics around: leading people; convincing business partners; delivering value and selling it, and what selling means; thinking outside the status quo to improve things in the long run; always learning …
Ads forecasting at Netflix and Spotify, how to build your personal moat - Jeff Li - The Data Scientist Show #069
1:26:29
Jeff Li is a senior data scientist at Netflix, focusing on ads forecasting. Previously he was a data science manager at Spotify, where he worked on supply forecasting, demand forecasting, and data infrastructure. He studied business at the University of Southern California. We talked about ads forecasting, career path as a manager vs IC, culture at Spotify vs …
What Are the Questions to Ask If You Get a New Project? (Episode 15)
21:18
By Alexander Schacht and Paolo Eusebi
A/B testing at Airbnb, building next-gen experimentation platform at Eppo - Che Sharma - The Data Scientist Show #068
1:14:15
Che Sharma was the 4th data scientist at Airbnb; later he joined Webflow as an early employee. In 2021 he founded Eppo, a next-gen A/B experimentation platform designed for modern data and product teams to run more trustworthy and advanced experiments. We talked about A/B testing best practices, A/B testing for ML models, and Che’s career journey. …
From data scientist@Meta to full-time YouTuber (500k+ sub), AI engineering, future of work - Tina Huang - The Data Scientist Show #067
1:54:52
We talked about self-learning, productivity, how Tina navigates her career change and how she thinks AI could change the future of work. Tina's YouTube: www.youtube.com/@TinaHuang1 Lonely Octopus: www.lonelyoctopus.com Subscribe to Daliana's newsletter on www.dalianaliu.com for more on data science and career. Tina Huang is a data scientist turned …
Making LLMs hallucinate less, how to diagnose ML models, from PM in Google AI to CEO of Galileo - Vikram Chatterji - The Data Scientist Show #066
1:26:50
Vikram is the co-founder of Galileo – an AI diagnostics and explainability platform used by data science teams building NLP, LLMs and Computer Vision models across the Fortune 500 and high growth startups. Prior to Galileo, Vikram led Product Management at Google AI, where his team built models for the Fortune 2000 across retail, financial services…
Data Science "Mixed Martial Arts", applied reinforcement learning, scaling AI workloads using Ray - Max Pumperla - The Data Scientist Show #065
1:53:28
Max Pumperla designed his own career path in data science. He is a freelance software engineer at Anyscale and also a data science professor. We talked about reinforcement learning, open source contributions, Ray for data scientists, and his view on the data scientist's role. If you enjoy the show, subscribe to the channel and leave a 5-star review…
A Picture Says More Than 1000 Tables (Episode 14)
45:41
By Gyom
Uber's ML Systems (Uber Eats, Customer Support), Declarative Machine Learning - Piero Molino - The Data Scientist Show #064
1:50:05
Piero Molino was one of the founding members of Uber AI Labs. He worked on several deployed ML systems, including an NLP model for Customer Support, and the Uber Eats Recommender System. He is the author of Ludwig, an open source declarative deep learning framework. In 2021 he co-founded Predibase, the low-code declarative machine learning platfor…
Writing Reproducible Reports using Quarto (Episode 13)
16:58
Discussion with Paolo and Thomas. Communicating data is so important! Quarto is a fantastic tool for writing reproducible reports using literate programming. Literate programming allows us to incorporate documentation and code in the same program. The data science community has embraced this idea by adopting R Markdown and Jupyter Notebooks. Using Quarto…
Data science in transportation, the intersection of operations research and ML - Holger Teichgraeber - The Data Scientist Show #063
46:53
Holger Teichgraeber is a Data Science Manager at Archer Aviation. Previously, he worked at Convoy as a Research Scientist on their trucking marketplace, and at various companies in the energy space. Holger has a Bachelor's degree in Mechanical Engineering from Aachen, Germany, and a Master's and Ph.D. with a research focus on machine learning and opti…
Sharing your Code with R Packages (Episode 12)
22:20
Resources: R Packages (2e). By Alexander Schacht and Paolo Eusebi
How to Effectively Structure Data Science Projects in R (Episode 11)
21:36
In this episode, Paolo and Thomas dive into the fundamental principles for a well-structured data science project. These include practical advice on: • organizing files into folders, • documenting and commenting code, • using version control systems and much more. Although the episode focuses on applying these fundamental principles in R projects, …
Dichotomization and Proportional Odds Model (Episode 10)
15:30
In this episode, we move from the logistic regression model to the proportional odds model, with emphasis on interpretation and the checking of assumptions (visually and analytically). We also speak about the opportunities and challenges of dealing with the dichotomization of ordinal or continuous variables. Resources: ● McCullagh, Peter, and John A. Nel…
Tackling data quality issues, 5 pillars of data observability, from management consultant to CEO of Monte Carlo - Barr Moses - The Data Scientist Show #062
1:21:31
Barr Moses is a consultant turned CEO & Co-Founder of Monte Carlo, a data reliability company. She started her career as a management consultant at Bain & Company and a research assistant at the Statistics Department at Stanford University. Later, she became VP of Customer Operations at customer success company Gainsight, where she built the data a…
Logistic regression is a beautiful tool for modeling a binary dependent variable, although many more complex extensions exist. In the show, we will speak about the generalized linear model family, logit and probit functions, interpretations, and practicalities. Resources: ● McCullagh, Peter, and John A. Nelder. Generalized linear models. Routledge, 1…
3 steps to make your research more reproducible with Heidi Seibold (Episode 8)
33:40
Creating reproducible research is crucial for data scientists as it ensures transparency, understanding, and accuracy in the research process. Not only does it help others understand your work, but it also allows for the reproduction and verification of your results in the future. Heidi Seibold, an expert in reproducible research, suggests three st…
The art of communicating data with Hana Khan
30:45
Alexander interviewed Hana Khan about her path from being a data analyst to a data visualizer. Hana runs Hanalytx, her own company, which specializes in helping others present and visualize data. Hana also runs the Art of Communicating Data podcast. In this episode, Hana and Alexander discussed super interesting topics like sources of in…
Bayesian inference and probabilistic programming
49:59
Interview with Alex Andorra. Interviewing Alex Andorra about Bayesian inference, probabilistic programming, and more was a pleasure. Alex is a data scientist and modeler at the PyMC Labs consultancy. He's also an open-source enthusiast and core contributor to the Python packages PyMC and ArviZ. Alex is also a contributor and instructor in the "Intui…
Is search dead? Google vs ChatGPT, from Google Search to enterprise search at Glean, machine learning in search, tech layoffs - Deedy Das - The Data Scientist Show #061
1:27:06
Deedy Das is a founding engineer at Glean, an enterprise search startup. Previously, he was a Tech Lead at Google Search working on query understanding and the sports product in New York, Tel Aviv, and Bangalore. Before that, he was an engineer at Facebook New York and graduated from Cornell University. Outside of work, Deedy writes on his blog. He…
The 100-hour work week of a self-taught machine learning researcher, how he got into Google Brain, why he started Omni - Jeremy Nixon - The Data Scientist Show #060
1:42:52
Jeremy Nixon is a machine learning researcher, software engineer, and startup founder. Previously he was a software engineer at Google Brain working on deep learning. Now, he is the co-founder and CEO of Omni, building an immersive information retrieval system for you and your team. He studied applied math at Harvard University. Today we’ll talk ab…
Everything to know to write programs like a pro - Principles for good programming (Episode 5)
21:35
Interview with Shafi Chowdhury. He has over 20 years of experience as a statistical programmer in the pharma industry. He worked for pharma companies and CROs across Europe in many different therapeutic areas and in all phases of…