エピソード

  • #129 Bayesian Deep Learning & AI for Science with Vincent Fortuin
    2025/04/02

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • Intro to Bayes Course (first 2 lessons free)
    • Advanced Regression Course (first 2 lessons free)

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • The hype around AI in science often fails to deliver practical results.
    • Bayesian deep learning combines the strengths of deep learning and Bayesian statistics.
    • Fine-tuning LLMs with Bayesian methods improves prediction calibration.
    • There is no single dominant library for Bayesian deep learning yet.
    • Real-world applications of Bayesian deep learning exist in various fields.
    • Prior knowledge is crucial for the effectiveness of Bayesian deep learning.
    • Data efficiency in AI can be enhanced by incorporating prior knowledge.
    • Generative AI and Bayesian deep learning can inform each other.
    • The complexity of a problem influences the choice between Bayesian and traditional deep learning.
    • Meta-learning enhances the efficiency of Bayesian models.
    • PAC-Bayesian theory merges Bayesian and frequentist ideas.
    • Laplace inference offers a cost-effective approximation.
    • Subspace inference can optimize parameter efficiency.
    • Bayesian deep learning is crucial for reliable predictions.
    • Effective communication of uncertainty is essential.
    • Realistic benchmarks are needed for Bayesian methods
    • Collaboration and communication in the AI community are vital.

    Chapters:

    00:00 Introduction to Bayesian Deep Learning

    04:24 Vincent Fortuin’s Journey to Bayesian Deep Learning

    11:52 Understanding Bayesian Deep Learning

    16:29 Current Landscape of Bayesian Libraries

    21:11 Real-World Applications of Bayesian Deep Learning

    23:33 When to Use Bayesian Deep Learning

    28:22 Data Efficiency in AI and Generative Modeling

    30:18 Integrating Bayesian Knowledge into Generative Models

    31:44 The Role of Meta-Learning in Bayesian Deep Learning

    34:06 Understanding Pack Bayesian Theory

    37:55 Algorithms for Bayesian Deep Learning Models

    続きを読む 一部表示
    1 時間
  • #128 Building a Winning Data Team in Football, with Matt Penn
    2025/03/19

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • Intro to Bayes Course (first 2 lessons free)
    • Advanced Regression Course (first 2 lessons free)

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Matt emphasizes the importance of Bayesian statistics in scenarios with limited data.
    • Communicating insights to coaches is a crucial skill for data analysts.
    • Building a data team requires understanding the needs of the coaching staff.
    • Player recruitment is a significant focus in football analytics.
    • The integration of data science in sports is still evolving.
    • Effective data modeling must consider the practical application in games.
    • Collaboration between data analysts and coaches enhances decision-making.
    • Having a robust data infrastructure is essential for efficient analysis.
    • The landscape of sports analytics is becoming increasingly competitive.
    • Player recruitment involves analyzing various data models.
    • Biases in traditional football statistics can skew player evaluations.
    • Statistical techniques should leverage the structure of football data.
    • Tracking data opens new avenues for understanding player movements.
    • The role of data analysis in football will continue to grow.
    • Aspiring analysts should focus on curiosity and practical experience.

    Chapters:

    00:00 Introduction to Football Analytics and Matt's Journey

    04:54 The Role of Bayesian Methods in Football

    10:20 Challenges in Communicating Data Insights

    17:03 Building Relationships with Coaches

    22:09 The Structure of the Data Team at Como

    26:18 Focus on Player Recruitment and Transfer Strategies

    28:48 January Transfer Window Insights

    30:54 Biases in Football Data Analysis

    34:11 Comparative Analysis of Men's and Women's Football

    36:55 Statistical Techniques in Football Analysis

    42:48 The Impact of Tracking Data on Football Analysis

    45:49 The Future of Data-Driven Football Strategies

    47:27 Advice for Aspiring Football Analysts

    続きを読む 一部表示
    58 分
  • #127 Saving Sharks... with Python, Causal Inference and Aaron MacNeil
    2025/03/05

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • Intro to Bayes Course (first 2 lessons free)
    • Advanced Regression Course (first 2 lessons free)

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström, Stefan, Corey Abshire, Mike Loncaric, David McCormick, Ronald Legere, Sergio Dolia and Michael Cao.

    Takeaways:

    • Sharks play a crucial role in maintaining healthy ocean ecosystems.
    • Bayesian statistics are particularly useful in data-poor environments like ecology.
    • Teaching Bayesian statistics requires a shift in mindset from traditional statistical methods.
    • The shark meat trade is significant and often overlooked.
    • Ray meat trade is as large as shark meat trade, with specific markets dominating.
    • Understanding the ecological roles of species is essential for effective conservation.
    • Causal language is important in ecological research and should be encouraged.
    • Evidence-driven decision-making is crucial in balancing human and ecological needs.
    • Expert opinions are...
    続きを読む 一部表示
    1 時間 4 分
  • #126 MMM, CLV & Bayesian Marketing Analytics, with Will Dean
    2025/02/19

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • Intro to Bayes Course (first 2 lessons free)
    • Advanced Regression Course (first 2 lessons free)

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Marketing analytics is crucial for understanding customer behavior.
    • PyMC Marketing offers tools for customer lifetime value analysis.
    • Media mix modeling helps allocate marketing spend effectively.
    • Customer Lifetime Value (CLV) models are essential for understanding long-term customer behavior.
    • Productionizing models is essential for real-world applications.
    • Productionizing models involves challenges like model artifact storage and version control.
    • MLflow integration enhances model tracking and management.
    • The open-source community fosters collaboration and innovation.
    • Understanding time series is vital in marketing analytics.
    • Continuous learning is key in the evolving field of data science.

    Chapters:

    00:00 Introduction to Will Dean and His Work

    10:48 Diving into PyMC Marketing

    17:10 Understanding Media Mix Modeling

    25:54 Challenges in Productionizing Models

    35:27 Exploring Customer Lifetime Value Models

    44:10 Learning and Development in Data Science

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz,...

    続きを読む 一部表示
    55 分
  • #125 Bayesian Sports Analytics & The Future of PyMC, with Chris Fonnesbeck
    2025/02/05

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • Intro to Bayes Course (first 2 lessons free)
    • Advanced Regression Course (first 2 lessons free)

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström, Stefan, Corey Abshire and Mike Loncaric.

    Takeaways:

    • The evolution of sports modeling is tied to the availability of high-frequency data.
    • Bayesian methods are valuable in handling messy, hierarchical data.
    • Communication between data scientists and decision-makers is crucial for effective model use.
    • Models are often wrong, and learning from mistakes is part of the process.
    • Simplicity in models can sometimes yield better results than complexity.
    • The integration of analytics in sports is still developing, with opportunities in various sports.
    • Transparency in research and development teams enhances decision-making.
    • Understanding uncertainty in models is essential for informed decisions.
    • The balance between point estimates and full distributions is a...
    続きを読む 一部表示
    58 分
  • #124 State Space Models & Structural Time Series, with Jesse Grabowski
    2025/01/22

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • My Intuitive Bayes Online Courses
    • 1:1 Mentorship with me

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Bayesian statistics offers a robust framework for econometric modeling.
    • State space models provide a comprehensive way to understand time series data.
    • Gaussian random walks serve as a foundational model in time series analysis.
    • Innovations represent external shocks that can significantly impact forecasts.
    • Understanding the assumptions behind models is key to effective forecasting.
    • Complex models are not always better; simplicity can be powerful.
    • Forecasting requires careful consideration of potential disruptions. Understanding observed and hidden states is crucial in modeling.
    • Latent abilities can be modeled as Gaussian random walks.
    • State space models can be highly flexible and diverse.
    • Composability allows for the integration of different model components.
    • Trends in time series should reflect real-world dynamics.
    • Seasonality can be captured through Fourier bases.
    • AR components help model residuals in time series data.
    • Exogenous regression components can enhance state space models.
    • Causal analysis in time series often involves interventions and counterfactuals.
    • Time-varying regression allows for dynamic relationships between variables.
    • Kalman filters were originally developed for tracking rockets in space.
    • The Kalman filter iteratively updates beliefs based on new data.
    • Missing data can be treated as hidden states in the Kalman filter framework.
    • The Kalman filter is a practical application of Bayes' theorem in a sequential context.
    • Understanding the dynamics of systems is crucial for effective modeling.
    • The state space module in PyMC simplifies complex time series modeling tasks.

    Chapters:

    00:00 Introduction to Jesse Krabowski and Time Series Analysis

    04:33 Jesse's Journey into Bayesian Statistics

    10:51 Exploring State Space Models

    18:28 Understanding State Space Models and Their Components

    続きを読む 一部表示
    1 時間 36 分
  • #123 BART & The Future of Bayesian Tools, with Osvaldo Martin
    2025/01/10

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • My Intuitive Bayes Online Courses
    • 1:1 Mentorship with me

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • BART models are non-parametric Bayesian models that approximate functions by summing trees.
    • BART is recommended for quick modeling without extensive domain knowledge.
    • PyMC-BART allows mixing BART models with various likelihoods and other models.
    • Variable importance can be easily interpreted using BART models.
    • PreliZ aims to provide better tools for prior elicitation in Bayesian statistics.
    • The integration of BART with Bambi could enhance exploratory modeling.
    • Teaching Bayesian statistics involves practical problem-solving approaches.
    • Future developments in PyMC-BART include significant speed improvements.
    • Prior predictive distributions can aid in understanding model behavior.
    • Interactive learning tools can enhance understanding of statistical concepts.
    • Integrating PreliZ with PyMC improves workflow transparency.
    • Arviz 1.0 is being completely rewritten for better usability.
    • Prior elicitation is crucial in Bayesian modeling.
    • Point intervals and forest plots are effective for visualizing complex data.

    Chapters:

    00:00 Introduction to Osvaldo Martin and Bayesian Statistics

    08:12 Exploring Bayesian Additive Regression Trees (BART)

    18:45 Prior Elicitation and the PreliZ Package

    29:56 Teaching Bayesian Statistics and Future Directions

    45:59 Exploring Prior Predictive Distributions

    52:08 Interactive Modeling with PreliZ

    54:06 The Evolution of ArviZ

    01:01:23 Advancements in ArviZ 1.0

    01:06:20 Educational Initiatives in Bayesian Statistics

    01:12:33 The Future of Bayesian Methods

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin...

    続きを読む 一部表示
    1 時間 32 分
  • #122 Learning and Teaching in the Age of AI, with Hugo Bowne-Anderson
    2024/12/26

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

    • My Intuitive Bayes Online Courses
    • 1:1 Mentorship with me

    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Effective data science education requires feedback and rapid iteration.
    • Building LLM applications presents unique challenges and opportunities.
    • The software development lifecycle for AI differs from traditional methods.
    • Collaboration between data scientists and software engineers is crucial.
    • Hugo's new course focuses on practical applications of LLMs.
    • Continuous learning is essential in the fast-evolving tech landscape.
    • Engaging learners through practical exercises enhances education.
    • POC purgatory refers to the challenges faced in deploying LLM-powered software.
    • Focusing on first principles can help overcome integration issues in AI.
    • Aspiring data scientists should prioritize problem-solving over specific tools.
    • Engagement with different parts of an organization is crucial for data scientists.
    • Quick paths to value generation can help gain buy-in for data projects.
    • Multimodal models are an exciting trend in AI development.
    • Probabilistic programming has potential for future growth in data science.
    • Continuous learning and curiosity are vital in the evolving field of data science.

    Chapters:

    09:13 Hugo's Journey in Data Science and Education

    14:57 The Appeal of Bayesian Statistics

    19:36 Learning and Teaching in Data Science

    24:53 Key Ingredients for Effective Data Science Education

    28:44 Podcasting Journey and Insights

    36:10 Building LLM Applications: Course Overview

    42:08 Navigating the Software Development Lifecycle

    48:06 Overcoming Proof of Concept Purgatory

    55:35 Guidance for Aspiring Data Scientists

    01:03:25 Exciting Trends in Data Science and AI

    01:10:51 Balancing Multiple Roles in Data Science

    01:15:23 Envisioning Accessible Data Science for All

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim

    続きを読む 一部表示
    1 時間 23 分