AI
XGBoost: How a Committee of Dumb Models Outsmarted the World's Best Algorithms
AI
pplpod by pplpod
E6046
20:25
A single brilliant expert should always beat a crowd of amateurs — right? Not in machine learning. The most dominant force in competitive data science for over a decade isn't a sophisticated neural network. It's a massive, blazing-fast committee of shallow decision trees that individually know almost nothing.
In this episode, we trace XGBoost from its humble origins as a terminal app in a University of Washington research lab to its breakout moment winning the CERN Higgs Boson challenge — and its subsequent reign as the undisputed weapon of choice on Kaggle. We break down the math that makes it "extreme": how second-order Taylor approximations (the Newton-Raphson method) let the algorithm feel both the slope and the curvature of its errors, taking smarter steps down the optimization landscape than standard gradient boosting ever co ...