E7: The Power of Benchmarking in AI Progress with Praveen Paritosh

Practically Intelligent por Sinan Ozdemir and Akshay Bhushan

Notas del episodio

In this enlightening seventh episode of Practically Intelligent, we take a look at the pivotal role of benchmarking in advancing AI with Praveen Paritosh, a leading figure in AI research. Discover why shared benchmarks are not just important, but critical in pushing the boundaries of AI technology. Praveen enlightens us on the legacy benchmarks like SQuAD, instrumental in testing early question-answer systems, and how they paved the way for early leaderboards in AI. We discuss the concept of shared benchmarks as a mechanism for the research community to collectively tackle and progress in specific challenges, drawing parallels between NLP and image recognition benchmarks like ImageNet. However, it's not all straightforward – benchmarks, while guiding us in the right direction, are merely proxies. We discuss the challenges of differentiating betwe ... 

 ...  Leer más