E7: The Power of Benchmarking in AI Progress with Praveen Paritosh

Practically Intelligent by Sinan Ozdemir and Akshay Bhushan

S01 E07

48:41

Episode notes

In this seventh episode of Practically Intelligent, we look at the pivotal role of benchmarking in advancing AI with Praveen Paritosh, a leading figure in AI research. Discover why shared benchmarks are not just important but critical to pushing the boundaries of AI technology. Praveen walks us through legacy benchmarks such as SQuAD, which was instrumental in testing early question-answering systems, and explains how they paved the way for early AI leaderboards. We discuss shared benchmarks as a mechanism that lets the research community collectively tackle specific challenges and make progress on them, drawing parallels between NLP benchmarks and image recognition benchmarks like ImageNet. However, it's not all straightforward: benchmarks, while guiding us in the right direction, are merely proxies. We discuss the challenges of differentiating betwe ...
