How Vik Built Moondream—A Tiny Vision Model with Big Power

AI Tinkerers - "One-Shot" by Joe Heitzeberg

Episode notes

Vik from Moondream AI joins Joe to demo a vision-language model that runs locally—on your laptop, your phone, even a Raspberry Pi.

From visual question answering to gaze detection and UI automation, Vik shows how Moondream is redefining edge computer vision—no cloud required.

Whether you're into robotics, home automation, or lightweight AI, this “One-Shot” is packed with insights for builders.

Try it yourself at moondream.ai 🚀

00:00 - Intro to Moondream’s compression tech for 2B parameter models

00:22 - Joe welcomes Vik from Moondream

01:53 - Shift from traditional CV to promptable vision-language models

03:23 - Playground demo: Visual Question Answering (VQA)

Keywords
Real-Time Edge AI, Moondream AI, Vision-Language Models, Visual Question Answering, Local AI Inference, Computer Vision, Raspberry Pi AI, UI Automation AI, Lightweight AI Models, Promptable Vision Models