Note sull'episodio
Vik from Moondream AI joins Joe to demo a vision-language model that runs locally—on your laptop, your phone, even a Raspberry Pi.
From visual question answering to gaze detection and UI automation, Vik shows how Moondream is redefining edge computer vision—no cloud required.
Whether you're into robotics, home automation, or lightweight AI, this “One-Shot” is packed with insights for builders.
Try it yourself at moondream.ai 🚀
00:00 - Intro to Moondream’s compression tech for 2B parameter models
00:22 - Joe welcomes Vik from Moondream
01:53 - Shift from traditional CV to promptable vision-language models
03:23 - Playground demo: Visual Question Answering (VQA)
Parole chiave
Real-Time Edge AIMoondream AIVision-Language ModelsVisual Question AnsweringLocal AI InferenceComputer VisionRaspberry Pi AIUI Automation AILightweight AI ModelsPromptable Vision Models