Reader

Nexa AI unveiled Omnivision, a compact vision-language model tailored for edge devices. By significantly reducing image tokens from 729 to 81, Omnivision lowers latency and computational requirements while maintaining strong performance in tasks like visual question answering and image captioning.

By Robert Krzaczyński

Reader

Nexa AI Unveils Omnivision: A Compact Vision-Language Model for Edge AI