Reader

Nexa AI Unveils Omnivision: A Compact Vision-Language Model for Edge AI

| InfoQ | Default

Nexa AI unveiled Omnivision, a compact vision-language model tailored for edge devices. By significantly reducing image tokens from 729 to 81, Omnivision lowers latency and computational requirements while maintaining strong performance in tasks like visual question answering and image captioning.

By Robert KrzaczyƄski