This model combines vision and language with the goal of supporting real-world AI applications.
The Technology Innovation Institute, the applied research arm of the Advanced Technology Research Council, has launched Falcon Perception, a new multimodal AI model designed to help machines interpret the physical world. This system combines image processing and language processing within a single architecture.
According to the Technology Innovation Institute, this model allows machines to display, read, and understand visual data using natural language prompts. It can identify, segment, and analyze objects in complex images, supporting tasks such as robotics and word processing.
Falcon Perception is built with approximately 600 million parameters, significantly fewer than many competing systems. Nevertheless, the organization says it offers performance comparable to leading models while reducing computational demands and system complexity.
The Technology Innovation Institute said the model is designed for deployment in industrial and enterprise environments where efficiency and scalability are important, and will contribute to broader AI development efforts in the UAE.
Want to know more about AI, technology and digital diplomacy? If so, Ask the Diplo chatbot a question!
