Apple's FastVLM Video Caption AI offers instant image processing live, but there's a catch

AI Video & Visuals


The latest Apple-made artificial intelligence model, FastVLM, is now available for users to try it out. This provides a very fast video caption AI that can explain what the camera captures on the device.

The report claims that the capabilities and capabilities of the AI ​​model provide high resolution image processing nearby, making it one of the most notable technologies available for live captions.

It has been revealed that Apple has been developing the MLX open framework since 2023, and reports on the development of rumors of Apple smart glasses have speculated that the technology will be available in wearables.

Apple's FastVLM Video Caption AI is currently live

Apple FastVlm
Isaiah Richard/Technology

According to 9TO5MAC, Apple released FastVLM technology a few months ago, leveraging Visual Language model (VLM) that uses an ML open framework specifically designed for MLX or Apple silicon.

With current iterations, video captions are 85 times faster, and are as small as similar models available on the market.

According to the report, video caption AI provides information about what the camera captures and encounters, providing a myopia response. However, before creating live caption information, you need to focus on the objects that the user wants to process.

At the time of reporting, there are only a few prompts for AI to follow live captioning features, such as commands such as “Describe what is displayed in one sentence” and “Identify the text or written content displayed.”

There's a catch to try Apple's FastVLM AI

Users can try out the technology via uploads Apple shares with the Hugging Face Repository, or use the lighter web-based version of FastVLM 0.5B.

However, it is important to note that this technology is designed for use in Apple silicon with the MLX framework. Additionally, 9TO5MAC reported that the main version of AI takes time to complete the load, despite the fact that it already uses an M2 MAC with 16GB of memory.

Apple Smart Glasses and AR Tech

There were multiple reports earlier this year that Apple exploded over the development of its own smart glasses. It features daily wearables that compete with Ray-Banmeta.

Initially, it was claimed that the device will be available in 2026, but Apple claims it is not ready for technology yet, targeting 2027 to debut wearables along with camera-equipped AirPods.

There are also other wearables reportedly in the pipeline due to Apple's plans to enter the AR market, including next year's new Vision Air Device. The Vision Air is a lightweight version of the Vision Pro XR headset and is not recommended for daily use.

Analyst Ming-Chi Kuo previously argued that Apple is now looking at the world of wearables as it considers head-mounted wearables to be the next major trend in consumer technology.



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *