Gemini may be getting a big update on mobile in the near future, adding some new features including text box overlays. Details of the upgrade are from industry insider His AssembleDebug, whose findings have been shared in several publications.
PiunikaWeb gave us some insight into the overlay and it's very interesting to see it in action. It turns the AI input box into a small floating window at the bottom of your phone's display that stays there even when you close the app. For example, you can talk to Gemini while browsing the Internet or checking your email.
AssembleDebug was able to activate windows and work on mobile phones when using X (the platform formerly known as Twitter). His demo video shows it working exactly like the Gemini app. Ask the AI a question and after a few seconds you will receive a complete response including source links, images, and even a YouTube video if you need further inquiries. Depending on the answer, it may be difficult to see the app behind it.
The AssembleDebug video reveals that the length varies depending on whether the question requires a long-form answer. It's worth mentioning that the overlay is multimodal, so you can write queries, give verbal commands to the AI, and upload images.
smarter AI
Other notable changes were shared with Android Authority. First, Gemini on Android will now be able to accept many more file types than just photos. The image shows a tester uploading her PDF and asking the AI to summarize the text within it. Apparently, this feature exists in the current version of Gemini, but nothing happens when I activate it. Android Authority speculates that this update may be limited to either Google Workspace or Gemini Advanced. Maybe both. It's hard to tell at this point.
The second is a very basic but still useful tool.[テキストの選択]It is called. The way Gemini currently works is that it forces you to copy entire blocks of text, even if it's only a small part.[テキストを選択]solves this problem by allowing you to retrieve a specific line or paragraph.
No, it's not a flashy upgrade. Almost every app in the world has the same functionality. Still, the tool has a “significant impact on Gemini's usability.” The ease of use of AI is greatly improved as it is no longer as restrictive.
#Google✨ Gemini Android app finally no longer forces you to copy the entire prompt response 🔥 Read on AndroidAuthority – https://t.co/M1EFGwfbNJ#Google #Gemini #AI pic.twitter .com/BFkKCbKylRApril 23, 2024
A fourth small update was discovered by AssembleDebug. This is known as real-time response. The descriptive text next to it claims that this tool allows you to see your answers written out in real time. However, as PiunikaWeb points out, this is just an animation change. There are no “substantive benefits”. Instead of waiting for Gemini to generate his response as one solid blob, you can choose to have the AI write everything out line by line for you, similar to its desktop counterpart.
Google I/O 2024 opens in about three weeks on May 14th. It's not clear when these features will be rolled out, but we'll likely learn more during the event.
While you wait, if you're considering an upgrade, check out TechRadar's roundup of the best Android smartphones of 2024.
