Microsoft recently hosted its annual developer conference Build 2024, showcasing a number of new features and announcements across its product line. During the event, the tech giant announced a number of upcoming new features, including real-time video translation capabilities. This new AI-powered feature allows users to experience live translation of audio content on platforms such as YouTube, LinkedIn, and Coursera, providing both dubbing and subtitles.
Microsoft says the new translation feature will initially support translations from Spanish to English, English to German, Hindi, Italian, Russian, and Spanish. This feature will be expanded to other languages and websites in the future. This new feature aims to make video content more accessible to a wider audience, including those who are deaf or hard of hearing, by providing live subtitles.
In addition to education platforms like YouTube and Coursera, Edge extends this real-time translation capability to videos from major news websites like Reuters, CNBC, and Bloomberg. This is consistent with giving users access to a wide range of content, regardless of the language of origin.
In particular, Microsoft Edge's new real-time translation feature is part of a broader suite of AI enhancements integrated into Edge through Microsoft's Copilot initiative. Edge users are already benefiting from AI tools such as Video Summarization, which utilizes available transcripts to create text summaries of his YouTube videos. Our new translation feature takes this even further by translating audio content in real-time without the need for an existing transcript.
Meanwhile, apart from AI translation, Microsoft announced a lot about Windows and AI during its Build 2024 keynote. One of the highlights was the introduction of new capabilities for the Copilot AI agent, which is designed to handle simple tasks such as monitoring emails, automating workflows, helping with employee onboarding, and performing data entry. did. These AI agents are intended to assist businesses by taking over repetitive tasks, rather than replacing jobs. These new features will be available in Copilot Studio later this year.
Microsoft also announced Phi-3 Vision, a new compact multimodal AI model that can read text and analyze images, optimized for mobile devices. This model is part of his Phi-3 family announced in April and is currently available in preview. Phi-3 Vision's design enables advanced image analysis on smartphones, improving the usability of AI for mobile users.
In addition to AI advancements, Microsoft also announced custom emoji support for Microsoft Teams. This will allow users to add personalized emojis within their organization starting in July. Other updates include Qualcomm's new Snapdragon Dev Kit for Windows, powered by the Snapdragon can. In addition, PowerToys for Windows 11 includes advanced paste functionality that uses AI to convert clipboard content into various formats and requires an OpenAI API key for full functionality.
