OpenAI is reportedly planning to integrate its Sora AI video generation model into ChatGPT, expanding the platform’s capabilities for multimedia content creation.
Perplexity has launched voice mode for Perplexity Computer, enabling users to interact with its AI platform through spoken commands and hands free computing.
Google rolls out Nano Banana 2 as its default AI image generation tool, enhancing prompt accuracy, visual quality and enterprise integration capabilities.
OpenAI is reportedly developing a new voice model as it prepares for an AI hardware launch, signalling a stronger push into multimodal and voice-based AI systems.
Aionos expands multimodal AI and agent based solutions as its CTO forecasts rapid adoption across healthcare and enterprise sectors.
Healthify launches Ria Voice, a real time multimodal AI health coach built with OpenAI to offer personalised voice based wellness guidance.
Google unveils Gemini 3, stating major improvements in reasoning, math, coding and multimodal tasks, positioning it ahead of leading AI models in global benchmark tests.
Multimodal AI is reshaping customer engagement by combining text, voice, image, and video insights to deliver real-time, personalized, and emotionally intelligent brand interactions across industries.
Agora partners with OpenAI to integrate Realtime API, enabling multimodal AI agents that deliver seamless real-time voice, video, and text interactions for enterprises.