OpenAI Gives ChatGPT a Voice and Adds Audio and Image Prompts
OpenAI has expanded ChatGPT, its AI chatbot, with new multimodal features: it now supports spoken conversations and can process images in addition to text. These features are currently limited to ChatGPT Plus and ChatGPT Enterprise subscribers, but they are expected to reach free users and developers in the near future.
ChatGPT Speaks
The most significant update to ChatGPT is its new ability to understand speech and respond in synthesized, human-like voices, so users can hold back-and-forth spoken conversations with it. OpenAI’s speech recognition system, Whisper, transcribes the user’s spoken words into text. ChatGPT offers five different voices, each synthesized from the voice of a professional actor.
Alongside this update, OpenAI announced a partnership with Spotify on a podcast translation feature. Podcast hosts will be able to create synthetic models of their own voices that read translated transcripts of their shows aloud, which lets podcasters reach a wider audience while maintaining the authenticity of their content.
ChatGPT Sees
OpenAI has also given ChatGPT the ability to understand images. Users can take a picture or select an existing image, show it to the chatbot, and get responses based on the visual input, which opens up possibilities for image-based question answering and visual storytelling. On the ChatGPT mobile app, a drawing tool lets users highlight specific parts of the image for the AI to focus on.
OpenAI has taken precautions to ensure responsible usage of the image understanding feature. The models used for image analysis have been tested for risk and have limitations in place to protect individuals’ privacy and prevent misuse.
With these new voice and image features, OpenAI aims to enhance the conversational and analytical capabilities of ChatGPT, making it a more versatile and interactive AI assistant.