The week in AI: ChatGPT news and upgrades

Plus: The Writers Guild strike is officially over

Welcome to The Dispatch! We are the newsletter that keeps you informed about AI. Each Thursday, we aggregate the major developments in artificial intelligence; we pass along the news, useful resources, tools or services, and exciting projects in open source. Even if you aren’t an engineer, we’ll keep you in touch with what’s going on in AI.

ChatGPT is going multimodal: OpenAI has announced that the model can now ‘see, hear and speak’. New image and voice capabilities will let you show ChatGPT what you’re talking about, or have a spoken conversation with it. OpenAI also announced that web-browsing capability is back in ChatGPT through Bing, complete with direct links to sources.

The web-browsing update is live now; the voice and image capabilities will be rolled out over the next two weeks. Voice is coming on iOS and Android (opt-in in your settings), and images will ultimately be available on all platforms including the free version of ChatGPT.

In our social media/videos section below (ChatGPT has been supercharged!), Two Minute Papers covers the upcoming multimodality. We share their enthusiasm - pretty incredible stuff from OpenAI.

Google Bard users' private conversations with the chatbot are being indexed by Google Search, potentially exposing sensitive information. SEO consultant Gagan Ghotra discovered links to Bard chats showing up in search results. Bard provides an option to share conversations with colleagues or friends, but most users likely aren’t aware that sharing a chat could make it publicly searchable.

After getting called out on social media, Google acknowledged this was ‘unintended’ and said it is working to prevent Bard conversations from appearing in search results without users' consent. These repeated privacy issues reflect poorly on Google, which is currently in the midst of an antitrust case.

Amazon is investing up to $4B in AI startup Anthropic as part of a broad collaboration to “develop the most reliable and high-performing foundation models in the industry”. AWS users will find Anthropic’s Claude LLM more deeply embedded through Amazon Bedrock, including model customization and fine-tuning.

This looks like a big win for both sides. Anthropic is building an increasingly impressive portfolio of enterprise users across industries, including LexisNexis, Sourcegraph and thousands of other companies. If you haven’t given Claude a shot yet, you should - it’s excellent. Anthropic recently posted a helpful ‘prompt-engineering’ guide to help users make the most of Claude’s impressive 100k-token context window (great for analyzing long documents/research - even books).

Spotify is launching a new pilot program that uses AI to provide translations of podcasts into different languages - all while retaining the original speaker's voice. ‘Voice Translations’ will leverage OpenAI's new voice generation capabilities to mimic the speaking style and vocal characteristics of podcast hosts as their shows are translated. Several top podcasters are participating in the initial trial, and voice-translated episodes from pilot creators will be available worldwide to both premium and free users. We listened to a Lex Fridman episode in Spanish - the translation and the voices (even down to the intonation) aren’t perfect, but they are still very compelling.

Meta’s Connect 2023 is a two-day virtual event this week focused on the tech giant’s AI and virtual, mixed and augmented reality developments. The Quest 3 mixed-reality headset has reportedly already sold over 20 million units. Early reports on it are mostly positive, and it’s reasonable to assume a large chunk of early adopters in America will soon be experiencing a new version of the metaverse.

More AI this week:

Trending AI Tools & Services:

  • Copilot by CommandBar: an embedded AI assistant that contextually teaches users and can complete actions on their behalf

  • GPT Slides Maker: converts text descriptions, summaries of videos/PDFs/web pages into visually appealing slides in seconds

  • Podwise: AI-powered mindmapping app for podcast listeners

  • MyAsk AI: Create your own ChatGPT, add your content, launch it anywhere - updated with Slack AI assistant connected to your Google Drive/Notion

  • Antimetal: Save 75% on your AWS bill in 2 minutes

  • Backyard Design AI: your dream backyard, landscaped in under an hour

  • (For devs) Digma: see what your code is doing wrong - as you code - in the IDE

Guides/useful/lists:

Social media/videos/podcasts:

  • The impact of generative AI in healthcare - with athenahealth’s Senior AI Architect [Podcast]

  • ChatGPT has been supercharged! [YouTube]

  • Cheat at life by automating your tasks with AI [YouTube]

  • 21 AI illusions and how to make them [X]

  • AI is making the future of the internet look grim [X]

  • (Discussion) An Nvidia AI scientist breaks down Tesla’s Optimus robot [Reddit]

  • (Discussion) ChatGPT can now code from a whiteboard drawing. Wow. [Reddit]

Open source/technical:

That’s it for the week! See you next Thursday.