OpenAI unveils DALL-E 3 with support for text and typography

Plus: Elon Musk's attempt to hide the deaths of Neuralink's primate test subjects

Welcome to The Dispatch! We are the newsletter that keeps you informed about AI. Each weekday, we scour the web to aggregate the many stories related to artificial intelligence; we pass along the news, useful resources, tools or services, technical analysis and exciting developments in open source. Even if you aren’t an engineer, we’ll keep you in touch with what’s going on under the hood in AI.

Good morning. Today in AI:

  • WIRED exposes Elon Musk’s attempt at hiding the deaths of Neuralink’s primate test subjects

  • Meanwhile, Neuralink is now officially looking for human patients!

  • OpenAI announces improved DALL-E 3 text-to-image service coming next month

  • Amazon Alexa will be powered by its own large language model

  • Add ‘Game of Thrones’ author George R.R. Martin (and others) to the growing list of authors suing OpenAI

  • Google’s search engine is prioritizing an AI-generated selfie over a famous historical Tiananmen Square photo in search

  • A Reddit thread discussing China’s AI ambitions, trending tools & more

Image: OpenAI/DALL-E 3

The story: OpenAI has unveiled the third version of their AI image generation model called DALL-E, which can generate high-quality images from text prompts. DALL-E 3 looks like a huge improvement over the lackluster (compared to other state of the art offerings) DALL-E 2 model, with enhanced abilities to render readable text and understand spatial relationships described in prompts.

More details:

  • DALL-E 3 can now generate images with readable text baked directly into them. This allows for more natural integration of text instead of overlaying it, including font control.

  • The model has a better grasp of spatial relationships described in prompts, allowing prompts to be rendered more accurately in the generated image. A prompt describing separate objects in relation to each other can be translated visually.

  • OpenAI says the model understands context much better than DALL-E 2, and that they have taken steps to limit harmful or biased content present in the older models.

Takeaways: DALL-E 2 came out nearly a year and a half ago, so these improvements aren’t totally unexpected. But this offering might put OpenAI ahead of new entrants like Ideogram and continuing innovators like Midjourney. OpenAI plans to roll out DALL-E 3 to premium ChatGPT users in October.

A new poll commissioned by the AI Policy Institute reveals that 63% of Americans surveyed want regulation to actively prevent the development of AI that is smarter than humans. Vox argues that the development of society-altering technologies like AGI should involve more democratic input and oversight from the populations who will be impacted.

“Building AGI is a deeply political move. Why aren’t we treating it that way?”

Amazon has announced an upgrade to its Alexa voice assistant that incorporates AI to make conversations more natural and human-like. The new Alexa LLM will be available as a free preview to US users soon. It will offer real-time information, reduced latency, and the ability to perform actions like sending messages or making recommendations, not just answering questions.

Amazon will also allow developers to integrate their own custom LLMs with Alexa. Some of the new capabilities enabled by Alexa's upgrades include more expressive and contextual text-to-speech, improved automatic speech recognition using a massive transformer model, and visual ID so Alexa recognizes users just by looking at them on Echo Show devices for personalized responses.

Obligatory note: Amazon shares your personal data with social media and marketing companies. Amazon employs hundreds of people to read and annotate the transcripts of voice recordings to ‘improve the Alexa service’, but claims Alexa does not record unless it is activated.

From our sponsors:

Unlock Top Investments, Built For You

Join over half a million investors, who like you, are on their investment journey with us. A library of expert stock recommendations are only a few clicks away. Join today and start potentially multiplying your net worth.

-

We’ve been covering retrieval augmented generation quite a bit lately: RAG is a technique that uses large language models to generate responses augmented by retrieving and incorporating relevant information from a search backend. But the common approach of simply embedding a user's query and searching over vector databases is limited.

"Query understanding" - using language models to parse the user's input - can help plan a more sophisticated search strategy, and dispatch queries across multiple backends like databases, search engines, email, calendars, etc. Rather than just returning text search results, these systems can reformat outputs for each backend and combine results into a unified response - a powerful paradigm for building language model APIs that can interoperate cleanly with existing infrastructure.

Trending AI Tools & Services:

  • Spacebar: speaking is more natural than writing - turn your conversations into tangible insights and solutions

  • Dubbah: clone your voice and translate your videos to 28 different languages

  • Helpkit AI: turns your Notion knowledge base into a smart, 24/7 AI assistant that provides precise and instant answers to your users

  • Rise: a beautiful AI-powered calendar that uses hundreds of signals to find the best times to meet, resolve conflicts and blocks off focus time

  • Superhuman: AI-powered email built for high-performing teams

  • Risk Assessment AI: automates the completion of security questionnaires received from customers and prospects

Guides/useful/lists:

Social media/video/podcast:

  • [Beyond GPU] AI hardware for computer vision - with Adam Burns of Intel [Podcast]

  • (Discussion) China aims to replicate human brain in bid to dominate global AI [Reddit]

  • (Elon Musk) The first human patient will soon receive a Neuralink device. This ultimately has the potential to restore full body movement. [X]

  • This latest DALL-E model is absolutely incredible, I have been blown away by what it is able to generate. [X]

  • How to install Stable Diffusion XL while you wait for DALL-E 3 [YouTube]

  • Funky AI-generated spiraling medieval village captivates social media [Ars Technica]

Did you know? 

Uber is rolling out new AI-powered features and accepting additional payment options on its Uber Eats food delivery platform (including federal healthcare program waivers and SNAP benefits), in an effort to target low-income households. The company plans to launch a Google-powered AI assistant to help users find deals and explore food options, as well as a new "Sales Aisle" section to showcase promotional offers. The moves come as food delivery platforms increasingly invest in AI and attempt to offer more curated, convenient service.

Imagine if Stephen Hawking had had this.

Elon Musk, Founder of controversial brain implant company Neuralink, September 2023