May 21, 2024 The fastest voice model

Today’s newsletter was brought to you by Mercury.

Deeper Learning

Written by Aaron O’Leary and Sarah Wright.

Hey folks! It’s Tuesday, you know what that means — it’s time to go deeper.

In today’s edition, we’re talking about a new AI tool that will handle all of your meeting prep, a new super fast AI voice model from Stanford PhDs, and of course some of the best AI tools that have launched over the last week.

Let’s dive in!


This AI-powered tool will handle your meeting prep for you

TimeOS is on a mission to create a more autonomous productivity system. It was launched in 2022 as a Chrome extension that lets you easily take notes while in the middle of a meeting, and now the team is back with a big update. 

With the new TimeOS, your scheduled meetings sync automatically, and you can hover over any of them to pull in context from Google Calendar, LinkedIn, and previous meeting notes. It’s the same new tab interface you may have become accustomed to, but with an automated approach.

You can also chat to Prep, the company’s always-on AI-powered voice assistant. Prep is like Iron Man’s Jarvis, but for meetings. You can gain more context by asking, “Who introduced me to this person?” or “Give me a good icebreaker to open with.” You can even get it to handle your calendar by telling it to set up or reschedule calls. 

It will also send you proactive AI reminders about your calls, giving you updates on things like the proposed agenda, important discussion points, and information from previous calls, so you won't have to scramble through meeting notes. 

To top it all off, TimeOS will automatically take notes during your meeting. Once you’re finished, it will format them using community-built templates designed for maximum efficiency and send them to you, making it easier to review the most important points. 


🤖 Actual robots: If you were following OpenAI before the company launched ChatGPT, you might remember its robotics division. The company abandoned those dreams in 2020 for a number of reasons. Fast forward to the present day, and the AI company is hiring Robot Engineers once again.

🍇 Blowin’ raspberries: Raspberry Pi, the darling micro-computer of hackers, developers, and general builders everywhere, is getting an AI upgrade. The company is partnering with Hailo to offer an AI add-on for $70.

Stanford PhDs release their faster, ultra-realistic generative voice model

Voice technology has lagged for the last decade. We started envisioning a futuristic universe when we got voice assistants like Siri, but haven’t got much further than speakers that can turn on a light. One company called Cartesia wants to take it that bit further with AI. 

Cartesia Sonic is an AI voice model built on a state space model (SSM), an architecture the founders invented during their PhDs at the Stanford AI Lab. The team (which also boasts backgrounds from Google Brain and Snorkel AI) has spent years building the theory behind SSMs, which are now used in academia and industry for vision, robotics, and biology. 

The SSM architecture enables Sonic to quickly process vast amounts of data, providing seamless and natural voice interactions. With Sonic, you can generate high-quality, lifelike speech with minimal latency. The AI boasts a response time of just 135ms, making it ideal for applications requiring real-time feedback, such as customer support, entertainment, and content creation.

One of the standout features is Sonic’s ability to customize voices on the fly. Users can adjust parameters like speed and emotion and instantly clone voices for different needs.

Sonic also excels in speech generation, achieving a word error rate half that of traditional models and a quality score one point higher. In practice, that means the speech it generates is both easier to understand and more natural-sounding.

Cartesia’s larger agenda is to expand well beyond voice, aiming for its models to instantly understand and generate content in any modality across any device.


For Work

  • Bolty turns videos into short-form content in a single click.

  • BuglabAI is like spellcheck but for correcting UI/UX mistakes.

For Life

  • Eve is an AI-powered coach designed to relieve work-related stress.

  • Oh, a Potato is an AI-powered recipe app designed to minimize food waste. 

  • Fryderyk is an AI-powered companion to help with making music.

For Developers

  • Standuply launched a tool that uses AI for time-tracking in Jira.

  • MarsCode is a GPT-4-powered, cloud-based IDE.

Thanks for going deeper with us!

Here via forward? Subscribe here.

