July 2, 2024 Dictate your work to AI

Deeper Learning

Hey there!

Last week, our subject line, “talking to machines,” was about a new AI model for hyper-realistic voice-to-voice conversations. This week, I could use the same subject line, but today’s One Big Read features a product that’s much more applicable and useful to most of you. It’s a transcription tool that sees what your computer is seeing.

Let’s get into it! - Sarah Wright

ONE BIG READ

Dictate your work to your computer

Image: Kyutai

Context matters. AI has demonstrated this for us again and again, like in this photo of “salmon” swimming upstream. 😆

In the past couple of weeks, we’ve seen startups trying to push AI further by making it context-aware. In our Product Hunt Dev tools newsletter, I wrote about Pieces Copilot+ and its Live Context feature, which captures context across everything devs are working on in real time to help them work faster and across tools.

Now TalkTastic is trying to tackle this for the rest of us with a context-aware AI voice keyboard app. It integrates across all your macOS apps, from Slack to Messages to your browser, not only to transcribe your speech accurately but also to refine and rewrite what you say based on your screen’s context.

So imagine you’re going about your day-to-day. With TalkTastic, you’d hit the little microphone button and start dictating your reply to an investor in an email. The tool will write your reply while fixing any vocal tics, adjusting words for tone, and spelling the names of your investors correctly. Then hop over to Slack and shoot your coworker a DM, and TalkTastic adjusts for that audience and context. In other words, TalkTastic interprets what you're saying based on what it sees on your computer screen.

“TalkTastic combines the capabilities of Apple Dictation, on-device Whisper, ChatGPT, Claude, and Google Gemini into one powerful, easy-to-use package,” explains the website. Founder and CEO Matt Mireles says the team plans to publish a scientific research paper on its “breakthrough in speech recognition.” It’s not Mireles’s first rodeo. He previously founded a Google Ventures-backed startup working on video transcription powered by speech recognition, which was acquired in 2012. He then went on to work as an investor and founded OASIS AI, which you can think of as a proof of concept for TalkTastic.

Ready to bring voice features to life, but unsure where to start? Imagine launching your first Speech AI product with confidence. No more worrying about accuracy, complex integrations, or handling real-world speech variations.
Build with industry‑leading Speech-to-Text models. Our API offers:

  • High-accuracy transcription that handles accented speech, background noise, and natural conversations

  • Easy integration with models for speech recognition, speaker detection, summarization, and more

  • Low-latency processing for real-time applications

Start your Speech AI journey with 100 free transcription hours. Don't just build: innovate with AssemblyAI!

PRODUCT HIGHLIGHT

Test and debug your APIs faster with this tool

If you're a seasoned developer, you know the importance of testing your code, especially if you're gearing up to launch a product to potentially millions of people. After years of building, testing, and releasing products to massive audiences, founders Abhishek Saikia and Sourabh Gawande decided to collaborate and explore ways to make the testing process more intuitive. The result is KushoAI.

Kusho is an AI agent explicitly designed for API testing. It helps developers automatically find bugs in their APIs before they deploy them. Drop in an API spec, and Kusho will generate an exhaustive set of tests based on real-world scenarios to identify any issues that could break your code in production.

From there, you can run each test individually if you want to go through everything with a fine-tooth comb or bundle them up and run them simultaneously. Kusho will then use AI to generate detailed assertions for each scenario so you can test the accuracy and reliability of your APIs.
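For a rough feel of what spec-driven test generation means, here is a minimal sketch in Python. It is not Kusho's method or API: the `SPEC` format, the `/users` endpoint, and the `generate_cases` helper are all made up for illustration, and the rules are hand-written rather than AI-generated. It turns a tiny endpoint description into one valid case plus several negative cases, each with an expected outcome you could assert against.

```python
from typing import Any

# Hypothetical, hand-written spec for one endpoint.
# This is an illustrative format, not Kusho's input format or OpenAPI.
SPEC = {
    "method": "POST",
    "path": "/users",
    "fields": {
        "name": {"type": str, "required": True, "example": "Ada"},
        "age": {"type": int, "required": False, "example": 30, "min": 0, "max": 150},
    },
}


def generate_cases(spec: dict[str, Any]) -> list[dict[str, Any]]:
    """Derive one valid case and several negative cases from the spec."""
    fields = spec["fields"]
    valid = {name: rule["example"] for name, rule in fields.items()}
    cases = [{"name": "valid payload", "payload": valid, "expect_ok": True}]

    for name, rule in fields.items():
        # Dropping a required field should be rejected.
        if rule["required"]:
            payload = {k: v for k, v in valid.items() if k != name}
            cases.append({"name": f"missing {name}", "payload": payload, "expect_ok": False})

        # Sending the wrong type should be rejected.
        wrong = 123 if rule["type"] is str else "not-a-number"
        cases.append({"name": f"wrong type for {name}", "payload": {**valid, name: wrong}, "expect_ok": False})

        # Values just outside the declared range should be rejected.
        if "min" in rule:
            cases.append({"name": f"{name} below min", "payload": {**valid, name: rule["min"] - 1}, "expect_ok": False})
        if "max" in rule:
            cases.append({"name": f"{name} above max", "payload": {**valid, name: rule["max"] + 1}, "expect_ok": False})

    return cases


cases = generate_cases(SPEC)
for case in cases:
    print(case["name"], "->", "accept" if case["expect_ok"] else "reject")
```

A real tool would send each payload to the API and compare the response with the expectation. The sketch stops at generating the cases, which is the part the description above focuses on.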

Like other developer-oriented AI apps, Kusho weaves together a variety of services to create something bigger than the sum of its parts. One of them is Groq, an alternative to OpenAI that Kusho's founders described as a "blazingly fast large language model for code generation."

MORE TOOLS

For productivity

  • cre[ai]tion lite lets you create AI images and objects using a visual workflow.

  • Jobright is an AI job search copilot for opportunities tailored to your goals.

  • TTSynth.com converts text into natural-sounding speech.

  • Hubflo lets you use AI to create a white-label client portal.

  • Vitamin AI helps automate marketing, sales, customer support, administrative, and recruiting tasks.

  • BrainyAI is a free browser sidebar plugin for AI chat, search, and web browsing.

For makers

  • MindPal lets you build customizable AI agents trained on your specific knowledge sources.

  • Released generates stunning release notes from Jira tickets.

Thanks for going deeper with us!
