scroll to top arrow or icon

{{ subpage.title }}

Midjourney

Too scruffy for Zoom? Send in the AI

Have you ever had to get in front of a camera, but you really, really didn’t want to? Maybe you were too tired, too lazy, too disheveled to film something that day. What if a proxy could handle that for you? Well, now that’s possible.

Using Synthesia, an AI-powered video tool, I created a virtual avatar of myself. It’s essentially a digital puppet constructed from my skin, with invisible strings that carefully lift my eyelids and my eyebrows, open and close my mouth to align with the words I want it to say. My ventriloquism is commanded by a text prompt – a string of words I have written for this virtual Scott to say aloud.

Synthesia is a British startup founded in 2017 by a global cohort of researchers from Stanford, University College London, Technical University of Munich, and Cambridge who have raised $156 million in venture capital. It’s a pricey tool — starting at $22 a month, with a $67 a month tier getting you more features and hours of video, and custom pricing for enterprise use — but the kind people at Synthesia allowed me to test it out for free.

Alright, my avatar will take it from here:

Synthesia

There’s a common term in science fiction and tech criticism called the “uncanny valley,” a phenomenon that occurs when humans see something that seems nearly human. It evokes an eerie feeling, one I felt watching the fake version of myself speak on screen.

Everything with Synthesia seems nearly right. My voice sounds nearly right, and my face nearly moves like it should when mouthing the words I wrote. But it’s not quite there yet, and that disparity could mean the difference between success and failure. Having an avatar you can effectively deploy for a sales presentation is great — but one that simply creeps out your clients is a waste. (The company also offers hundreds of premade avatars you can use if you don’t want to appear, in any form, “on camera.”)

But this is the simple, at-home version. It takes 10 minutes to film — I followed a script and recorded it at my kitchen table — and Synthesia had it ready for me a day later. Once you record a video using your avatar, it generates in mere minutes.

There’s a studio version too that costs $1,000 per year on top of a subscription. You can go to one of the company’s partner studios in Europe or North America and get an improved expressive avatar with a transparent background that you can drop into any presentation. It uses AI to read your text prompt and match the emotion it thinks you want to convey to your avatar’s face and voice.

On a Zoom call, Alexandru Voica, Synthesia’s head of corporate affairs and policy, walked me through the product’s many features and showed me a preview of where the technology is going. He said the company is almost exclusively focused on enterprise solutions for businesses, intending for the technology to be used for training videos, sales pitches, and marketing material. That said, he’s seen some consumer uses too including a social media account that used the avatars to make history-focused videos.

To prevent deception and misinformation, Synthesia has strict content standards. It doesn’t allow profanity, hate speech, or misinformation. “We’re not a marketplace of ideas. We don’t pretend to be a social media company. We’re pretty much an enterprise-focused video solution platform, therefore we don’t need to necessarily have these philosophical debates about harmful content and what’s misinformation and what’s not misinformation. We’ve set very robust rules in place,” Voica said. It doesn’t even allow you to record news content unless you’re a news organization with an enterprise subscription. And it checks that every avatar created is filmed by the person it claims to be to prevent nonconsensual deepfakes. That way, the content moderation happens at the point of creation, rather than trying to stop its distribution.

Synthesia, Voica maintains, is for work rather than personal use. That’s a different tone than many generative AI companies trying to prove their worth to consumers. Later this year, Voica said, Synthesia is releasing a choose-your-own-adventure platform for video creation that allows viewers to personalize the content they receive.

But crossing that uncanny valley — for the at-home avatars, at least — will be key for the company’s success. Readers of this newsletter will recall a few months ago when I tested out the ElevenLabs voice cloning technology, which I gave high marks.

Synthesia performs nearly as well for audio — it’s slightly more robotic and unnatural, but still very good. But, the person you see on the screen needs to seem either fully human or fully AI — and, while the technology may improve, nearly human might not be good enough.

A Pinocchio puppet.

Tell me lies, tell me sweet little AIs

Generative AI models have been known to hallucinate, or make things up and state them as facts (in other words, lie). But new research suggests that despite that shortcoming, AI could be a key tool for determining whether someone – a human – is telling the truth.

Read moreShow less
IMAGO/Wolfgang Maria Weber via Reuters Connect

What is “human-washing”?

You’ve heard of greenwashing, pinkwashing, and sportswashing. But what about human-washing? That’s a newfangled term reserved for those scenarios when artificial intelligence pretends to be, well, human. AI researcher Emily Dardaman used the term in an interview with Wired after seeing a startup claim “We’re not AIs” while using a deepfake version of its CEO in an ad.

Read moreShow less
three girls in graduation gowns hold their caps in the air
Photo by Leon Wu on Unsplash

Hard Numbers: Professor ChatGPT, SoftBank’s search engine play, Nokia goes shopping, Voice actors are worried

10: Generative AI is sweeping academic research. According to one estimate, about 10% of all academic articles published this year will contain some artificial intelligence-generated text. That’s about 150,000 papers per year.

Read moreShow less
Midjourney

Is Claude funny now?

Meet Claude – Claude 3.5 Sonnet, that is. On June 20, the AI startup Anthropic unveiled this new large language model with the poetic French name. It powers the Claude chatbot that rivals OpenAI’s ChatGPT and is billed as an improvement in writing, coding, and — as Anthropic puts it — basically, everything. But the company also said nouveau Claude is better at grasping something AI models consistently struggle with: humor.

“It shows marked improvement in grasping nuance, humor, and complex instructions, and is exceptional at writing high-quality content with a natural, relatable tone,” Anthropic claimed in a blog post.

Read moreShow less
Midjourney

Welcome to your AI video fever dream

Generative AI lets people craft sprawling essays, create detailed images, and even clone their own voice with remarkable precision. But taking an AI-generated video service for a spin made me realize that the technology is still far from creating convincing or cinematic video. In fact, the entire experience was surreal.

Luma AI’s Dream Machine, a free text-to-video service, warns users that they’re limited to 10 videos per day, and 30 videos per month, due to high demand — unless they pay at least $29.99 a month for the starting subscription tier. But I only needed to wait a couple of minutes to get my first prompts turned into … very, very strange videos.

Read moreShow less
Will AI further divide us or help build meaningful connections?
Will AI further divide us or help build more connections? | GZERO AI

Will AI further divide us or help build meaningful connections?

In this episode of GZERO AI, Taylor Owen, professor at the Max Bell School of Public Policy at McGill University and director of its Centre for Media, Technology & Democracy, takes stock of the ongoing debate on whether artificial intelligence, like social media, will further drive loneliness—but at breakneck speed, or help foster meaningful relationships. Further, Owen offers insights into the latter, especially with tech companies like Replika recently demonstrating AI's potential to ease loneliness and even connect people with their lost loved ones.

Read moreShow less

A student of National University of Malaysia walks past displays of the country's "Stripes of Glory" flags at its campus in Bangi outside Kuala Lumpur August 22, 2007.

Bazuki Muhammad via Reuters

Hard Numbers: Microsoft takes Malaysia, Massive (and unknown) startup, Safety first, Don’t automate my news

2.2 billion: Microsoft has its eye on Southeast Asia. The computing giant announced it’ll pour $2.2 billion into Malaysia’s cloud infrastructure over the next four years and will establish a national AI center with the government. This investment is the latest in a string of Microsoft infusions in local economies to help develop AI: In the past month, the company announced a $2.9 billion investment in Japan, $1.7 billion in Indonesia, and a new data center in Thailand, plus a $1.5 billion stake in the UAE firm G42.

Read moreShow less

Subscribe to our free newsletter, GZERO Daily

Latest