The AI Download is your weekly guide to navigating the rapidly evolving world of AI and digital technology. Written by Jim Christian, a digital strategy consultant and former tech educator, this newsletter cuts through the noise to deliver practical insights and actionable strategies. Each week, you’ll get behind-the-scenes access to real-world experiments with cutting-edge AI tools, automation strategies, and emerging technologies.
AI Voice Cloning and Video Generation Are Revolutionizing Content Creation
The Download #014
February 21st, 2025
Dear Reader,
I spent most of this week and last keeping my poorly kids entertained at home while trying to look for contract work and create an online course with video. It became clear very quickly that I wasn't going to be able to do everything, but I had nobody else on hand to help me hit my targets. That is, unless you count...me.
On one of those days I had an unexpected two-hour gap where there was nobody in the house - that's all the time it took for me to create a new AI clone of myself to help get some of my heavy lifting done.
As we move deeper into 2025, AI continues to transform how we create and consume content. This week, I’m diving into the latest advancements in voice cloning and text-to-video technology that are making professional content creation more accessible than ever.
Let's get to it.
Text-to-Video: From Words to Visual Stories
Text-to-video technology has evolved dramatically in recent years, with several key players transforming how we create video content. Synthesia, which bills itself as “the world’s first AI video communications platform,” launched Synthesia 2.0 in December 2024, allowing users to transform texts, PowerPoints, PDFs, or URLs into professional videos.
While Synthesia offers impressive capabilities like multilingual translation across 120+ languages with automatic updates, it’s important to note that it’s primarily designed for corporate-style videos rather than storytelling or cinematic visuals. The platform excels in training videos, internal communications, and customer support content.
Other significant players in this space include:
Runway: A powerful AI video creation platform with features like text-to-video, image-to-video, and advanced camera controls. It's more creative in nature, akin to Sora or Midjourney.
HeyGen: Another leading competitor offering AI avatar technology and video generation capabilities. HeyGen is very popular among the AI influencer crowd as it allows users to easily chunk up videos in formats ready for social media.
For businesses and educators, these text-to-video platforms offer several advantages:
Elimination of expensive studio time and equipment
Access to pre-designed templates (over 300 in Synthesia’s case)
This was my second take at creating a Synthesia clone. The first time, I didn't take into account my background, lighting, how I was framed in the shot, or where my eyes were looking! This second time around, I'm much happier with the results. Synthesia does take about 24 hours to turn a clone around, so it's worth trying to get these things right the first time.
The future of this technology looks promising, with Synthesia planning to introduce interactive capabilities like clickable hotspots, embedded forms, quizzes, and personalized calls-to-action.
The platform does have some limitations, though, and I'm not completely sold on it yet. On the lowest plan, I'm limited to 10 minutes of video creation a month. I was able to "shoot" 7 course intro videos and keep them all around a minute, but I'm not at the point where I'm generating full courses with it...yet.
However, potential users should evaluate which platform best suits their specific needs, as each has different strengths in terms of avatar realism, customization options, and output quality.
Voice Cloning: Finding Your Digital Voice
Voice cloning technology has also made remarkable strides, with the market projected to grow at a 26.1% compound annual growth rate (CAGR) from 2023 to 2030. Today’s tools can create incredibly realistic synthetic voices from just short audio samples.
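To put that growth rate in perspective, here is a minimal Python sketch of what compounding 26.1% a year adds up to. It assumes seven yearly compounding steps between 2023 and 2030, which is my reading of the projection rather than something the forecast spells out, and the function name is purely illustrative.

```python
def growth_multiple(cagr: float, years: int) -> float:
    """Total growth factor after compounding `cagr` once per year for `years` years."""
    return (1 + cagr) ** years


# Assumption: 2023 is the base year, so 2030 is 7 compounding steps later.
years = 2030 - 2023
multiple = growth_multiple(0.261, years)

print(f"A 26.1% CAGR over {years} years multiplies the starting size by about {multiple:.1f}x")
```

In other words, a market growing at that pace would end the period at roughly five times its starting size.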
Several platforms stand out in 2025, with ElevenLabs continuing to lead innovation in the space. The company just made headlines this week by expanding beyond voice generation with the launch of Scribe, its first standalone speech-to-text model. This new offering claims an impressive 97% accuracy rate for English, outperforming competitors like Google Gemini 2.0 Flash and OpenAI’s Whisper Large V3 in benchmark tests.
What makes Scribe particularly impressive is its support for over 99 languages, with 25 languages achieving “excellent accuracy” (less than 5% word error rate), including English, French, German, Hindi, Japanese, Polish, Spanish, and Ukrainian. The model includes advanced features like speaker diarization, word-level timestamps, automatic sound event tagging, and direct video transcript capabilities.
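If "word error rate" is new to you, the sketch below shows the usual textbook definition in Python: the number of word substitutions, insertions, and deletions needed to turn the transcript into the reference text, divided by the number of words in the reference. This is not ElevenLabs' benchmark code. The lowercasing and whitespace splitting are simplifying assumptions, and the sample sentences are made up for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (classic Levenshtein, row by row).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # a reference word is missing from the transcript
                curr[j - 1] + 1,     # the transcript has an extra word
                prev[j - 1] + cost,  # words match, or one is substituted
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one wrong word out of nine.
reference = "the quick brown fox jumps over the lazy dog"
transcript = "the quick brown fox jumped over the lazy dog"
wer = word_error_rate(reference, transcript)
print(f"WER: {wer:.1%}")
```

By this measure, a word error rate under 5% means fewer than one mistake for every 20 words spoken.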
Other notable voice AI platforms include:
Resemble AI, focusing on security and deepfake detection alongside professional voice cloning
Play.ht, providing high-fidelity cloning with extensive voice control settings
The most exciting developments include multilingual capabilities that maintain the speaker’s unique vocal characteristics across languages, and emotional expressiveness that conveys subtle nuances from joy to empathy.
I've been using ElevenLabs for more than a year and run my podcast on it, using a mix of the available system voices and a re-recorded clone of my own voice that I "remastered" this week.
I haven't found anything better than ElevenLabs yet - for my money it's the one to watch.
🛠️ Tool of the Week: Obsidian – A Powerful Free Note-Taking App
If you’re looking for a flexible, offline-first note-taking tool that keeps your thoughts organised without locking you into a specific ecosystem, Obsidian is worth checking out.
Unlike other note-taking apps, Obsidian is built on plain-text Markdown files, meaning your notes are fully in your control—no forced cloud storage, no proprietary formats. But the real magic happens when you start linking your notes together.
I've yet to find one app that "does everything" in terms of dashboards and second brains, but Obsidian is pretty damn close (sorry, Notion 😢), primarily because I can access it all offline and my data's not going to someone else's cloud unless I expressly want it to.
You can even use third-party or local AI to query your notes.
Key features:
✅ Bi-directional linking – Connect ideas and build your own knowledge web
✅ Offline-first – No internet? No problem
✅ Cross-platform – Sync across your computers, tablets and phones, wherever you are
✅ Markdown support – Keep your notes lightweight and future-proof
✅ Customisable with plugins – Supercharge it with themes, templates, and automation, including AI integration
✅ Free for personal use – No subscriptions needed
🚀 Best Use Case? Obsidian is perfect for knowledge workers, researchers, solopreneurs, and creatives who want a powerful, distraction-free way to organise their thoughts—whether it’s for brainstorming, writing, project planning, or even journaling.
💡 Pro Tip: Start small. Use daily notes + linking to create a simple second brain, then explore community plugins when you’re ready.
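For the curious, here is a rough Python sketch of the idea behind bi-directional linking. Obsidian notes link to each other with [[Note Name]] wikilinks inside plain Markdown, so a few lines of code can work out which notes point at which. This is only an illustration, not how Obsidian itself is implemented. The note names and contents are invented, and the notes live in a dictionary rather than in files to keep the example self-contained.

```python
import re
from collections import defaultdict

# Matches [[Target]], [[Target|alias]] and [[Target#heading]], capturing only the target.
WIKILINK = re.compile(r"\[\[([^\]|#]+)(?:#[^\]|]*)?(?:\|[^\]]*)?\]\]")


def build_backlinks(notes: dict[str, str]) -> dict[str, set[str]]:
    """Map each linked note name to the set of notes that link to it."""
    backlinks = defaultdict(set)
    for name, text in notes.items():
        for match in WIKILINK.finditer(text):
            target = match.group(1).strip()
            backlinks[target].add(name)
    return dict(backlinks)


# Hypothetical vault: a daily note plus two topic notes.
notes = {
    "2025-02-21": "Recorded a new avatar. See [[Synthesia clone]] and [[Voice cloning|my voice notes]].",
    "Synthesia clone": "Lighting and framing matter. Related: [[Voice cloning]].",
    "Voice cloning": "Remastered my clone this week.",
}

backlinks = build_backlinks(notes)
for target, sources in sorted(backlinks.items()):
    print(f"{target} is linked from: {', '.join(sorted(sources))}")
```

Because the notes are just text, the same approach would work on a folder of .md files, which is a big part of why plain Markdown keeps your notes portable.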