Planning, scripting, recording narration, designing subtitles, editing. A single short typically takes 2-3 hours, right? Now you just type a topic in the terminal and you're done. AI handles everything from script to final video.

TL;DR
Enter topic AI generates script Free TTS narration Auto video synthesis Finished short

Gemini free tier + Edge-TTS (free) + FFmpeg (free). Completely free.


What Is It?

The short-form video market is booming. Valued at roughly $34.8 billion in 2024, it's projected to grow at over 30% CAGR to reach $289.5 billion by 2032. Over 90% of marketers report positive ROI from short-form video ads.

But here's the problem — consistently creating short-form videos is genuinely exhausting. You need to post 3-5 times a week to ride the algorithm, and planning, filming, and editing every single one is nearly impossible for individuals or small teams.

Open-source AI shorts generators are popping up to solve exactly this. The core technology boils down to three things:

1
LLM Script Generation

Models like Gemini, GPT-4, and DeepSeek automatically write video scripts from just a topic.

2
Free TTS Narration

Microsoft Edge-TTS provides 300+ high-quality voices without an API key. Gemini 2.5's native TTS can even handle emotional expression.

3
Automated Video Synthesis

FFmpeg's Ken Burns effect (that slow zoom-in/zoom-out you've seen) turns static images into dynamic video.

In February 2026, @build_daemon shared an "AI Shorts Auto Generator" on Threads that went viral with 328 likes. Similar tools already exist on GitHub: MoneyPrinterTurbo (49,500 Stars), ShortGPT (7,100 Stars), and the MCP-integrated Short Video Maker.


What's Different?

Sure, there are paid services like Runway, Pika, and HeyGen. But the open-source tools we're looking at are a different breed entirely.

Paid SaaS (Runway, Pika, etc.) Open-Source Generators
Cost $8–95/month (by plan) Free (only minor API costs)
Control Limited to platform templates Full code-level customization
Video Style AI-generated video (live-action/animation) Slides + narration + subtitles (info-driven)
Mass Production Credit limits apply Unlimited (runs locally)
Best For Ads, music videos, visual effects Educational, news, summary content
Technical Difficulty A few clicks in a browser Requires Python & terminal basics

In short: Runway and Pika are great for making "polished-looking videos", while open-source tools excel at producing "consistent daily content".

If you're a channel operator who needs to post shorts daily, a marketer repurposing blog content into video, or a creator mass-producing news summaries — open-source tools are the clear winner.

Let's compare the major tools. For beginners, we recommend MoneyPrinterTurbo.

Project Stars LLM Support TTS Key Features
MoneyPrinterTurbo 49.5k GPT, Gemini, DeepSeek, Qwen + 12 more Edge-TTS, Azure Web UI, batch generation, largest community
ShortGPT 7.1k OpenAI ElevenLabs, Edge-TTS 30 languages, built-in translation engine
Short Video Maker 965 MCP integration (any LLM) Kokoro TTS MCP/REST API, Docker deploy, video in 30 seconds
@build_daemon New Gemini Free TTS Ken Burns effect, one-click automation, cross-platform

Quick Start Guide

We'll walk through MoneyPrinterTurbo since it has the largest community. The flow is similar for other tools.

1
Prerequisites

Install Python 3.10+, FFmpeg, and ImageMagick. On Mac: brew install ffmpeg imagemagick. On Windows, download from their official sites.

2
Clone & Run
git clone https://github.com/harry0703/MoneyPrinterTurbo.git
cd MoneyPrinterTurbo
pip install -r requirements.txt
python webui.py

The web UI opens in your browser.

3
Set Up API Key

Get a free Gemini API key from Google AI Studio → enter it in the web UI settings. Select Edge-TTS for completely free narration.

4
Enter Topic & Generate

Be specific, like "Explain Bitcoin halving in 30 seconds." Set the aspect ratio to 9:16 for direct upload to Shorts/Reels.

If you prefer the CLI, check out @build_daemon's project. The cinematic Ken Burns effect is its standout feature.


Want to Go Deeper?

Core Tools

MoneyPrinterTurbo — GitHub Repository The undisputed leader with 49,500 Stars. Has a web UI so you can get started without touching code, and supports 12+ LLMs with Edge-TTS. https://github.com/harry0703/MoneyPrinterTurbo
ShortGPT — GitHub Repository The multilingual powerhouse supporting 30+ languages. Includes auto-subtitles, auto source video collection, and a built-in translation engine. https://github.com/RayVentura/ShortGPT
Short Video Maker — GitHub Repository A next-gen tool with MCP protocol support. Docker deployment, 30-second video in one minute. https://github.com/gyoridavid/short-video-maker

Technical References

Gemini API — TTS Documentation Guide to Gemini 2.5's native TTS capabilities. Supports 24 languages, emotional expression, and multi-speaker output. https://ai.google.dev/gemini-api/docs/speech-generation
Edge-TTS — GitHub Repository A Python package offering 300+ high-quality voices for free, no API key required. https://github.com/rany2/edge-tts
Bannerbear — FFmpeg Ken Burns Effect Guide Tutorial on implementing the Ken Burns effect using FFmpeg's zoompan filter. https://www.bannerbear.com/blog/how-to-do-a-ken-burns-style-effect-with-ffmpeg/
Pixazo — Top 10 Open-Source AI Video Generation Models in 2026 Comparison of the latest models including HunyuanVideo, CogVideoX, and SkyReels. https://www.pixazo.ai/blog/best-open-source-ai-video-generation-models