AI Video Editor

Private

An LLM-autonomous editor that turns whole TV seasons into short-form compilation edits.

What it is

A self-hosted system that ingests full TV series or movies, indexes every scene with rich metadata, and lets an autonomous LLM agent assemble polished (debatable) 15–30 second TikTok/Reels-style edits from a plain-English instruction. The agent searches the indexed clip library, builds a two-track timeline, applies effects and overlays, and renders the result. It is built for a single operator running their own media library and GPU hardware.
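As a rough illustration of the two-track timeline the agent builds, here is a minimal Python sketch. The `Clip` and `Timeline` names, their fields, and the split into a main track and an overlay track are assumptions made for this example, not the project's actual data model; only the 15–30 second target comes from the description above.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Clip:
    source: str      # hypothetical ID of an indexed scene or overlay asset
    start: float     # in-point within the source, in seconds
    duration: float  # how long the clip plays on the timeline, in seconds


@dataclass
class Timeline:
    # Assumed layout: one track for the main cut, one for overlays.
    main: List[Clip] = field(default_factory=list)
    overlay: List[Clip] = field(default_factory=list)

    def add_main(self, clip: Clip) -> None:
        self.main.append(clip)

    def add_overlay(self, clip: Clip) -> None:
        self.overlay.append(clip)

    def length(self) -> float:
        # Clips on a track play back to back; the edit is as long as its longest track.
        return max(
            sum(c.duration for c in self.main),
            sum(c.duration for c in self.overlay),
        )

    def fits_target(self, lo: float = 15.0, hi: float = 30.0) -> bool:
        # The 15-30 second window is the short-form target length.
        return lo <= self.length() <= hi
```

An agent could call `fits_target()` before rendering to decide whether it still needs to add or trim clips.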

How it works

Why it's interesting

The interesting part is closing the perception loop: the editing agent doesn't just emit edit commands blind. It renders preview frames and feeds them back to a vision model, so it can judge cuts and transitions the way a human reviewing the timeline would. Character recognition runs entirely on local ONNX models with deliberately conservative auto-matching, keeping ambiguous identities provisional rather than guessing.
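Both ideas can be sketched in a few lines of Python. This is a sketch under stated assumptions, not the project's code: `review_loop`, `match_identity`, the injected callables, and the `min_score` / `min_margin` thresholds are all hypothetical, and cosine similarity over face embeddings is assumed as the matching metric.

```python
import math
from typing import Callable, Dict, List, Optional, Sequence


def review_loop(timeline, render_preview: Callable, critique: Callable,
                revise: Callable, max_rounds: int = 3):
    """Render, look, revise: stop once the critic has no notes or rounds run out."""
    for _ in range(max_rounds):
        frames = render_preview(timeline)   # hypothetical: returns preview frames
        notes: List[str] = critique(frames)  # hypothetical vision-model call
        if not notes:                        # an empty list means "looks fine"
            break
        timeline = revise(timeline, notes)   # hypothetical: agent applies the notes
    return timeline


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def match_identity(face: Sequence[float], known: Dict[str, Sequence[float]],
                   min_score: float = 0.6, min_margin: float = 0.1) -> Optional[str]:
    """Return a name only when the best match is strong AND clearly ahead of
    the runner-up; otherwise return None so the identity stays provisional."""
    scored = sorted(((cosine(face, emb), name) for name, emb in known.items()),
                    reverse=True)
    if not scored:
        return None
    best_score, best_name = scored[0]
    runner_up = scored[1][0] if len(scored) > 1 else -1.0
    if best_score >= min_score and best_score - runner_up >= min_margin:
        return best_name
    return None
```

Requiring a margin over the runner-up, not just a high score, is one way to make auto-matching conservative: a face that resembles two characters equally is left unlabeled instead of being assigned to whichever happens to score first.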

Status

Private hobby project, work in progress. It runs end-to-end via Docker Compose.