Pocket TTS is an open-source text-to-speech model that runs on CPUs, clones voices from 5 seconds of audio, and keeps voice ...
LTX-2 is an open source AI video model with 14B video and 15B audio parameters, giving you synced clips and local control.