LiveTranscribe 🎙️📝 [ new TOX + setup video ] (Patreon)

Published:

2024-09-22 19:22:04

Imported:

2024-11

Content

Hey! I've got a new tool for the toolbox. Versions of this have been on the discord, but I think it is ready for more to try ! Here is a video going over setting up, installing, and all the parameters for LiveTranscribe. Enjoy!

Real-time (as close as I could get) speech-to-text transcription + fast local file transcription
Local fast whisper transcription or assemblyai realtime transcription api
Windows + Mac - one click install [ see video or readme for prequisites ]
Callback system for custom event handling (onTranscriptFinal, onTranscriptPartial, onSessionStart, onSessionEnd, onServerReady)

Grab that TOX here !

Files

LiveTranscribe for TouchDesigner [ new TOX + setup ]

To access the operator, visit https://www.patreon.com/dotsimulate Links from the video + info: Mac: How to install Homebrew on Mac - https://docs.brew.sh/Installation / video https://www.youtube.com/watch?v=IWJKRmFLn-g Once installed, run this in terminal:brew install python@3.11 Windows: Cuda 11.8 or cuda 12.1 installed. CUDNN Archive - https://developer.nvidia.com/rdp/cudnn-archive ( More info on CUDNN - https://docs.nvidia.com/deeplearning/cudnn/latest/installation/windows.html ) Python 3.11.9 (windows) - https://www.python.org/downloads/release/python-3119/ Git for windows - https://git-scm.com/download/win AssemblyAI - https://www.assemblyai.com/ OpenAI's Whisper original release - https://openai.com/index/whisper/ Real time whisper: https://github.com/collabora/WhisperLive https://github.com/SYSTRAN/faster-whisper 00:00 Introduction 00:33 Installation prerequisites 02:50 Setup process 04:43 Using local Whisper model 07:40 Assemblyai [ non-local ] setup 08:36 Getting Assemblyai API Key 09:29 Main Settings 10:31 Additional features and settings 12:00 Callbacks brief look 12:25 Troubleshooting info 12:51 Outro