Home Artists Posts Import Register

Content

Hey! I've got a new tool for the toolbox. Versions of this have been on the discord, but I think it is ready for more to try ! Here is a video going over setting up, installing, and all the parameters for LiveTranscribe. Enjoy!

  • Real-time (as close as I could get) speech-to-text transcription + fast local file transcription

  • Local fast whisper transcription or assemblyai realtime transcription api

  • Windows + Mac - one click install [ see video or readme for prequisites ]

  • Callback system for custom event handling (onTranscriptFinal, onTranscriptPartial, onSessionStart, onSessionEnd, onServerReady)

Grab that TOX here !

Files

LiveTranscribe for TouchDesigner [ new TOX + setup ]

To access the operator, visit https://www.patreon.com/dotsimulate Links from the video + info: Mac: How to install Homebrew on Mac - https://docs.brew.sh/Installation / video https://www.youtube.com/watch?v=IWJKRmFLn-g Once installed, run this in terminal:brew install python@3.11 Windows: Cuda 11.8 or cuda 12.1 installed. CUDNN Archive - https://developer.nvidia.com/rdp/cudnn-archive ( More info on CUDNN - https://docs.nvidia.com/deeplearning/cudnn/latest/installation/windows.html ) Python 3.11.9 (windows) - https://www.python.org/downloads/release/python-3119/ Git for windows - https://git-scm.com/download/win AssemblyAI - https://www.assemblyai.com/ OpenAI's Whisper original release - https://openai.com/index/whisper/ Real time whisper: https://github.com/collabora/WhisperLive https://github.com/SYSTRAN/faster-whisper 00:00 Introduction 00:33 Installation prerequisites 02:50 Setup process 04:43 Using local Whisper model 07:40 Assemblyai [ non-local ] setup 08:36 Getting Assemblyai API Key 09:29 Main Settings 10:31 Additional features and settings 12:00 Callbacks brief look 12:25 Troubleshooting info 12:51 Outro

Comments

Andy Harper

Hey Lyell - can't seem to DL this TOX as a long time member

Muhamad Ridho

Can this be combined with your StreamDiffusionTD?

Moritz Maas

hello, looks great! is it not included in my subscription?

Andy

Somehow the grab tox link also didn't work for me as a subscriber, just paid for that one anyway to support! Thx for all you do for the community.

Manuel Rodrigues

Hi! Just bought your TOX but the setup is not running on my TD. I checked my Python version and CUDA version as well and everything is up to date. Can you help me, please?

Gus

Hi - Im a subscriber too but cant download ?

Lyell Hintz

This is for 10$ tier which currently gives access to all of the things I've released. 5$ tier is for StableDiffusion stuff only.

Lyell Hintz

This is for 10$ tier which currently gives access to all of the things I've released. 5$ tier is for StableDiffusion stuff only.

Lyell Hintz

This is for 10$ tier which currently gives access to all of the things I've released. 5$ tier is for StableDiffusion stuff only.

Dromp Alex

From what i know and tested, whisper can take any language without specifying the language and it will transcribe it, i've tested it with faster-whisper large-v3 model and did a great job in french , english , german and romanian, sending the transcription as an prompt to an LLM like mistral or llama in dual/triple language is no issue also. Is there an update for this on the horizon, not being limited by a language select menu before running the whisper model?

상우 백

Does this work with Python 3.10?

Lyell Hintz

This is tested on 3.11.9, however, it doesn't actually import the venv into TD , so it could work with 3.10.9 as far as I know. I will be posting an update to LiveTranscribe within the next week or two. Working on bringing that into my LOPs family.

Nyan Linn Aung

hello ive downloaded the tox and followed the steps from the video but it's not working. I'm using python 3.10 with cuda 11.8. thank you.