How to run Universal (Speech) Translator on Colab - SeamlessM4T with Web UI

10,682 views

1littlecoder

11 months ago

This is a tutorial on how to run Meta AI's SeamlessM4T model on Google Colab.
It also tests in depth the tasks supported by Facebook's new universal translator.
Seamless M4T - colab.research.google.com/git...
Camenduru's Github - github.com/camenduru/seamless...
Meta AI Announcement - ai.meta.com/resources/models-...
❤️ If you want to support the channel ❤️
Support here:
Patreon - / 1littlecoder
Ko-Fi - ko-fi.com/1littlecoder

Comments: 63
@GAllium14 11 months ago
A Tamilan in this growing AI community is amazing!!! ❤ from Tamil Nadu
@1littlecoder 11 months ago
Nandrigal pala (many thanks) 🙏🏽
@venkateshkumar104 11 months ago
@1littlecoder valthugal (best wishes) bro
@jverart2106 11 months ago
Thank you! I hope you can bring us more info on the uses of SeamlessM4T, its applications and so on. This will probably evolve into something extremely useful. Translators and interpreters will have to rethink their careers.
@mahmoudmuhiz1073 11 months ago
Thank you. As someone not that literate in code, this was helpful.
@rajivraghu9857 11 months ago
1littlecoder lives in Namma Bengaluru ❤❤
@1littlecoder 11 months ago
😁 yep yep
@QEDAGI 11 months ago
Was just thinking about this. Thanks.
@1littlecoder 11 months ago
You're welcome
@ilianos 11 months ago
I just wrote the comment on the other video, hoping you would cover this. THAT was fast! 😛
@henkhbit5748 11 months ago
Great video. It would be nice to have an end-to-end video using Gradio: chat by voice, submit the speech to the LLM, and have the response translated back to voice.
@cloudsystem3740 11 months ago
Thank you very much! Do you know Colab's limits on running time?
@yaswanthfinds 11 months ago
Hi bro, I am from Andhra. I love your videos.
@1littlecoder 11 months ago
Thanks for the support bro :)
@trendavira5128 11 months ago
Really great. For Arabic, the spoken output differs from the transcript in some words, but the meaning remains the same.
@user-xu8zy7ge1x 11 months ago
As an Arabic speaker I can confirm that was crazy! So accurate too. Thanks for the video.
@robins.storey6038 11 months ago
Really nice open source project. Can you test TTS with different voices? Can we use a custom voice model?
@bardaiart 11 months ago
One interesting thing I noticed with Arabic is that the audio translation was a little bit different from the text translation -- it wasn't much, just one different word. Though the quality of the translation is good :)
@mohamed1248 11 months ago
Is it possible to train different voices for translating?
@KevinKreger 11 months ago
Thanks ❤
@1littlecoder 11 months ago
You're welcome 😊
@ajmalkhan8013 11 months ago
Can we use it to provide speech on YouTube videos?
@samuelm5766 11 months ago
Hi bro, I'm also from Tamil Nadu ❤ 4:10
@1littlecoder 11 months ago
Nice bro :)
@jurandfantom 11 months ago
You could translate things into Polish. I watch basically every video of yours, so I can give feedback on the quality of such translations :) You mentioned that we can download the Colab notebook into a local Jupyter notebook. Could you explain more? I already figured out that I can just copy-paste each line from the Colab notebook and the installation should be fine, but I was wondering about that other option (the Jupyter notebook). Can't wait to see improved GUI versions on GitHub (as a Windows exe file or something).
@leemark7739 7 months ago
me too
@ratside9485 11 months ago
I am looking for a tutorial for a local installation. This is really a game changer, also for LLMs. I hope it gets built into oobabooga.
@leemark7739 7 months ago
yep
@mmet5772 11 months ago
Hi, thanks for this. I have a question: can I use this in my Flutter app for a speech-to-text task?
@leemark7739 7 months ago
same question
@silicomic 5 months ago
Is anyone else facing problems with longer audio like me? Any solution?
@__________________________6910 11 months ago
Hey, why is it taking only the first 19-20 s of audio?
@fredsakay994 2 months ago
Could you please make this service available as a web page on a site?
@shadabalam2513 11 months ago
Bro, can you please help me? I got an error when I ran it on Colab: TypeError: Translator.__init__() got an unexpected keyword argument 'sample_rate'
@usofdashtban105 11 months ago
Me too! How can we solve it?
@andthensome9277 7 months ago
I got an error on Windows 10, in Colab:
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
fairseq2n 0.2.0 requires torch==2.1.1, but you have torch 2.0.0 which is incompatible.
torchaudio 2.1.1 requires torch==2.1.1, but you have torch 2.0.0 which is incompatible.
torchtext 0.16.0 requires torch==2.1.0, but you have torch 2.0.0 which is incompatible.
torchtext 0.16.0 requires torchdata==0.7.0, but you have torchdata 0.6.0 which is incompatible.
torchvision 0.16.0+cu118 requires torch==2.1.0, but you have torch 2.0.0 which is incompatible.
How can I solve it? Is it impossible to build on Windows?
@leemark7739 7 months ago
same problem same question
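For anyone reading that pip output: it may help to group the conflict lines by the pin each package demands. The sketch below does only that, using the lines quoted in the comment above; `group_conflicts` and the regular expression are illustrative helpers, not part of pip or of the notebook from the video.

```python
import re
from collections import defaultdict

# Conflict lines copied from the pip output quoted in the comment above.
PIP_OUTPUT = """\
fairseq2n 0.2.0 requires torch==2.1.1, but you have torch 2.0.0 which is incompatible.
torchaudio 2.1.1 requires torch==2.1.1, but you have torch 2.0.0 which is incompatible.
torchtext 0.16.0 requires torch==2.1.0, but you have torch 2.0.0 which is incompatible.
torchtext 0.16.0 requires torchdata==0.7.0, but you have torchdata 0.6.0 which is incompatible.
torchvision 0.16.0+cu118 requires torch==2.1.0, but you have torch 2.0.0 which is incompatible.
"""

# Hypothetical pattern written for this message format only.
CONFLICT = re.compile(
    r"^(?P<pkg>\S+) (?P<pkg_ver>\S+) requires (?P<req>[^,]+), "
    r"but you have (?P<have>\S+) (?P<have_ver>\S+) which is incompatible\.$"
)


def group_conflicts(text):
    """Map each demanded pin (e.g. 'torch==2.1.1') to the packages that demand it."""
    demanded = defaultdict(list)
    for line in text.splitlines():
        match = CONFLICT.match(line.strip())
        if match:
            demanded[match["req"]].append(match["pkg"])
    return dict(demanded)


conflicts = group_conflicts(PIP_OUTPUT)
for requirement, packages in conflicts.items():
    print(f"{requirement}: {', '.join(packages)}")
```

Grouped this way, the quoted output shows two different torch pins (2.1.1 and 2.1.0) being demanded at once, so no single torch version can satisfy every listed package. The message describes version pins rather than the operating system.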
@user-kk4zf9hk4b 11 months ago
Wow, just got to know you are Tamil 🥰. Keep shining.
@1littlecoder 11 months ago
Yes 😊 Thank you
@learnweb3603 11 months ago
Bro, how do I run the Civitai models using code or Google Colab? Please make a video.
@1littlecoder 11 months ago
Do you mean Stable Diffusion models?
@learnweb3603 11 months ago
@1littlecoder yup yup, fine-tuned Stable Diffusion models
@sunkwolf 10 months ago
Can we run this locally on Windows?
@1littlecoder 10 months ago
Yes, if you have an Nvidia GPU
@1littlecoder 10 months ago
Otherwise it'd take longer
@sunkwolf 10 months ago
@1littlecoder I have an Nvidia 4090, but I'm using Windows. I can't switch to Linux because almost all the programs I use for work only run on Windows.
@leemark7739 7 months ago
@sunkwolf It seems to work on Linux, not Windows. I wonder whether WSL2 works? Please let me know if you make it.
@leemark7739 6 months ago
@sunkwolf Have you succeeded?
@mvkrishna760 11 months ago
Tamil - awesome!!!
@user-mx4wm2hl9u 11 months ago
When something feels hard to use, I think: go to YouTube and you will find an Indian man teaching it. And here we are, thank you. About the Arabic translation: it is about 80% correct, because it uses a word that means listening to a YouTube video, whereas we would use a word like watching a video.
@lokeshart3340 11 months ago
Can you make a real-time skin cancer Streamlit app?
@YasserHashem 11 months ago
The Arabic translation in the video is good; it translated everything right.
@1littlecoder 11 months ago
Thanks for confirming
@giantworks1366 10 months ago
Yes, it's correct in Arabic @1littlecoder 👍
@lynnqi6451 11 months ago
Except for how it says YouTube in Mandarin, the rest is OK. But I think Microsoft's text-to-speech is better 😊
@ZhechengLi-wk8gy 11 months ago
This model does not seem to work very well for Mandarin translation and pronunciation.
@susannemalok6561 11 months ago
Task: S2TT (Speech to Text translation). Warning: Input audio is too long. Only the first 60 seconds is used. 🤔
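One possible workaround for that limit, not something confirmed by the video or the demo's author, is to split a long recording into pieces of at most 60 seconds and translate each piece separately. A minimal sketch of the splitting arithmetic follows; `chunk_bounds` is a hypothetical helper, and the 60-second default is taken from the warning quoted above.

```python
def chunk_bounds(total_samples, sample_rate, max_seconds=60.0):
    """Return (start, end) sample indices for consecutive chunks of at most max_seconds."""
    chunk_len = int(sample_rate * max_seconds)
    if chunk_len < 1:
        raise ValueError("sample_rate * max_seconds must be at least one sample")
    return [
        (start, min(start + chunk_len, total_samples))
        for start in range(0, total_samples, chunk_len)
    ]


# Example: a 150-second recording sampled at 16 kHz (illustrative numbers).
bounds = chunk_bounds(total_samples=150 * 16_000, sample_rate=16_000)
for start, end in bounds:
    print(f"samples {start}..{end} ({(end - start) / 16_000:.0f} s)")
```

Each (start, end) pair can be used to slice the audio array before passing the piece to the model. Cutting at fixed offsets can split a word or sentence in half, so splitting at pauses would likely give better translations.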
@allhellloose7632 11 months ago
Non-commercial
@listentomusic8160 5 months ago
Terrible Hindi 😂
@leemark7739 6 months ago
Is it possible to run it on Windows locally?