AI LIP READING

  Рет қаралды 1,201,324

carykh

carykh

5 жыл бұрын

Check out Brilliant.org for fun STEMmy courses online! First 200 people to sign up here get 20% off their annual premium subscription cost: brilliant.org/CaryKH/
Thanks to Liza for animating the beginning of this video: lizadesya?...
GitHub repo for this project:
github.com/carykh/videoToVoice
(I haven't uploaded all files here yet, especially the December ones. They'll be coming soon!)
James WoMa's channel:
/ @jameswoma1140
If you wanna help animate for my videos, here's my Twitter I suppose: / realcarykh
Raw output of the lip-reading AI: • AI lip reading test ou...
Original video of me reading the Bee Movie Script: • carykh's full reading ...
Gentle Python library: github.com/lowerquality/gentle
Adam Geitgey's face recognition Python library: github.com/ageitgey/face_reco...
** MUSIC **
Everything here is licensed under a Creative Commons Attribution licence (creativecommons.org/licenses/...)
Lee Rosevere - Wireless
freemusicarchive.org/search/?s...
Lobo Loco - Railroad (ID 1003)
freemusicarchive.org/search/?s...
Nikolai Rimsky-Korsakov - Flight of the Bumblebee (surprisingly fitting. It's also in the public domain)
BODYSURFER - Call Your Grandma
freemusicarchive.org/search/?s...
"Childhood Memories of Winter" from: Music4YourVids.co.uk
"Skyline" by JujuMas
/ skyline_self_made_trac...
Long Road Ahead by Kevin MacLeod (incompetech.com)
Licensed under Creative Commons: By Attribution 3.0 License
creativecommons.org/licenses/b...
Final Count by Kevin MacLeod (incompetech.com)
Licensed under Creative Commons: By Attribution 3.0 License
creativecommons.org/licenses/b...
Sippie Jepper - Branchless
/ branchless
Song: Fredji - Happy Life (Vlog No Copyright Music)
Music provided by Vlog No Copyright Music.
Video Link: • Fredji - Happy Life (V...

Пікірлер: 3 500
@KurtHugoSchneider
@KurtHugoSchneider 5 жыл бұрын
now we need the full bee movie uploaded, but with the actual audio replaced by your dramatic reading of the script...
@carykh
@carykh 5 жыл бұрын
omg I have the 70 minute video of my voice on my iPhone, I suppose I have no choice but to upload it! check back in 1 hour. I bet somebody will edit it all together
@TastyBaldEagle
@TastyBaldEagle 5 жыл бұрын
@@carykh please
@PlasmaSabre
@PlasmaSabre 5 жыл бұрын
@@carykh I would watch this :D Great work on the project btw, love your videos.
@kutip1027
@kutip1027 5 жыл бұрын
Please I still want this
@kutip1027
@kutip1027 5 жыл бұрын
If I need to I will volunteer as tribute
@boyinaband
@boyinaband 5 жыл бұрын
I love these videos.
@UmMeAmberE
@UmMeAmberE 5 жыл бұрын
OOF IVE FOUND YOU
@MrZkitZ
@MrZkitZ 5 жыл бұрын
@@UmMeAmberE same
@yoyochinb3742
@yoyochinb3742 5 жыл бұрын
Wow
@4ltrz555
@4ltrz555 5 жыл бұрын
Hello!
@RubenFedop
@RubenFedop 5 жыл бұрын
So thats how i found your channel
@toasttimestwo
@toasttimestwo 4 жыл бұрын
Cary: Read the lips of this guy. Computer: *S U M M O N S S A T A N*
@jobisTheWorst
@jobisTheWorst 4 жыл бұрын
WHO SUMMONED ME
@72jysmith
@72jysmith 4 жыл бұрын
Cary:ME
@lameking2839
@lameking2839 4 жыл бұрын
God: Let me introduce myself
@wolfyowoz
@wolfyowoz 3 жыл бұрын
666 likes I'm not gonna ruin that
@reinatr4848
@reinatr4848 3 жыл бұрын
Still 666 likes
@YoshTea
@YoshTea 4 жыл бұрын
Holy hecc this is useful for animation
@CA19
@CA19 2 жыл бұрын
YES
@rj9959
@rj9959 5 жыл бұрын
Only about 40% of words are able to be made out by the best lip readers. The rest of the words are assumed based on context. So this project has huge limitations to start with.
@dylanwijaya1662
@dylanwijaya1662 5 жыл бұрын
@Eric Lee you like cereals>:)?
@dylanwijaya1662
@dylanwijaya1662 5 жыл бұрын
@Eric Leeyou like mum buy cereal type >:) ?
@dylanwijaya1662
@dylanwijaya1662 5 жыл бұрын
@Eric Lee ohhhh children school they give milk like teachers to student. it good because I can eat cereal with milk it free. So teacher give milk to children. Okeh?
@GaJ42
@GaJ42 5 жыл бұрын
Okay not is it
@dylanwijaya1662
@dylanwijaya1662 5 жыл бұрын
@@GaJ42 you like cereals>:)?
@NeedForMadnessSVK
@NeedForMadnessSVK 5 жыл бұрын
"We just need to pick the right transcript" Me: Its going to be a Bee movie isnt it? "I read the entire Bee movie script on camera" NAILED IT.
@jurremioch316
@jurremioch316 5 жыл бұрын
It just HAD to be the Bee Movie script, I cheered so hard when he said it.
@hoodlumscraggy1801
@hoodlumscraggy1801 5 жыл бұрын
kzfaq.info/get/bejne/d7BzmcqJzaeZlpc.html here is his bee movie script video
@jessdoesstuff6783
@jessdoesstuff6783 5 жыл бұрын
thought the exact same thing
@slicerthe84th
@slicerthe84th 2 жыл бұрын
NAILY
@Amaya_Fox_20
@Amaya_Fox_20 5 жыл бұрын
"so how tough are you?" "I read the entire bee movie script" "yeah, so?" "I read it in front of my camera" "come right in, sorry for the wait"
@ashleysmith8528
@ashleysmith8528 4 жыл бұрын
You got a bottle of ketchup? yeah *Fails at opening ketchup cap Could I run this in some hot water?
@azadanzans5359
@azadanzans5359 4 жыл бұрын
Kolio Pulio Why doesnt anyone know the last line?
@user-it9qn5ju5u
@user-it9qn5ju5u 4 жыл бұрын
@@azadanzans5359 , no no
@SoshJam
@SoshJam 4 жыл бұрын
AND SUBMITTED IT FOR A COLLEGE CLASS
@legoyoda5776
@legoyoda5776 4 жыл бұрын
"Or rather, I should say *OUR* lip reading A.I" *SOVIENT ANTHEM STARTS PLAYING*
@QS1597
@QS1597 4 жыл бұрын
Antonio Sustaita ah, the sovieNt union
@QS1597
@QS1597 4 жыл бұрын
SPOTILA NAVEKI VELIKAYA RUS
@user-gc2cv6qw6i
@user-gc2cv6qw6i 4 жыл бұрын
Yes
@cailyndempster
@cailyndempster 4 жыл бұрын
Soviet
@vvg_lol
@vvg_lol 4 жыл бұрын
No please no
@thatonewierdcowboy6792
@thatonewierdcowboy6792 5 жыл бұрын
Funny thing is... I actually correctly guessed “Have you got a moment?”
@ohyeahyeahimasian392
@ohyeahyeahimasian392 5 жыл бұрын
same
@tripodgamer
@tripodgamer 5 жыл бұрын
LIAR
@bensosnowski1128
@bensosnowski1128 4 жыл бұрын
I guessed it was a question, but that’s it
@ethen1772
@ethen1772 4 жыл бұрын
I guessed are you being helpful?
@isaacphase2759
@isaacphase2759 4 жыл бұрын
That was the only one I got
@tfairfield42
@tfairfield42 5 жыл бұрын
*OUR* LIP READING AI _Soviet anthem begins_
@benos1799
@benos1799 5 жыл бұрын
Good job comrade we need you in the soviet union
@blarg2429
@blarg2429 5 жыл бұрын
mobile.twitter.com/unusualvideos/status/1069136310600777729
@guh2908
@guh2908 5 жыл бұрын
Sounds like *_COMMUNIST PROPAGANDA_* But ok
@Kasmuller
@Kasmuller 5 жыл бұрын
@@benos1799 to bad Soviet has been gone for almost 30 years
@voltagedrop5899
@voltagedrop5899 5 жыл бұрын
Daily reminder that communism doesn't work.
@eluisific3255
@eluisific3255 5 жыл бұрын
12:51 Jokes on you! I memorized the whole bee movie script!!!
@CrimsonCascade3101
@CrimsonCascade3101 4 жыл бұрын
what did he say then
@bastibob660
@bastibob660 4 жыл бұрын
Vannesa pull yourself together
@Fuley-la-joo
@Fuley-la-joo 3 жыл бұрын
According
@Crystal_500
@Crystal_500 3 жыл бұрын
@@Fuley-la-joo to
@rebert_reid
@rebert_reid 3 жыл бұрын
@@Crystal_500 all
@txrafafy
@txrafafy 4 жыл бұрын
3:53 **Their smiles slowly turning into giant frowns**
@agentstache135
@agentstache135 5 жыл бұрын
Reverse the program to animate the mouth movements EDIT: If Cary still has the animation files for some of his videos I don't think it'd be too hard to rip the mouth data from them (as a one dimensional matrix representing different mouth positions) and then use that with the audio from those videos
@iritesh
@iritesh 5 жыл бұрын
that's what China did with the news anchoring AI
@exm3266
@exm3266 5 жыл бұрын
IIRC Adobe Animate recently released a feature that would assist in lip syncing, but I'm not sure if it's anything like the logic used here.
@JeffHykin
@JeffHykin 5 жыл бұрын
You could also reverse the purpose of the AI: give it the original transcript and have it swap real words with similar-looking words. Limit it to only a few words per sentence, give it an oddly specific dictionary for substitutions, and you'd have truly automated the bad lip reading channel. Maybe that's what I'll do for my senior project.
@TheTonyMcD
@TheTonyMcD 5 жыл бұрын
That would be incredibly useful to the anime industry. And with decent enough cgi, to the entire film dubbing industry.
@michaelepica3564
@michaelepica3564 Жыл бұрын
Lol he did that
@Failzz8
@Failzz8 5 жыл бұрын
14:14 interesting, so this is what being insane feels like.
@diamondgolem6401
@diamondgolem6401 5 жыл бұрын
I'm pretty sure it's more like 3:53
@TheKillerGut
@TheKillerGut 5 жыл бұрын
*Uses headphone*...ow
@knack3381
@knack3381 5 жыл бұрын
My right headphone is broken Which makes me sane, i guess
@jakef8913
@jakef8913 4 жыл бұрын
"For example, after the word 'the' there should always be a noun" adjectives
@devinandcarrietotaldrama505
@devinandcarrietotaldrama505 4 жыл бұрын
The cat = The bad cat
@yourtypicalcube2830
@yourtypicalcube2830 2 жыл бұрын
@@pinkman_ Gerunds (-ing) are nouns, so you're using a noun there.
@robinr2770
@robinr2770 5 жыл бұрын
as a linguist, I feel for you, you took on a task way harder than you expected, good job regardless. unfortunately we can not see inside the mouth of someone speaking and that is where so much of speech happens. you can also consider the following: if you have the same vowel after 3 different consonants, your lips will always be in a different position, thus some sounds don't have unique lip positions at all. real life lip reading is mostly context and being able to tell where those highly distinguishable consonants are.
@duck7781
@duck7781 5 жыл бұрын
13:00 super easy I memorized the bee movie script
@EmanuilGlavchev
@EmanuilGlavchev 5 жыл бұрын
Overfitting in real life :D
@OneFingerYT
@OneFingerYT 5 жыл бұрын
I actually read "have you got a moment" easily. The AI needs more training in phrases.
@theepicgamer4578
@theepicgamer4578 5 жыл бұрын
Your profile pic saids it all
@JustinY.
@JustinY. 5 жыл бұрын
"Bow down to your robot overlords"
@amberjadedontcommentonoldp2717
@amberjadedontcommentonoldp2717 5 жыл бұрын
Happy new year justin
@imagineexistance4538
@imagineexistance4538 5 жыл бұрын
How are you here
@jonahlouque9621
@jonahlouque9621 5 жыл бұрын
The Demonetizer
@blaz9474
@blaz9474 5 жыл бұрын
I, for one, accept our new robot overlords.
@infiniteobject
@infiniteobject 5 жыл бұрын
Justin Y. How did you get here
@Weg002
@Weg002 4 жыл бұрын
3:54 when I try to talk/listen to someone talking in a dream
@alexandramuller9055
@alexandramuller9055 4 жыл бұрын
I love the conway's game of life reference "bring out the big guns" lmao For anyone wondering, the picture he slams on the table is a glider gun, it produces infinite gliders.
@H_fromDiscord_real
@H_fromDiscord_real Ай бұрын
timestamp?
@sikor02
@sikor02 5 жыл бұрын
Dave, although you took very thorough precautions in the pod against my hearing you, I could see your lips move. ~HAL 9000
@bapldap3324
@bapldap3324 5 жыл бұрын
I was looking for this.
@razvanflorea1166
@razvanflorea1166 5 жыл бұрын
A Space Oddisey fans unite!
@kryswilkins8615
@kryswilkins8615 5 жыл бұрын
I’m afraid I can’t do that, Dave.
@leehttucec-9985
@leehttucec-9985 5 жыл бұрын
You said what we were all thinking, thank you
@user-vn7ce5ig1z
@user-vn7ce5ig1z 5 жыл бұрын
• The takeaway from this video is to give deaf people lots of kudos. • Decimating twice isn't 20% off, it's 19% off: ((N×0.9)×0.9) Close but no zikal (I think I need more practice lip-reading). • Dubbing words onto politician's mouths has already been done. It's the audio counterpart of deep-fakes (and BadLipReading).
@matthewzeller5026
@matthewzeller5026 5 жыл бұрын
I was going to comment that but I'm not even sure what the "correct" term is. Sure you could say "20%" but does "bi-decimate" work?
@microbialdoormat
@microbialdoormat 5 жыл бұрын
I, myself, am hard of hearing. As long as I have the tiniest bit of sound, I can read lips. And with dramatic wording, like yours, I read it just fine! So hah!
@fgbeast5805
@fgbeast5805 5 жыл бұрын
I seriously thought he said “I love bobbies” 13:35
@agentstache135
@agentstache135 5 жыл бұрын
The Gosper Glider Gun (4:20) is one of the smallest guns in Conway’s Game of Life. Like I’m not saying you needed to show a HBK Gun or anything, but at least show a Cordership Gun or something
@carykh
@carykh 5 жыл бұрын
not enough pixels in a KZfaq video! And hey at least it's bigger than a queen bee
@tomryan3408
@tomryan3408 5 жыл бұрын
lol 420
@WangleLine
@WangleLine 5 жыл бұрын
Thanks for the random knowledge, stranger!
@mystery8093
@mystery8093 5 жыл бұрын
*420 blaze it*
@Calthecool
@Calthecool 5 жыл бұрын
You had a video of you reading the bee movie script for 10 months? And you didn’t post it? - respect.
@hyfi_n
@hyfi_n 4 жыл бұрын
3:47 "Yeah I know she was so..." Ha nice BFDI reference
@bobross4082
@bobross4082 5 жыл бұрын
Dude. I just started watching your videos. I don’t know what job you have. But your a genius. Your literally improving computer programming extremely. I don’t know actually terminology. But your gonna be making huge money someday if not already. Your gonna be the reason robots become a reality
@bornach
@bornach 5 жыл бұрын
Most disappointed that there was no 2001: A Space Odyssey reference to HAL9000's decision to murder the crew based on lip reading evidence.
@binaryorbitals
@binaryorbitals 5 жыл бұрын
Person: Read My Lips Cary: Say No More
@jansopi6967
@jansopi6967 4 жыл бұрын
I should say *OUR* lip reading AI. Staline aproves
@UncleSheoTV
@UncleSheoTV 5 жыл бұрын
I'm not sure if the videos are meant to induce laughing so much that you begin to hurt but I have watched 3 of your videos and they have all done this to me. Also they are very impressive!!!
@cavemann_
@cavemann_ 5 жыл бұрын
What an absolute madlad! He actually read the whole Bee Movie script!
@KrazyKyle-ij9vb
@KrazyKyle-ij9vb 5 жыл бұрын
I hope he likes jazz...
@KentoNishi
@KentoNishi 5 жыл бұрын
Roses are read Violets are blue AI can read Can Cary too?
@glanni
@glanni 5 жыл бұрын
When you said you would use the transcript of a movie i was getting very excited. When you were talking about doing the unthinkable, i knew it had to be it. When you said you read the entire bee movie script on camera, i literally started clapping before i could care about my family being in the same room. I respect you so much for this, you really gave a big sacrifice.
@robz537
@robz537 4 жыл бұрын
amazing how productive u are. great script for the video btw
@PixelBytesPixelArtist
@PixelBytesPixelArtist 5 жыл бұрын
A Traditional to simplified Chinese character converter would be amazing. If you guys want to try that project again I suggest trying to identify radicals and translate those instead of the characters themselves. Most differences between simplified and traditional are in the radicals
@caseygreyson4178
@caseygreyson4178 5 жыл бұрын
Please use this to translate Jojo Siwa so we know what she’s trying to say Also, don’t worry about the project’s accuracy. I have a Deaf sibling and when they talk to me it’s fine because I learned sign language growing up with them. But they hate lip reading because it’s so hard to read lips. Apparently opinions/studies sort of agree that lip reading is an awful way to communicate cause some sounds look the same. A pretty infamous one is “Olive juice” looking like “I love you”. They say only 30% of words can be read accurately. Pretty weird right?
@badlydrawnturtle8484
@badlydrawnturtle8484 5 жыл бұрын
It's pretty obvious if you actually stop to think about it. (To quote Wikipedia for briefness) "Organs used for speech include the lips, teeth, alveolar ridge, hard palate, velum (soft palate), uvula, glottis and various parts of the tongue." Out of all of that, the only thing "lip reading" gets you information about is the lips and very occasionally the tip of the tongue; all of the rest of that critical information is invisible from the outside. It's remarkable that anybody ever thought lip reading was effective, really. Did they never stop to consider what their own mouth and throat are doing?
@caseygreyson4178
@caseygreyson4178 5 жыл бұрын
Badly Drawn Turtle Exactly! Sounds like Fa and Va look exactly the same. As well as Ga and Ka. The whole point of lip reading is that it’s just the shape of the mouth. You don’t have context or the sounds. In ASL we mouth words on most signs, but that’s just cause. If you do the sign for twins and mouth “twins”, no one is going to think you said “wins” because there is that context. But lip reading by itself (when my sibling tries to understand someone who isn’t signing) they struggle so much.
@boggers
@boggers 5 жыл бұрын
@@caseygreyson4178 yeah, there are around 40 phonemes in most languages, but traditional 2D animators use only 10 mouth shapes. eg. M B and P all use the same shape, there is one neutral looking shape that is used for about a quarter of the other sounds.
@ZombieGuts15
@ZombieGuts15 5 жыл бұрын
and, “Alligator food” looks like, “I love you”
@hoper7649
@hoper7649 5 жыл бұрын
If the computer got 47% right. Then its pretty good.
@mysterycookie_
@mysterycookie_ 5 жыл бұрын
Love your videos, thank you for uploading
@bronistevoni
@bronistevoni 5 жыл бұрын
Wow the video of you saying the bee movie script was recorded on my birthday. Best present ever!
@v.6984
@v.6984 5 жыл бұрын
carykh: *"On March the 11th, 2018, at 11 PM, I did the unthinkable."* Me: oh no, please tell me he didn't read the entire bee movie scri- carykh: *"I read the entire Bee Movie script on camera"*
@sirclashin
@sirclashin 5 жыл бұрын
Lmao
@OrangeC7
@OrangeC7 5 жыл бұрын
Honestly, and I'm not sure if this is how KZfaq does their captions, but I feel like a combination of lip reading and word recognition together would make very accurate captions, especially if it's tuned to be just right.
@sacripudding4586
@sacripudding4586 5 жыл бұрын
That causes an issue. It wont know if it sees lips or not. It could just see like, as an example, a fortnitw characters lips. Alot of gameplay channels dont have webcams. It may see the wrong thing as lips, issues like that may screw up subtitles.
@lara4268
@lara4268 5 жыл бұрын
I was so proud when I guessed "do you have a moment"
@Mastaachef
@Mastaachef 5 жыл бұрын
13:39I ACTUALLY GOT IT RIGHT OMGGG! So this is what ultra instinct feels like?
@joelbraun8584
@joelbraun8584 4 жыл бұрын
YEAH HAHA SAME "Both of you did terrible"
@SreenikethanI
@SreenikethanI 5 жыл бұрын
06:08 i swear I expecting he was gonna read the Bee movie script… AND HE DID! I'm like "YESS!"
@ChristianGates
@ChristianGates 5 жыл бұрын
Your neck moves too when you make certain syllables. Maybe you should incorporate that?
@Predated2
@Predated2 5 жыл бұрын
I think angles matter too. If he had done 2 angles, it probably would be able to look at the movements more precise and see where it went wrong. Then having 3-5 people reading the same thing both overly moving and normally, it should figure it out pretty quick.
@ChristianGates
@ChristianGates 5 жыл бұрын
Predated O exactly
@AB-Prince
@AB-Prince 5 жыл бұрын
decimated twice would be 19% off 100-(100/10)=90 90-(90/10)=81
@dolloptwerpandorange402
@dolloptwerpandorange402 5 жыл бұрын
O:08 Cary: Or I should say OUR lip reading AI *Soviet anthem starts playing*
@MarkGamed
@MarkGamed 5 жыл бұрын
We need the entire movie but with the AI instead of the actual audio EDIT: woah that’s a lot of likes
@agentstache135
@agentstache135 5 жыл бұрын
AI writes the music for the score for the Bee Movie, AI writes the script for the Bee Movie, AI animates the Bee Movie, AI makes a bad lip reading of the AI written Bee Movie, AI takes the bad lip reading of the AI written Bee Movie and writes a script to contextualize the random things, AI animates the contextualized script based on the bad lip reading of the AI written Bee Movie and animates it, and so _ad nauseam_
@NativLang
@NativLang 5 жыл бұрын
CMUdict strikes again! Looked to me like some successes here. Now you got me wondering if you'd go even further weighting words / word neighborhoods by commonness, or by taking morphosyntax into account. Oh, and so much yes to the sinking smiles at 3:54 - that slow letdown of throwing out a hopeful spike solution and watching it fail.
@hanako-kun22
@hanako-kun22 2 жыл бұрын
OH MY GOSH I GOT THE LIP READING RIGHT!!! BOTH OF THEM!! I am *GOD*
@marceltelang7825
@marceltelang7825 Жыл бұрын
wait why isn't your channel verified
@adnamamedia
@adnamamedia 5 жыл бұрын
I really like the animation at the beginning. I honestly laughed a few times cuz it was so charming
@AriaLunaCampbell
@AriaLunaCampbell 4 жыл бұрын
My technical mind: "This is pretty interesting." My linguistic mind, watching the section on the algorithm guessing syllables: "Please, for the love of everything, use the IPA! Ahhhhhhhh!" (To be clear, this is mostly a joke. At least he is using a standardized format for syllables. I just have this little part of my brain that's been spoiled by the IPA's unambiguous nature and figured there's probably someone else out there who'll get it.)
@vuxigeck5281
@vuxigeck5281 5 жыл бұрын
What a nice way to start off the year! Finding _yet another_ awesome channel I'm gonna be enjoying for a pretty long time, I think!
@janeylala
@janeylala 5 жыл бұрын
When you didn't understand anything but you still enjoyed the video. *THIS IS AMAZING! SO COOL!* Few mins later... *WHAT DOES THAT MEAN? WATEVER!*
@raball
@raball 5 жыл бұрын
the blurry voice actually sounds great. i would turn that into music so fast
@spikeus3570
@spikeus3570 5 жыл бұрын
14:16 Carykh: Quiet I want to talk! AI: LET ME TALK FIRST Carykh: Let me talk first, please *And then you loop this
@araceli7604
@araceli7604 5 жыл бұрын
3:53 me trying to have a normal conversation with someone Edit: Woah, that's a lot of likes...
@thecringeking873
@thecringeking873 5 жыл бұрын
Same here
@hanac5586
@hanac5586 5 жыл бұрын
this sounds exactly like me when I haven't slept in 24 hours but still have a lot to say
@deadbread3459
@deadbread3459 5 жыл бұрын
WhEn LibEarLs sPeAk tO mE tHeY sOuNd LIke ThaT XD XD WOW they ThInk Their so Gr8 :0) 😂😂😂😂😂😂😂
@Zorbeltuss
@Zorbeltuss 5 жыл бұрын
If you could increase or decrease the score of words based on context you could probably reduce the amount of errors that occur, also that can be trained on separate material in the form of text transcripts from other sources, making it easier to see if it hurts or helps.
@TheJustinator
@TheJustinator 4 жыл бұрын
"Automate their entire channel." That's another hint for your next channel: lazykh
@jenniferjoy31
@jenniferjoy31 4 жыл бұрын
happy 1 year of this!
@agentstache135
@agentstache135 5 жыл бұрын
There’s a Cosmo article about the video used titled “KZfaqr had one night stand with a woman, she lied afterwards about being pregnant with twins” if anyone wants to know the context of the video
@art1637
@art1637 5 жыл бұрын
Agent Stache what the fuck?
@43Jodo
@43Jodo 5 жыл бұрын
kzfaq.info/get/bejne/lbBnl6iZvtrYkoU.html Plug this into the Wayback Machine to actually watch the video. Asshole decided to delete it.
@agentstache135
@agentstache135 5 жыл бұрын
@@43Jodo How does that make him an asshole? Like it's something kinda personal and he probably just wanted it to be more as an update about why he wasn't gonna be a father to those who were following him at the time instead of a video for everyone to be able to see forever
@breakerboy365
@breakerboy365 5 жыл бұрын
what is going on lol
@Crudecoronet
@Crudecoronet 5 жыл бұрын
Agent Stache What are you talking about
@ThePotatoLlamaz
@ThePotatoLlamaz 5 жыл бұрын
You should try to make a similar program that converts audio into little animated mouth movements for animators
@ne01nvader
@ne01nvader 5 жыл бұрын
4:04 Don't blame poor computer, he is just trying to summon satan, nothing special.
@galric4270
@galric4270 5 жыл бұрын
I got the “have you got a moment” right 😃
@Officially_Unofficial-1
@Officially_Unofficial-1 5 жыл бұрын
10:13 I thought he actually died OOF
@HappyLeeHL
@HappyLeeHL 5 жыл бұрын
A really interesting idea. I had a similar idea some months ago but I couldn't do it myself. I think maybe you should focus on the link between words in order to create a meaningful sentence, like the KZfaq subtitle algorithm which can correctly transcribe audio to text most of the time. Combining that kind of algorithm with your lip reading idea, it might be good lip reading instead.
@kamaljotsingh6675
@kamaljotsingh6675 5 жыл бұрын
hey what about an AI to play Super Mario afap? that may break the wr.
@HappyLeeHL
@HappyLeeHL 5 жыл бұрын
@@kamaljotsingh6675 I've already made one, that can complete SMB almost as fast as the WR. kzfaq.info/get/bejne/prmZhMp5v86vmp8.html
@nyroysa
@nyroysa 5 жыл бұрын
Holy Moly you are that super mario TAS man
@HappyLeeHL
@HappyLeeHL 5 жыл бұрын
@@nyroysa Hi, nice to meet you here.
@LaskyLabs
@LaskyLabs 4 жыл бұрын
I think the data you used to train the ai is very useful. Thank you for making it public.
@kyrostick
@kyrostick 4 жыл бұрын
I like how I have no idea what Cary is talking about but I still watch it
@samkelson7990
@samkelson7990 5 жыл бұрын
I am actually currently trying to do the opposite. Using google speech recognition API and gentle(which I found thx to ur vid so thx) I am creating a lip syncing programming that will take audio from the mic, convert it into phonemes, then animate a character. Now that itself isn’t to hard but I want to do it live(live audio) so I am kind of struggling.
@blasttrash
@blasttrash 5 жыл бұрын
is the project on github?
@npric2883
@npric2883 5 жыл бұрын
Isnt that animoji
@samkelson7990
@samkelson7990 5 жыл бұрын
@@blasttrash no not yet
@Kitulous
@Kitulous 5 жыл бұрын
@@npric2883 animoji takes your picture and maps your muscle movement to a 3D model on a screen. Their project is to get the audio without the camera part and map it to a character on a screen.
@machodong6552
@machodong6552 5 жыл бұрын
Like vrchat?
@thatoneguy6139
@thatoneguy6139 5 жыл бұрын
Welp this is what I’m watching for the first vid of 2019
@a3dg638
@a3dg638 5 жыл бұрын
Fancy Spider same
@egg4861
@egg4861 5 жыл бұрын
Same bruhh
@ProfessionalTycoons
@ProfessionalTycoons 5 жыл бұрын
amazing how data preprocessing can aid the general problem formation.
@milesprower3488
@milesprower3488 5 жыл бұрын
0:03 It's The Captain from SpongeBob "are you ready kids, aye-aye captain! I can't hear you! AYE-AYE CAPTAIN! OHHHHHHHHHH!"
@an_annoying_cat
@an_annoying_cat 5 жыл бұрын
AI should learn to animate so Cary could be able to upload more often
@marcelinadelacruz8826
@marcelinadelacruz8826 5 жыл бұрын
COMP: LAUREL AI: YANNY I HEARD "THE EARTH IS NOT FLAT"!!!
@serglian8558
@serglian8558 5 жыл бұрын
You shouldn't reveal that you are deaf!
@greenwolf1363
@greenwolf1363 5 жыл бұрын
I hear covfefe
@paranormalstick2289
@paranormalstick2289 5 жыл бұрын
I heard commit order 66
@GrantAce
@GrantAce 5 жыл бұрын
Great Video, rlly impressive!!! Actually wrote a novella and there's a technology that reads people's lips in video, and you're the creator lolol I didn't even think that we were close to something like this being made...
@jondoe5323
@jondoe5323 4 жыл бұрын
Thanks for helping my project on a video that an AI makes. I need it to read a transcript and create accurate voice and face. It then creates a video off of seeing images of faces off of the internet
@thenimalu
@thenimalu 5 жыл бұрын
I live in Germany. It's Silvester. I am drunk. It's 6 am. I am watching Carykh. I hope I spell3d everything right. Happy new year!!!!
@godofdoor6558
@godofdoor6558 5 жыл бұрын
best ai
@adamyoung6797
@adamyoung6797 5 жыл бұрын
hsppy new yere
@LLAWLlET
@LLAWLlET 5 жыл бұрын
Frohes neues!
@data5023
@data5023 5 жыл бұрын
As soon as you said, "Which movie to pick," I instantly went, "It's Bee Movie, isn't it?" I've never seen Bee Movie to be honest.
@RandomNullpointer
@RandomNullpointer 5 жыл бұрын
Well, as you might have figured out already, speech is not only lips. There's the movement of the tongue, the pressure of the exhale and the tone controlled by the vocal cords level, etc.. This is why you've been having such a hard time with the ai... But it's a great and informative video as usual :) Thanks
@zod14c
@zod14c 5 жыл бұрын
omg i read his lips when he said "do you have a moment" perfectly
@ball56
@ball56 5 жыл бұрын
14:04 oh good, I have mono audio setting on.
@Lilli_B
@Lilli_B 5 жыл бұрын
this video is so last year
@krillbilly1435
@krillbilly1435 5 жыл бұрын
*C o m e d y*
@sappyme
@sappyme 5 жыл бұрын
Yeah I like the cool stuff from 2019 like the sequel to the Logan Paul suicide forest video and a sequel to fortnight
@izzypin942
@izzypin942 5 жыл бұрын
IN AN HOUR BOI
@zegamingcuber857
@zegamingcuber857 5 жыл бұрын
Izzy Pin TIMEZONES BOI
@imie-nazwisko
@imie-nazwisko 5 жыл бұрын
Way to start new year with a dad joke
@EandCheckmark
@EandCheckmark 3 жыл бұрын
I was scared when he pulled out the glider gun
@lafeo0077
@lafeo0077 5 жыл бұрын
this channel is under rated.
@ecicce6749
@ecicce6749 5 жыл бұрын
I think the AI works pretty well for the amount of information it has. I guess you could only improve it by choosing the correct words based on grammar and context and what words most likely are next to each other. Also an additional System to output back to audio using a network that is trained on combining lip movement and the detected phonemes into input for a network(easy trained autoencoder) that outputs your voice would make the Project complete. Would loooove to see that.
@zib350
@zib350 5 жыл бұрын
I strongly agree with the word choosing idea!
@brandonchan5387
@brandonchan5387 5 жыл бұрын
When he said "our lip reading AI" I was like "CARY'S JOINED THE UNION, COMMUNISM SHALL RULE THE WORLD" then I realised he was talking about him partnering with his capitalist friend and my hopes were dashed.
@Chizypuff
@Chizypuff 5 жыл бұрын
I nailed "Have you got a moment" but I had to watch it 3 times to make sure
@Sicira
@Sicira 5 жыл бұрын
YEEESSSS I knew he was going to read the bee movie script and I WAS SO HAPPY HE DID WHEN I FOUND OUT props to you man
@calebquadrio1131
@calebquadrio1131 5 жыл бұрын
Just saying I can lip read and the reason I can’t tell what ur saying is because no one talks like that
@RyBrown
@RyBrown 5 жыл бұрын
caluppy he was over pronouncing words and that made the AI confused I think.
@colex1222
@colex1222 5 жыл бұрын
@Radium X I was able to get Vanessa
@liamharrison8285
@liamharrison8285 5 жыл бұрын
HELL LO PEE PULL
@ganaraminukshuk0
@ganaraminukshuk0 5 жыл бұрын
@@d0nnyr0n purgatory medium toilet water stationary
@thejay8963
@thejay8963 5 жыл бұрын
Alive no direction vomit ripping
@wallacebell9719
@wallacebell9719 5 жыл бұрын
I'm proud of you, you could have used the bee movie script as clickbait, but you didn't! Good job!
@TGOS-Official
@TGOS-Official 10 ай бұрын
BFDI references 3:44, 5:27 "yeah i know she was so surprised" is the first line spoken in bfdi (by match) 12:40 flower's announcer crusher brief 15:39 "take the plunge" is the bfdi 1a name (yes i did watch the whole video four times [twice with captions], so what?)
@ToHellWithReality
@ToHellWithReality 5 жыл бұрын
9:45 Uhh... What's that censor bar supposed to be covering? Because I don't think it did what it was supposed to do.
@prokaryotesys
@prokaryotesys 5 жыл бұрын
ToHellWithReality their emails, I think.
@ToHellWithReality
@ToHellWithReality 5 жыл бұрын
@@prokaryotesys I know that, but I didn't want to spell it out for two reasons. First, I didn't want to make it obvious for people looking for that kind of info. Second, comedic effect.
@krucible4889
@krucible4889 5 жыл бұрын
@@ToHellWithReality just r/woosh them
@prokaryotesys
@prokaryotesys 5 жыл бұрын
@@krucible4889 oof i got wooshed thats one of my life goals tho
@betin731
@betin731 5 жыл бұрын
@krucible r/itswooooshwithfouros
@migs1336
@migs1336 5 жыл бұрын
0:09 cause I'm communist Edit: 2:32 he uses the URSS to convert it to spectrogram two communist references in one video
@Kitulous
@Kitulous 5 жыл бұрын
URSS = ur SS
@RichardRMM
@RichardRMM 5 жыл бұрын
@@Kitulous mein leben
@denischikita
@denischikita 4 жыл бұрын
I think you need to train netwot not only with lips, but with throat too. Because a lot of sounds became from vocal cords only
@sothisisbasicallyhow4696
@sothisisbasicallyhow4696 5 жыл бұрын
GOSH DANG IT LIZA
@alphabbbe8580
@alphabbbe8580 5 жыл бұрын
HAPPY NEW YEAR!!!
@aidanstg445
@aidanstg445 5 жыл бұрын
TofuMaster83 Happy new year!!! (In 1 hour for me)
@swordchicken5629
@swordchicken5629 5 жыл бұрын
and happy birthday bfdi!
@Willam_J
@Willam_J 5 жыл бұрын
Happy New Year to you and everyone else, as well.
@iAmTheSquidThing
@iAmTheSquidThing 5 жыл бұрын
I feel like this could be really useful if combined with voice recognition. For automatically generating synchronised video captions.
@xyanprod
@xyanprod 3 жыл бұрын
you inspired me to read the bee movie transcript
@DPedroBoh
@DPedroBoh 5 жыл бұрын
Godamnit, when you told you were going to read a whole movie script i told myself, oh no, he's doing it, isn't he? Was not disappointed.
I lost $20,000 because We are Number One
17:56
carykh
Рет қаралды 543 М.
Does my AI have better dance moves than me?
20:33
carykh
Рет қаралды 1,1 МЛН
БОЛЬШОЙ ПЕТУШОК #shorts
00:21
Паша Осадчий
Рет қаралды 7 МЛН
Homemade Professional Spy Trick To Unlock A Phone 🔍
00:55
Crafty Champions
Рет қаралды 62 МЛН
Haha😂 Power💪 #trending #funny #viral #shorts
00:18
Reaction Station TV
Рет қаралды 15 МЛН
Универ. 13 лет спустя - ВСЕ СЕРИИ ПОДРЯД
9:07:11
Комедии 2023
Рет қаралды 6 МЛН
The worst lie Mickey Mouse has ever told
13:27
carykh
Рет қаралды 2,1 МЛН
AI Learns to Write Rap Lyrics!
16:03
carykh
Рет қаралды 1,5 МЛН
How to study Japanese (no bullsh*t guide)
11:23
Tomek Cegielski
Рет қаралды 13 М.
A.I. Learns to DRIVE
16:17
Code Bullet
Рет қаралды 6 МЛН
Creating my own customized celebrities with AI
14:56
carykh
Рет қаралды 554 М.
These Google AI experiments are crazy!   This is the future.
13:06
Boyinaband
Рет қаралды 1,2 МЛН
Cary explores time travel further (EXPLICIT)
9:42
carykh
Рет қаралды 1,2 МЛН
The first artificial intelligence I ever made
17:05
carykh
Рет қаралды 815 М.
Animation vs. Geometry
9:17
Alan Becker
Рет қаралды 4,6 МЛН
How The Password Game was beaten in 59 characters
20:50
SlashedPort
Рет қаралды 2,5 МЛН
Я ВЗЯЛ МАН ЮНАЙТЕД НА 10 СЕЗОНОВ...
30:20
Вор неудачник ( Just Another Night Shift )
19:12