Microsoft's New REALTIME AI Face Animator - Make Anyone Say Anything

  Рет қаралды 181,402

AI Search

AI Search

Күн бұрын

Animate any photo with any audio in REALTIME!
#microsoft #vasa #vasa1 #openai #ai #agi #singularity #ainews #deepfake #aitools
www.microsoft.com/en-us/resea...
Newsletter: aisearch.substack.com/
Find AI tools & jobs: ai-search.io/
Here's my equipment, in case you're wondering:
GPU: RTX 4080 amzn.to/3OCOJ8e
Mic: Shure SM7B amzn.to/3DErjt1
Secondary mic: Maono PD400x amzn.to/3Klhwvu
Audio interface: Scarlett Solo amzn.to/3qELMeu
CPU: i9 11900K amzn.to/3KmYs0b
Mouse: Logi G502 amzn.to/44e7KCF
If you found this helpful, consider supporting me here. Hopefully I can turn this from a side-hustle into a full-time thing!
ko-fi.com/aisearch

Пікірлер: 905
@_Pikalika_
@_Pikalika_ Ай бұрын
Finally, a way for me to hear my dad say he's proud of me
@paudiyal
@paudiyal 26 күн бұрын
But you’ll still need audio of your dad saying that to you. Good luck 😛
@DFMoray
@DFMoray 25 күн бұрын
MVP comment of the year
@ShoyaChan
@ShoyaChan 25 күн бұрын
@@paudiyal Or you could use something like ElevenLabs and synthesize the voice to say whatever you want in the first place
@paudiyal
@paudiyal 25 күн бұрын
@@ShoyaChan what is that?
@antonost5055
@antonost5055 24 күн бұрын
Ha😅😅😅😅😅😅
@ErikBongers
@ErikBongers Ай бұрын
The biggest revolution that AI is going to result in, is an insatiable desire for authenticity.
@charliecarpenter2840
@charliecarpenter2840 Ай бұрын
I'm surprised there isn't a large offline movement yet, or maybe there is, it's just not advertised. Personally moving away from internet, harder to find deeper than surface layer technical information now and no real uncensored discussion to be had.
@jakecob864
@jakecob864 Ай бұрын
So so so true
@Gary_Hun
@Gary_Hun Ай бұрын
One can always hope public standards haven't been lowered too much yet for that.
@MikeD-tf7dk
@MikeD-tf7dk Ай бұрын
Very well put
@Diametricallyopposed00
@Diametricallyopposed00 Ай бұрын
Yep, I already have it. I think people are tired of lies, marketing, influencers, persuasion. Everyone is looking for the truth, it’s hard to find.
@redacted629
@redacted629 Ай бұрын
Up next... 300,000 people arrested for a variety of crimes due to online "evidence".
@Shunn3d
@Shunn3d Ай бұрын
This is the plot of Ryan Gossling's action drama "Fall Guy". Thanks to today's AI software fabricating video evidence he is framed for murder and on the run. The movie is still in theaters as I type this comment, and by the way, much better than kingdom of the planet of the apes.
@bran7134
@bran7134 29 күн бұрын
Exactly, but it'll probably start with political figures and then those who fit the profiles of whoever is considered the enemy.
@DecrepitBiden
@DecrepitBiden 29 күн бұрын
Trump, with all audio & video "evidence". Look at all the fake charges now, without AI.
@hanskurtmann6781
@hanskurtmann6781 27 күн бұрын
You will never get convicted by online evidence alone. Fact is online evidence legally doesn't have that much weight. Take Ring cameras the footage just helps but LE still needs physical evidence. Been a state motor officer for 20rys even traffic cams only help investigations they do not solve them.
@StrangeScaryNewEngland
@StrangeScaryNewEngland 26 күн бұрын
@@hanskurtmann6781 There was a guy who threatened to kill me years ago (he was friends of my old boss who was an asshole and I ended up quitting and told him off in the process, which made his old man friend and regular try to find me and scare me for telling off my boss). I called the cops and they couldn't really do anything UNTIL the neighbor called them saying some strange old man was banging on their door asking for someone by my name (he didn't know what house I lived in). The cops looked at their footage and called me back, asking if that was the same guy. I told them yes. After that, they paid him a visit and threatened to arrest him, and told him to grow up and act his age (he's like 75). Long story short, the Ring camera next door was enough evidence to go after this guy.
@wordpressobsessed9067
@wordpressobsessed9067 Ай бұрын
The potential for this technology is staggering, but also the potential for misuse is even moreso.
@kennymichaelalanya7134
@kennymichaelalanya7134 29 күн бұрын
This could replace a person on a Z00m meeting lol
@wordpressobsessed9067
@wordpressobsessed9067 26 күн бұрын
@@kennymichaelalanya7134 or replace every talking head on every newshow on cable tv
@kennymichaelalanya7134
@kennymichaelalanya7134 26 күн бұрын
@wordpressobsessed9067 Correct, Tim Pool today on IRL cast said the meta for the future will likely be clips of A.I combine with CGI relaying the news.
@ubermensch0072
@ubermensch0072 19 күн бұрын
Whzt possible good can come from this? Please, name one thing this can do that will be a positive benefit for humanity or even life at all?
@ButcherSevenActual
@ButcherSevenActual Ай бұрын
All I see is the terrifying new world where we go in circles for hours dealing with a smiling AI customer service talking head at the DMV, hospital, post office, or return counter.
@darrylelkins681
@darrylelkins681 Ай бұрын
You are exactly right, nothing to like about this garbage......Just more crazy BS to put up with......Im looking for the human side of life, not this crap.
@charliecarpenter2840
@charliecarpenter2840 Ай бұрын
Pretty much there already without AI
@HisXLNC
@HisXLNC Ай бұрын
So basically no change. 😂
@barnabybot
@barnabybot Ай бұрын
Yep. And then someone in the civil service will decide they need a physical human face and we'll end up talking to an AI android with zero empathy.
@DrSteveIB
@DrSteveIB Ай бұрын
Time to make your own BOT to deal with their BOTs
@sanseverything900
@sanseverything900 Ай бұрын
Every day Black Mirror is feeling less and less "fictional".
@theAIsearch
@theAIsearch Ай бұрын
Yes!
@alexpitibalrog2909
@alexpitibalrog2909 Ай бұрын
So true !
@nutterknoll69
@nutterknoll69 Ай бұрын
Yeah. Looks like the Royals used something like this for Princess Kate to give her little talk about her having cancer. I wonder where she really is?
@Innesb
@Innesb Ай бұрын
@@nutterknoll69No, someone just jumped to the wrong conclusions about a low-res version of the video.
@nemesisone8927
@nemesisone8927 Ай бұрын
@@theAIsearch nvidia has the same check it out.
@frozencreed
@frozencreed Ай бұрын
So far, the main thing that tips me off is the teeth changing sizes during speech. It seem subtle at first but then you can't unsee it.
@--Mike--
@--Mike-- Ай бұрын
I was going to say the exact same thing. Once you noticed it, you'll just watch their teeth the whole time.
@kurtisjohnson9530
@kurtisjohnson9530 Ай бұрын
Interesting. I’ll look for that. There are different tells for different clips for me. One simulated female’s bone structure narrows and widens when speaking. Transitions are not always smooth, with too sudden jumps. One “lifelike” examples skin and face looks drawn and painted (not one of he drawn styled faces). In other cases there is just something off.
@ColbyBlack
@ColbyBlack Ай бұрын
Not only is this true, I have been noticing for a while now about that quality of video on certain videos, where everything seemed real everywhere I looked except when I looked at the mouth. Utilize an experiment with digital clone technology, so I’m aware of what it’s typically required to get that level and usually requires training on a video of you so this is quite advanced if it’s simply using the picture alone. I suspect groups have been able to Use versions of this for media for at least the last year if not longer
@chino1127
@chino1127 28 күн бұрын
Same, noticed it as well. Also the mouth appears more “open” in these models - I feel like the lips should touch more in natural speech, but damn just imagine how much better it will get if this is the start.
@jonathanmeraz1190
@jonathanmeraz1190 28 күн бұрын
Another tell is those shots showing shoulders. The shoulders move much more than normal. So, up close the movement looks normal, but the teeth change. Further back, the movement isn’t normal.
@misiopuchatek152
@misiopuchatek152 Ай бұрын
What could possibly go wrong?!
@carultch
@carultch Ай бұрын
People can use this to falsely accuse you of saying and doing things you didn't say or do.
@misiopuchatek152
@misiopuchatek152 Ай бұрын
@@carultch it was a sarcasm.
@sicfrynut
@sicfrynut 28 күн бұрын
@@misiopuchatek152 does AI understand sarcasm ? that , in itself will become a problem.
@TimotejFedlimid-zo3hy
@TimotejFedlimid-zo3hy Ай бұрын
This will come in handy when Skynet announces it has become sentient, and the missiles have been launched.
@boryman2999
@boryman2999 Ай бұрын
it will happen so fast, no one will be able to know what is real
@djslip_irie
@djslip_irie Ай бұрын
It’s already happened.
@steiner554
@steiner554 Ай бұрын
That's already the case with photos and videos. Very dangerous.
@tonybp
@tonybp Ай бұрын
Still looks weird, but as we all know, it's just a matter of time.
@coreywolfh8rt506
@coreywolfh8rt506 Ай бұрын
Looks a bit better than what the reface app does but impressive that you can make them say anything.
@astrakio
@astrakio Ай бұрын
All these companies aren't gonna release their research tech until someone builds an open-source equivalent. They want someone else to open Pandora's box and absorb the legal liabilities first.
@theAIsearch
@theAIsearch Ай бұрын
interesting!
@carkawalakhatulistiwa
@carkawalakhatulistiwa Ай бұрын
They just need move to other country
@TheWarsuron
@TheWarsuron Ай бұрын
the box should have never been opened it is too dangerous and will lead only to more suffering and totalitarian control
@OneAndOnlySurge
@OneAndOnlySurge Ай бұрын
From now on everytime I talk on camera, I'm going to stick out my tongue or cover my eye with my hand or do something ridiculous with my face. AI facial expressions are aiming for "normal" so I will challenge it by being abnormal so people know it's me 😜
@eh3345
@eh3345 27 күн бұрын
I think you will soon have to start walking around with your hand over your eye all day.
@jasony549
@jasony549 Ай бұрын
that mona lisa is scary af
@theAIsearch
@theAIsearch Ай бұрын
lol
@SDW90808
@SDW90808 Ай бұрын
Pretty sure the voice was Anne Hathaway doing an impression of L'il Wayne.
@chris24gone
@chris24gone Ай бұрын
this is awesome in the same way standing on the edge of mount vesuvius must have been seconds before it awakened
@ssekagratius2danime369
@ssekagratius2danime369 Ай бұрын
we are finished
@AdamIverson
@AdamIverson Ай бұрын
I don't think so. It's only research paper with absolutely no intention of releasing it, ever. It's stated right there on their research paper. It's more of "look what we can do, but you can't have it!"
@DjHazardous
@DjHazardous Ай бұрын
*Not yet but every year it's encircling humanity more and more*
@theAIsearch
@theAIsearch Ай бұрын
even if msft doesnt release it, someone else will likely release an open source version soon
@LukasPetry
@LukasPetry Ай бұрын
@@AdamIverson a matter of months until someone else releases something like this
@carkawalakhatulistiwa
@carkawalakhatulistiwa Ай бұрын
The great time to be alive
@iblackfeathers
@iblackfeathers Ай бұрын
the facial expression's motion is still unnaturally smooth, meaning no twitches or spontaneous movement. but it's very convincing if you're not looking for it.
@reyals66
@reyals66 Ай бұрын
That's true, it looks similar to how characters from animated movies talk. However, I can't wait to see how how it's gonna shape up in a year or two.
@theAIsearch
@theAIsearch Ай бұрын
good catch. there are also some minor flaws near the edges in some cases
@marzoval9551
@marzoval9551 Ай бұрын
Yeah, but it's not enough to drag it down back into the uncanny valley for me. This like 95% out of that valley.
@ssekagratius2danime369
@ssekagratius2danime369 Ай бұрын
you wont know if they dont tell you that its AI generated
@yassersaeed2010
@yassersaeed2010 Ай бұрын
I can tell it's fake from the tooths, eyes movements and some head movements
@avinashjagdeo
@avinashjagdeo Ай бұрын
Just because you can do something doesn't mean you should.
@kobayashimaru8114
@kobayashimaru8114 Ай бұрын
Unfortunately it's part of human nature to feel compelled to do something just cause we can. This is a perfect example of "oh I'll make it but I don't plan to do anything bad with it so it's ok". It's out there now even if it's unreleased.
@spectaclesociety
@spectaclesociety 26 күн бұрын
End of discussion.
@StrangeScaryNewEngland
@StrangeScaryNewEngland 26 күн бұрын
@@kobayashimaru8114 Splitting of the atom or guns (originally designed for hunting, not killing other humans. I'm not talking about Chinese weapon invention)
@JaneDoe-yb8lo
@JaneDoe-yb8lo Ай бұрын
They look good, though I notice the teeth stretch in weird ways when the face goes into a wider smile or more open mouth pose.
@francoisbruel9163
@francoisbruel9163 Ай бұрын
Didn't really noticed at first, but when you know, this teeth stretching gets really creepy! There is often one tooth that gets all wide…
@Zarrick
@Zarrick Ай бұрын
4:00 Pretty easy to spot when she laughs. Creepy.
@4saken404
@4saken404 Ай бұрын
I also spotted some errors in the ears. But not every model has their ears very visible. And really I only knew to look because I knew these were AI generated. If I wasn't paying close attention or if these were background characters it's very unlikely I would have known anything was amiss.
@aaronjaggan
@aaronjaggan Ай бұрын
​@@Zarrickmaybe i need to see it other than my small screen phone.
@beldavius
@beldavius Ай бұрын
The folks that are developing this stuff grew up watching movies like The Terminator and The Matrix, so i have to ask, what the hell are they thinking?!
@lefty22l
@lefty22l Ай бұрын
Technically speaking this is incredibly impressive. That being said, I wish it didn't exist and never gets released.
@scottfindley1345
@scottfindley1345 21 күн бұрын
pretty much could say same about virtually all AI so far. Wheres my utopia? All i see is danger, dinimishing the value of all art everywhere with good enough, and very, very troubling technology in an already way-too-confusing-for-most world.
@AJ-vi4nl
@AJ-vi4nl 20 күн бұрын
@@scottfindley1345 All my life I've felt the world was very confusing and sometimes doesn't make such sense and I'm a 90s kid! and now they're making the world waaay more confusing with this tech, I feel so bad for the kids that will grow up with this nightmare.
@straighttalk2069
@straighttalk2069 Ай бұрын
Until they can figure a way to let people know it's artificially generated, this tech needs to stay behind closed doors.
@AdamIverson
@AdamIverson Ай бұрын
Have we figured out a way to let people know it's artificially altered with Photoshop? It has literally been around for decades.
@theAIsearch
@theAIsearch Ай бұрын
they could watermark the outputs. however, even if they do, im sure someone else will likely release an open source competitor soon without watermarking, and you could do some dangerous things with it
@G0DKILLER_
@G0DKILLER_ Ай бұрын
@@theAIsearch there are literal AI tools to remove watermarking
@PowerRedBullTypology
@PowerRedBullTypology Ай бұрын
@@theAIsearch or someone could release a tool to take the watermark out
@g-maChez
@g-maChez Ай бұрын
Heck they don't even have to tell you there's lead/mercury in vax's, mrna in pork and lettuce, divulge what's being sprayed all day/every day on us, or that there's bugs and microplastics in meat! But I agree with you!
@thcoura
@thcoura Ай бұрын
I see only the future of terrible generated HR training
@drone_video9849
@drone_video9849 Ай бұрын
that future is already here, these are actually better than the ones they give us today.
@SwitzerlandUnfiltered
@SwitzerlandUnfiltered Ай бұрын
Yeah. I was thinking the same.
@isaacsmithjones
@isaacsmithjones Ай бұрын
Don't worry, they won't be hiring humans
@TheRealDemocat
@TheRealDemocat Ай бұрын
there's a single digit number of ethical ways to use this and at least a thousand unethical ways to use this. it's jenky enough right now that it won't cause any real problems but 2 or 3 years from now this is gonna be a MAJOR issue
@theAIsearch
@theAIsearch Ай бұрын
true
@alexpitibalrog2909
@alexpitibalrog2909 Ай бұрын
Or 6-12 months
@Lerppunen
@Lerppunen Ай бұрын
People will quickly learn not to believe that the videos and audio they see and hear are real.
@nickchua5772
@nickchua5772 Ай бұрын
@@LerppunenRIP online dating😂
@bartnachtuitkijker7056
@bartnachtuitkijker7056 Ай бұрын
Exactly right. The statement of Microsoft saying "not yet releasing" conditioned on "when it is responsible to do so" is, given the fact AI is unregulated, equivalent to either "never" or "as we please". Not convincing in any way.
@anotherhurayra2024
@anotherhurayra2024 Ай бұрын
What will be so scary is not knowing if you are speaking with a real persson or not on the phone or even in a video call.
@theAIsearch
@theAIsearch Ай бұрын
Only real world interactions will be legit. Even then, you'll need to pinch each other's faces to make sure it's not steel underneath.
@Bob3D2000
@Bob3D2000 Ай бұрын
@@theAIsearch Assuming this is all real in the first place.
@borstenpinsel
@borstenpinsel Ай бұрын
"Yo, I need to know you're legit, insult me" - "of course I am real, my creators made sure...I mean... insulting another living breathing human being is not within the limits of our code of conduct, I'm afraid" Or even asking them the same question and checking if their response sounds more and more annoyed or not
@pbjam2182
@pbjam2182 Ай бұрын
Mass customer service rep layoffs coming soon.
@jimmtech
@jimmtech 29 күн бұрын
​@pbjam2182 I absolutely detest having to bypass the computer assistant when calling customer service. More and more companies use this now, and it's harder to get an actual person. I'm so tired of yelling agent, or operator! It's more maddening when the voice says I understand you want to talk to an agent, but first I'll need a little more information. Whoever created this can go to hell and rot!
@joelpeterson8424
@joelpeterson8424 Ай бұрын
We no longer have agency over our own face, over our selves.
@AAjax
@AAjax Ай бұрын
Ugg, the inventor of the wheel, isn't going to release the wheel until he figures out a way that nobody can use the wheel for things that Ugg disapproves.
@Bob3D2000
@Bob3D2000 Ай бұрын
I thought Ugg invented oversized, boot-shaped slippers? :p
@steiner554
@steiner554 Ай бұрын
This is a very dangerous tool. Imagine using a picture of a powerful leader, create a text and generate it with his voice. Now you can make them say anything that could lead to tensions between countries or even war when tensions were already high.
@sicfrynut
@sicfrynut 28 күн бұрын
all we had in the 70's was the ever infamous "party line for the telephone."
@gmorf33
@gmorf33 26 күн бұрын
I think the inverse is even worse. Catch an authoritative figure saying something incriminating and they can just write it off as fake. You fracture reporting and evidence to the realm of choosing whether or not it's true/real by what aligns with your preexisting beliefs. Humans already have that bias problem. This tech makes that orders of magnitude worse
@eSKAone-
@eSKAone- Ай бұрын
Now that Bethesda is owned by Microsoft there should be nothing in the way for better facial animations in Fallout and Starfield
@RustOnWheels
@RustOnWheels Ай бұрын
The day I will stop using internet is getting closer every day.
@heyjeySigma
@heyjeySigma 28 күн бұрын
I upvoted u but i also think ull NEVER ever reach that day lol. Eeeeverything relies on the net these days. From job searching to entertainment to news, everything. People dont even read actual paper newspapers anymore
@RustOnWheels
@RustOnWheels 27 күн бұрын
@@heyjeySigma Yeah I know what you mean but I’m actually soft planning to move into the wilderness in a few years to stop with all the modern shenanigans. I’ll never be truly 100% internet free but it will come very close to it.
@kylek29
@kylek29 Ай бұрын
One *good* use of this technology when combined with the voice synthesizers is that it can bring a whole new level of "dubbing" for movies/shows. Being able to have the original actor voices and recreating the mouth motion to the translation will open up a lot of content.
@AJ-vi4nl
@AJ-vi4nl 20 күн бұрын
I think that's the only good thing this can bring, but sadly the negatives far outweigh the positives.
@kennethfullerton7232
@kennethfullerton7232 Ай бұрын
Imagine all of the people (especially seniors) that will be scammed by people using AI animated FB profile pics for nefarious purposes.
@PatrickHoodDaniel
@PatrickHoodDaniel Ай бұрын
Noticed if the picture doesn't have teeth, it makes approximations, but still pretty believable.
@PowerRedBullTypology
@PowerRedBullTypology Ай бұрын
I find the teeth often quite fake looking, as it's usually way too white, the darkness in the mouth is way too dark
@PatrickHoodDaniel
@PatrickHoodDaniel Ай бұрын
@@PowerRedBullTypology True, also the teeth separations show a kind of aliasing.
@observingsystem
@observingsystem Ай бұрын
It's really amazing. The only thing I notice is that the teeth in some of them are moving along with the mouth, in a sort of harmonica like way, which freaked me out a bit. But I imagine the next version will have that covered as well. I'd love to be able to use it for my music videos, but I can understand it's not being put out yet for the people and I think it's probably a good thing. For every person who'd like to do something creative with it, there will be 500 scammers that will just use it to try to get easy money. I think they need to make the software somehow so it puts a code in everything it generates that people can't remove but that shows it's AI generated.
@SupaRush
@SupaRush Ай бұрын
I can only imagine all the webcam streamers making so much money off of this when they really look nothing like these people
@lolgoodbye8197
@lolgoodbye8197 Ай бұрын
I think we can still know whether it is AI or not because it is trying to look too realistic, like the way the people move a lot while talking is just unrealistic, some people examples where talking with huge scary smiles all the time, it's not joever yet
@AjaySR56789
@AjaySR56789 Ай бұрын
Very serious threat to peace & harmony in the world.
@webtrekkeruk2487
@webtrekkeruk2487 Ай бұрын
What peace & harmony?
@willrsan
@willrsan Ай бұрын
I think people will have to agree upon passwords and security questions so that when you receive a video call you will be able to confirm you are actually talking with the person you think you are
@wtcbd01
@wtcbd01 Ай бұрын
Being in the Tech industry for over 20 years and seeing the phenomonal growth of technology, I have never been more suprised to see the actual GIFTS being directly handed out to Scammers and the like. However, my family and friends decided over a year ago to have a weekly/monthly changing simple "Word". For Example to prove you are actually who you say you are you may have to use the word in a sentence like: It would be great to have tea with "Lemons", with "Lemons" being the weekly word. very simple and not easily hacked.
@BadgerLaser
@BadgerLaser Ай бұрын
as a layperson its all getting very slick at a terrifying pace - though i think in these example videos the teeth seem to constantly change width relative to each other .. .. . . .
@kennyfordham6208
@kennyfordham6208 27 күн бұрын
It's good, but there are a couple of give-aways. 1) Head movement is very limited. And when the head does move, the hair and background distort quite noticeably. 2) Real humans have muscles, under their skin, which produce natural movement. With the AI faces, the expressions look 'pasted on'. The mouth may smile, but the rest of the face doesn't move much.
@Wazza555
@Wazza555 25 күн бұрын
It is still in its infancy. I remember when mobile phones were in their infancy. Look at them now. AI is catching up very quickly.
@ChucksGhost01
@ChucksGhost01 Ай бұрын
First this...next my clothes, my boots, and my motorcycle.
@Molandria
@Molandria Ай бұрын
I would love to utilize this for my streaming. Not to mimic others, but to animate my own character. ^_^
@theAIsearch
@theAIsearch Ай бұрын
good idea!
@Jerome616
@Jerome616 Ай бұрын
For the first time ever, my brain actually started to be tricked into thinking it was a human… holy cow.
@stephenhan9680
@stephenhan9680 Ай бұрын
It’s very impressive now they throw in the settle emotion and movements. Though the hair and lip is still a giveaway if you *really* pay attention otherwise it’s scarily convincing.
@princepeterwolf
@princepeterwolf Ай бұрын
My only question is "why would you make something like this? Who asked for this?" This is so scary ...
@karlhendrikse
@karlhendrikse Ай бұрын
Same reason anyone ever climbed a mountain
@superpig5000
@superpig5000 29 күн бұрын
The copying someone's voice thing has actually no purpose except for scamming If you think about it and maybe some jokes
@luciengrondin5802
@luciengrondin5802 Ай бұрын
13:22 "It's awesome..., so we won't release it." Great meme.
@ianmlclm7044
@ianmlclm7044 Ай бұрын
Can you get a depth map from it, or shoot in stereoscopic mode to be played in stereo 3d?
@HisXLNC
@HisXLNC Ай бұрын
The one glaring flaw is the source audio sometimes doesn’t match the space where the source photo was taken. For example, seeing someone outside but their voice sounds like they were recorded in small room or recording booth.
@ssekagratius2danime369
@ssekagratius2danime369 Ай бұрын
This can make it harder to distinguish between what's physically real and what's digital.
@TomNook.
@TomNook. Ай бұрын
We are approaching the age where anything on screen will be treated as not real.
@hitmusicworldwide
@hitmusicworldwide Ай бұрын
If it's on a screen it's digital. When it's live in front of you it's real. So get out into the world and put the screens down.
@danisob3633
@danisob3633 Ай бұрын
@@hitmusicworldwide oh yea OF COURSE. how didnt we think about that! we could just be everywhere at the same time to know whats real and whats not and not miss any important info we should know!
@ssekagratius2danime369
@ssekagratius2danime369 Ай бұрын
@@hitmusicworldwide have you ever watched the Matrix movie?
@ssekagratius2danime369
@ssekagratius2danime369 Ай бұрын
@@danisob3633 yeah, its going to be hard to trust every video
@briannaporter
@briannaporter Ай бұрын
Let's see them do a black girl neck roll! 🤣
@GariSullivan
@GariSullivan Ай бұрын
I noticed the tongue isn't used very much. Look for the "L" and the "th" sounds. The lips move correctly, but there is not enough tongue movement or tongue placement for the sounds to be really generated.
@darksoulnj
@darksoulnj Ай бұрын
What's the best current, AVAILABLE program to accomplish this. Especially for singing avatars? Thanks in advance!
@ancapistaomarxista4071
@ancapistaomarxista4071 Ай бұрын
This need to be open source. Open source everything!
@theAIsearch
@theAIsearch Ай бұрын
i hope so!
@fodiographer
@fodiographer Ай бұрын
3 reasons why they will not do this: deepfakes deepfakes and deepfakes. Unfortunately people can't be trusted to use this responsibly. Microsoft will shoot themselves in the foot if they release it because than you will be sure that politics will get involved if it being used for deepfakes and as a result they will restrict AI more than it is now.
@yahanaashaqua
@yahanaashaqua Ай бұрын
If it's not a non profit l, you can't expect it to be open sourced
@stefantervoort475
@stefantervoort475 Ай бұрын
What could go wrong?!
@shelleyreynolds5810
@shelleyreynolds5810 Ай бұрын
This, actually, is terrifying.
@Heartwing37
@Heartwing37 Ай бұрын
People have a difficult time enough distinguishing between fact in fiction even now. Soon no one will know what reality actually is.
@nuralif1
@nuralif1 Ай бұрын
The World in 2024 is literally in a Cold war of AI development, in only few months we got so much AI technologies developed, imagine what 2025 would be like?
@mvstermlnd
@mvstermlnd Ай бұрын
You know whats sad about it, corrupt politicians who will do bad things will claim it was ai, thieves will claim it was ai, pornstars will claim it was ai, cheaters will claim it was ai, + on top of that itll be used to scam people. i love ai, but i dont understand what was the justification to make this ai, this specific ai, i can see only 1 good case use, to bring "alive" dead people from one picture someone might have, other than that, why would they create something that all it needs is a picture and it becomes you? Maybe cuz they already have our profile pictures stored in social media? They will be able to sell you and me without us ever knowing. Fuckin hell. Thank god im not a youtuber, theyre all fucked, all they need is a pic and 5mins of your voice. Id suggest every youtuber to wear submarine googles all the time and use a voice changer, regardless how shitty the voice changer might be, otherwise if someone really hates them, theyll do pretty bad stuff, even if this never gets published, theres tons of similiar tools.
@nuralif1
@nuralif1 Ай бұрын
@stephanieellison7834 I might just go ahead and use AI to summarize your reply😭
@superpig5000
@superpig5000 29 күн бұрын
​@stephanieellison7834 You're worried about AI taking jobs when stuff like neurolink's coming out? Be more concerned that everyone's just gonna become a robot. It'll gradually happen and we don't even realize it. There will be a sentient A I it will be us, once we start putting these ai chips in babies. Brain development won't develop. We will be like a robot. We won't think we will just do
@agnosticatheist4093
@agnosticatheist4093 Ай бұрын
What a time to be alive!
@theAIsearch
@theAIsearch Ай бұрын
😃
@anotherhurayra2024
@anotherhurayra2024 Ай бұрын
This is crazy as this is just a alpha version of this technology.. I think the fact that the audio is so fluid and the video emotion is also really good.
@joshua1846
@joshua1846 Ай бұрын
Good to know that people can make anything about anyone, this is concerning... it feels like a violation of persona...
@choppergirl
@choppergirl Ай бұрын
The whole audience started laughing when you said a 4090 was a GPU anyone could get....
@greyeyed123
@greyeyed123 Ай бұрын
What possible moral or positive application could this have?
@MarioRodriguez-gv9km
@MarioRodriguez-gv9km 27 күн бұрын
Making your own movies with a few text prompts instead of being stuck with Hollywood? Yeah I can think of a million reasons actually that are good, useful and moral.
@greyeyed123
@greyeyed123 27 күн бұрын
@@MarioRodriguez-gv9km Making anyone say anything for anyone. I'm sure this will end well.
@Atsolok
@Atsolok 18 күн бұрын
Facial expression is still kinda dreamy but looks good already. Kinda reminds me of the “Good Doctor” except with more facial expression
@BAAPUBhendi-dv4ho
@BAAPUBhendi-dv4ho Ай бұрын
The longer i watch the video, the more scared i become
@ssekagratius2danime369
@ssekagratius2danime369 Ай бұрын
im scared
@johnbutler4631
@johnbutler4631 Ай бұрын
It does still look weird, but as someone else has pointed out, you might not notice if you're not looking carefully. As someone else said, it's probably just a matter of time, and that is terrifying. Better not make any enemies.
@coolcool2901
@coolcool2901 Ай бұрын
The Transformer architecture can be scaled up to a certain point, after which the entire structure becomes unstable and crashes due to its architectural limitations. You cannot scale a transformer architecture to 2 Quadrillion parameters. The Transformer architecture cannot be scaled up to the extent where we can achieve Artificial General Intelligence (AGI). We need to redesign the transformer architecture and create a new architecture that is more scalable and flexible in its design in order to achieve AGI.
@acctest7039
@acctest7039 Ай бұрын
Ai 3d waifu when
@BAAPUBhendi-dv4ho
@BAAPUBhendi-dv4ho Ай бұрын
I can't wait any longer
@garethjohnstone9282
@garethjohnstone9282 Ай бұрын
5 years - acting will be a thing of the past. Movies - tell AI to create anything you're in the mood for. Hell, with VR you can be IN an interactive movie.
@marv_9
@marv_9 Ай бұрын
Yes, I dream of being a voice actor. With AI, finally no more different voices when a human voice actor quits and is replaced by a bad voice. Stan on American Dad has a new voice in Germany since season 18, it's terrible. Tina from Bob's Burgers has the worst voice I've heard since season 12. In English, Rick has a different voice in Rick and Morty because the narrator and creator was fired. With AI, I hope that cartoons and animated series like this will keep the same voices forever. 🎉🎉
@garethjohnstone9282
@garethjohnstone9282 Ай бұрын
@@marv_9 You can't conceive of anything better?
@superpig5000
@superpig5000 29 күн бұрын
Oh yeah, that will happen. The only concern is we will lose What makes movies interesting, And that's talking to other people about it. Because we're only just gonna be generating stuff that interests us that nobody else will watch. Will kind of take the fun out eventually, Or could just be the end of socialization between humans.
@KelelaSB
@KelelaSB 15 күн бұрын
5:24 “... hard to tell the difference.” For me the dead giveaway is the teeth. The bottom ones don't follow the jaw movement precisely, and the top ones are sometimes stretchy.
@heartshinemusic
@heartshinemusic Ай бұрын
Looks impressive, still think the EMO lip-sync (singing) thing was a bit better. What I don't understand is why both platforms use square aspect ratios to showcase their A.I. lip-sync avatars.
@azhuransmx126
@azhuransmx126 Ай бұрын
Why don't they (Alibaba and Microsoft) just release this amazing technology to the public just to animate semi-realistic avatars from comics, video games and anime instead of trying to animate photos of hyper-realistic people, which apart from being a dangerous technology in those cases it is also a lot creepier???? Cannot they see they are losing the middle of the business closing it in a 100%??🤦‍♂️
@goldiegolderman1842
@goldiegolderman1842 Ай бұрын
*THE INTELLIGENCE AGENCIES AND THE MILITARY HAVE HAD THIS FOR DECADES*
@davidgilpin5200
@davidgilpin5200 Ай бұрын
This will be a boon for gaming, especially cut scenes. But I see legal problems on the horizon, especially with likenesses pulled from art that are suddenly - and terrifyingly - talking.
@labsquadmedia176
@labsquadmedia176 24 күн бұрын
It's interesting at around 13:45 that the website's text-happy to trumpet its unique and trend-setting features over competitors-uses the phrase "like other related content generation techniques" to hide in the group when the fruit of its technique could be seen as harmful. I wonder too, why the copy uses "technique" over the word "technology" in the caveat? Is a technique harder to indict than a technology?
@The_Spooky_Boi
@The_Spooky_Boi Ай бұрын
Oh god, we're cooked Im even more scared to show my face on the Internet
@theAIsearch
@theAIsearch Ай бұрын
that's why this channel is faceless 😉
@The_Spooky_Boi
@The_Spooky_Boi Ай бұрын
@@theAIsearch fr
@LunarTikOfficial
@LunarTikOfficial Ай бұрын
*Yup.. Society is doomed..*
@HarvickOne
@HarvickOne Ай бұрын
this is gonna be so good if used for games
@AI_Image_Master
@AI_Image_Master Ай бұрын
You can debate all the ethics of this, but one thing that I find interesting is that all of these technologies can be used to eventually bring back an actor from the past that is long gone and have them act in a movie. We are almost getting to the point we this can be done in a realistic way. I find the possibilities of that interesting.
@duncanwierman
@duncanwierman Ай бұрын
It looks fake
@deepdiver849
@deepdiver849 Ай бұрын
Don’t worry it will improve
@MikeSeuss
@MikeSeuss Ай бұрын
Yes it does, however for being able to create this from only one still image is pretty incredible. Also, this just just the very beginning. Give it time and we’ll all be blown away.
@TP-yy3zx
@TP-yy3zx Ай бұрын
Uncle Ted was right, I'm moving into the woods.
@krybtix_3647
@krybtix_3647 Ай бұрын
hey man, you content is absolutely amazing but could you try making the videos a bit shorter in length (
@bamboozooka-yk7qn
@bamboozooka-yk7qn Ай бұрын
so when ai faces tell you to mask up, do you?
@blackwolfthedragonmaster
@blackwolfthedragonmaster 28 күн бұрын
Are these actually easier to lip read correctly? They look like it but I don't lip read.
@JustWasted3HoursHere
@JustWasted3HoursHere Ай бұрын
People are going to have to start wearing their own personal dash-cams at all times to record their daily experiences - encrypted in a cloud blockchain in realtime - so that if someone makes one of these videos implicating them in a poor light or a crime etc, they have proof that it wasn't them.
@historio8077
@historio8077 Ай бұрын
I wander for what purposes will it be used for.
@tandumm
@tandumm Ай бұрын
Wow! Forget the mouth on this. The eyes and eye brows are what is uncanny. The way it very very convincingly mimics the appropriate emotion to match the audio is, well, terrifying tbh
@HyperHrishiHD
@HyperHrishiHD Ай бұрын
This is just barely uncanny. Scared for the future 😮
@clqudy4750
@clqudy4750 Ай бұрын
Awesome! One step closer to not needing people anymore! Hallelujah!
@BeckVMH
@BeckVMH Ай бұрын
Obviously, this technology is in its infancy. It’s mind boggling at this point it’s so realistic. Our minds can’t even imagine the types of abuse that are possible. And we also know there is a segment of society that would not hesitate to use this tool to take advantage of others or even alter world events.
@4saken404
@4saken404 Ай бұрын
I'm so glad they actually had the sense to hit the brakes on this. This technology is advancing way faster than we can even consider the consequences of it. Chat GPT for example hasn't even been out two years yet it's _already_ disrupted society on several levels.
@InspiredByBrad
@InspiredByBrad Ай бұрын
This clearly shows the clue that all of existence itself is a type of advanced simulation, and therefore, there is no need to worry or take it too seriously!
@seanys
@seanys Ай бұрын
The sizes of a person’s head generally doesn’t change size while they’re speaking. Also, their hair doesn’t visibly grow and shrink. Not bad, though.
@arielconti3371
@arielconti3371 29 күн бұрын
When the head tilts, the direction of the eyes do not change. That is unnatural and a good way to tell if the video is fake. Also, there are no eye darts. If there are no eye darts, it’s probably fake.
@blackwolfthedragonmaster
@blackwolfthedragonmaster 28 күн бұрын
They sure could have used this in all the scenes in Madame Web where the re-recorded voice lines don't sync to the faces
@guillermozalles9303
@guillermozalles9303 23 күн бұрын
What would the practical aplication of this be?
@mackendw
@mackendw Ай бұрын
oh goodie...this should be the death knell for the borg when someone uses it to cause chaos. bring it on.
@D35TR0YM4N
@D35TR0YM4N Ай бұрын
The shifting and stretching teeth are unsettling.
@lethimwhoboasts
@lethimwhoboasts Ай бұрын
Impressive. Although if you look close, there's a lot of "rubber teeth".. Especially the lower row.
@rw9207
@rw9207 Ай бұрын
If it is a subscription based cloud service. The users details and a record of their work can be kept. So, if it's used for nefarious purposes, a record of evidence is maintained.
Free AI Audio Tools You Won't Believe Exist
17:22
Mike Russell
Рет қаралды 449 М.
INSANE OpenAI News: GPT-4o and your own AI partner
28:48
AI Search
Рет қаралды 771 М.
ТАМАЕВ vs ВЕНГАЛБИ. Самая Быстрая BMW M5 vs CLS 63
1:15:39
Асхаб Тамаев
Рет қаралды 3,8 МЛН
Balloon Stepping Challenge: Barry Policeman Vs  Herobrine and His Friends
00:28
I Challenged My AI Clone to Replace Me for 24 Hours | WSJ
7:34
The Wall Street Journal
Рет қаралды 1,3 МЛН
WHAT TO KNOW ABOUT VASA neural network for photo animation, overview of features
3:40
💬 SpeechGen - Realistic Text-to-Speech
Рет қаралды 1,9 М.
The BEST AI Video Generator you can use NOW!
28:00
AI Search
Рет қаралды 27 М.
AI Generated Videos Just Changed Forever
12:02
Marques Brownlee
Рет қаралды 8 МЛН
GPT-4o is WAY More Powerful than Open AI is Telling us...
28:18
MattVidPro AI
Рет қаралды 251 М.
Create Cinematic AI Videos for Free | Haiper AI Video Tutorial
15:35
Curious Refuge
Рет қаралды 222 М.
AI Just Changed Everything … Again
18:28
Undecided with Matt Ferrell
Рет қаралды 393 М.
Bardak ile Projektör Nasıl Yapılır?
0:19
Safak Novruz
Рет қаралды 6 МЛН
Iphone or nokia
0:15
rishton vines😇
Рет қаралды 1,7 МЛН
WWDC 2024 - June 10 | Apple
1:43:37
Apple
Рет қаралды 10 МЛН
Apple watch hidden camera
0:34
_vector_
Рет қаралды 61 МЛН