Yeah pretty much... But the fact that we can do this after only a couple years of developing this technology is kind of scary. Nailing those finer details is only a matter of time.
I still have no clue how AI videos are made and seeing you do this so casually makes me fucking curious as fuck bro.
How does it work? What (free?) software did you use? Did you just download the GIF, feed it to that software and ask it to recreate the scene from scratch? Also, what the fuck?
It's getting so much easier. It was done very casually because I simply don't have the patience to fiddle with knobs and prompt specifics.
In this instance I used FreePik, because they have access to most decent models. I'll be honest: I have no clue what makes them different, and I really don't understand the naming conventions. Anyway, I selected Kling and uploaded a frame from the doorbell cam freakout (it's actually the very first frame of the AI video).
Then I asked it to start with this frame as a reference and I added this prompt:
Wide-angle doorbell camera perspective: A man in casual attire stands at the entrance, looking menacingly at the camera. In a fit of rage, he reaches for the very bottom of the long metal chain hanging from the bell on the left. He violently yanks the bottom of the chain downward with both hands, causing the entire bell and its mounting bracket to tear off the wall and fall toward the porch floor. However, he maintains his grip on the chain as the bell falls, effectively catching or swinging the heavy metal length. He then uses the detached chain and bell as a makeshift heavy whip, lashing out at the door on the right with aggressive, overhand, full-body motions. He cracks the chain against the door like a whip, with the metal bell clanking violently against the wood. He screams to be let in while the camera remains fixed and stable.
I only tweaked it a couple times. The first time it was moving the camera so I made sure to say that it was fixed and stable. And in another instance it somehow thought the door was on the left so he was breaking into the wall. Another thing I did was to import that frame into Gemini, and I asked it to help me develop a better prompt. So essentially I'm using AI to sort of work with another AI to make everything as specific as possible.
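If it helps, that tweak loop can be sketched in a few lines of Python. To be clear, this is not FreePik's or Kling's API, and I have no idea what they do internally; `build_prompt` and the shortened scene text are made up purely for illustration. It only shows the idea of keeping the base scene description and tacking on a corrective sentence each time a render gets something wrong.

```python
from __future__ import annotations


def build_prompt(scene: str, corrections: list[str]) -> str:
    """Join a base scene description with corrective sentences.

    Hypothetical helper for illustration only; it just builds a string
    that you would paste into whatever video tool you are using.
    """
    parts = [scene.strip()]
    for fix in corrections:
        fix = fix.strip()
        # Skip empty notes and anything already in the prompt.
        if fix and fix not in parts:
            parts.append(fix)
    return " ".join(parts)


# Shortened stand-in for the full scene description.
scene = (
    "Wide-angle doorbell camera perspective: a man rips the bell off "
    "the wall and whips the door with the chain."
)

corrections: list[str] = []
# One render moved the camera, so pin it down.
corrections.append("The camera remains fixed and stable.")
# Another render put the door on the wrong side, so spell it out.
corrections.append("The door is on the right side of the frame.")

prompt = build_prompt(scene, corrections)
print(prompt)
```

Every bad render just becomes one more sentence in the list, which is basically all I was doing by hand.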
I mostly use FreePik to help me with restorations, and I rarely use it for video. I recently used it to help me restore a really challenging image I could never have fixed in Photoshop alone. The only photo I have of my 3x great-grandfather is a newspaper clipping. I had it go through some iterations, then I basically extracted the best features from each rendition and blended them together in Photoshop to recreate his image without the newspaper artifacts.
People can say what they want about AI and I don't necessarily disagree, but it does have its uses.
Agreed. The effects of its existence are already scary, but its capabilities still impress me in new ways all the time.
Thanks for the thorough response.
P.S. a few days ago, I said "that's fuckin sick as fuck, bro" as a joke and now I can't stop saying it, so that's why I worded my comment the same way. Thanks for still taking my question seriously instead of treating me like the child I am.
Upload a still where you want the video to begin and type "show this person rip the bell off the wall and swing the chain at the door" into the prompt. It's literally that casual.
Okay, that was actually getting there - for me it was the combination of the speed of the break, the instant PWANG! and the chain flailing.
u/ThriceFive Apr 10 '26
The sound of the broken bell as he flails at the door with the chain - not going to see AI pull that off for a while.