AI ML DL

Boximator examples
AI ML DL

A Text-To-Video Method That Actually Generates Some Action

Though there is currently a tidal wave of research into text-to-video systems, they tend to be criticized (if at all) because the videos are very short and/or very small. However, the biggest constriction on the current state of the art in T2V is that these videos have very little actual real movement. Now, a new system from ByteDance offers a novel box-based system capable of creating footage with significant motion, taking us a little nearer the goal of genuinely photorealistic pure neural video synthesis.

Read More »
Examples of DiffBody in action
AI ML DL

Reshaping the Human Body With CGI and Stable Diffusion

Driven by the interests of the fashion industry, new methods are emerging to automatically reshape and re-pose pictures of real people. It’s a difficult task, and yet another neural procedure that’s being aided by old-school CGI. A new method from Japan now claims to have obtained the state-of-the-art in neural body-shaping, based on just one picture of a person.

Read More »
AI ML DL

An Old Algorithm May Be Holding back New Generative AI Systems

Modern generative AI systems such as Stable Diffusion may be being held back by an out-dated but really popular loss algorithm called Fréchet Inception Distance (FID). Even though FID was only invented in 2019, it did not anticipate the growth of datasets or the advent of latent diffusion systems. New research from Google takes a critical look at FID and suggests a leaner and more modern alternative.

Read More »
AI ML DL

What’s The Difference between CGI and AI?

Generative AI promises, apparently, to replace traditional CGI techniques in Hollywood, if you believe the current punditry on the topic. In truth, for some time to come, it’s more likely that the two technologies will work in tandem to create a new standard in realism for visual effects workflows.

Read More »
AI ML DL

Solving the ‘Profile View’ Crisis in Facial Image Synthesis

Facial synthesis and deepfake systems are not very good at creating profile views of people, because there isn’t much training data of this type. Because of this, many security solutions have been built around profile (side-view) verification. However, a new and exhaustive profile view dataset could be a game-changer not only for VFX practitioners, but also for online scammers.

Read More »
AI ML DL

Using Traditional CGI to Re-Sculpt Generative AI

Editing AI images using only words is a hit-and-miss affair – but a new approach offers a novel take on this challenge, by converting a relevant part of an image into a CGI mesh and allowing the user to manipulate it with decades-old methods, before re-inserting the altered image back into the photo.

Read More »
AI ML DL

Colorization Is an Obstacle to Recreating Actors of the Past

If you want to bring Marilyn Monroe back with AI, you’re going to need to train a model on pictures of her. But you’re probably going to need her to look like she’s in 2023 and not 1960 – so the type of color photos available as references are going to be quite limited. Let’s take a look at the various problems involved in colorizing archival data for use in modern facial synthesis systems, and at one new paper that claims to have made a leap forward, with a variation of a secondary system for Stable Diffusion.

Read More »
AI ML DL

Controllable Deepfakes With Gaussian Avatars

Could Gaussian Splatting become the hottest new deepfake technology since 2017? The massive surge of interest from the research sector suggests it might – and the latest innovation not only brings full controllability to neural or deepfaked faces, but also lets you become someone else at an unprecedented level of photorealism and efficiency.

Read More »
AI ML DL

Badly-Compressed Images Affect CLIP’s Performance, New Research Contends

CLIP is the new darling of the computer vision research, and of image-based generative AI, with wide uptake of the image/text analysis framework across the sector. However, new research indicates that CLIP’s efficiency and usefulness is negatively affected by badly-compressed images. Though this should not be a problem in the modern high-speed broadband age, it is – because so much essential data and methodologies still in use data back several decades.

Read More »