Papers

ULNeF: Untangled Layered Neural Fields for Mix-and-Match Virtual Try-On

Recent advances in neural models have shown excellent results for virtual try-on (VTO) tasks, where a 3D representation of a garment is deformed to fit a target body shape. However, existing solutions are limited to a single layer of clothing and cannot handle the combinatorial complexity of mixing different types

Dmitry Spodarets
Papers

Scaling Instruction-Finetuned Language Models

An important goal of artificial intelligence is to develop language models that can generalize from data phrased as instructions to solve complex problems. Fine-tuning language models on a collection of datasets formulated as instructions improves model performance and generalization to unseen tasks. Google has presented its work to promote

Dmitry Spodarets
Papers

DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models

Diffusion models have recently gained enormous popularity thanks to their ability to generate high-quality, controllable images from text prompts written in natural language. However, generating images with the desired details is challenging because it requires users to write suitable prompts that specify the exact expected results. Developing such prompts

Dmitry Spodarets
Papers

eDiffi: Text-to-Image Diffusion Models with Ensemble of Expert Denoisers

eDiff-I is a next-generation generative AI content creation tool that offers unprecedented text-to-image synthesis, instant style transfer, and intuitive painting-with-words capabilities. This diffusion model for image synthesis from text is conditioned on T5 text embeddings, CLIP image embeddings, and CLIP text embeddings. This approach generates photorealistic images that

Dmitry Spodarets
Papers

Musika! Fast Infinite Waveform Music Generation

Fast, user-controlled music generation opens up new possibilities for composing and performing music. But today's music generation systems require large amounts of data and computing resources for training, and they are slow at generating output. This makes them impractical for real-time interactive use. Marco Pasini and Jan Schlüter's work, called Musika, is a music

Dmitry Spodarets
Papers

YOWO-Plus: An Incremental Improvement

Spatiotemporal Action Detection (STAD) is a fundamental task in video understanding. It aims to detect actions in the current input frame and is widely used, for example, in video surveillance and somatosensory games. The developers have made many changes to the design of YOWO to improve it. For

Dmitry Spodarets
Papers

AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling

In their new work, AutoAvatar, researchers have made implicit avatar modeling possible for the first time. It is an autoregressive approach for modeling dynamically deforming human bodies directly from raw scans. Animated 3D models of the human body are a key tool for applications ranging from virtual

Dmitry Spodarets
Papers

High Fidelity Neural Audio Compression

Research on audio hypercompression from Meta's Fundamental AI Research (FAIR) team shows how AI can be used to ensure that audio messages don't glitch or slow down when the Internet connection is poor. The researchers have created a three-part system and trained it to compress audio data to a given size. This

Dmitry Spodarets
Papers