
VOGUE: Try-On by StyleGAN Interpolation Optimization

* project page
* youtube
* paper
* demo

Abstract

Given an image of a target person and an image of another person wearing a garment, we automatically generate the target person in the given garment. At the core of our method is a pose-conditioned StyleGAN2 latent space interpolation, which seamlessly combines the areas of interest from each image, i.e., body shape, hair, and skin color are derived from the target person, while the garment with its folds, material properties, and shape comes from the garment image. By automatically optimizing for interpolation coefficients in the latent space and per layer, we can perform a seamless, yet true to source, merging of the garment and target person. Our algorithm allows for garments to deform according to the given body shape, while preserving pattern and material details. Experiments demonstrate state-of-the-art photo-realistic results at high resolution (512x512).
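A minimal sketch of per-layer latent interpolation, to make the idea in the abstract concrete. It is not the authors' implementation: the layer and channel counts, the names `interpolate_styles` and `coeffs`, and the hand-set coefficients are all assumptions for illustration. In the paper the coefficients are found by automatic optimization; here they are simply fixed.

```python
import numpy as np

def interpolate_styles(person_styles, garment_styles, coeffs):
    """Blend style codes per layer and channel.

    A coefficient of 0 keeps the person's code, 1 takes the garment's code.
    """
    q = np.clip(coeffs, 0.0, 1.0)
    return person_styles + q * (garment_styles - person_styles)

# Illustrative sizes only (assumed, not taken from the paper).
n_layers, n_channels = 16, 512
rng = np.random.default_rng(0)
person = rng.standard_normal((n_layers, n_channels))
garment = rng.standard_normal((n_layers, n_channels))

# Hypothetical hand-set coefficients; the split at layer 8 is arbitrary.
coeffs = np.zeros((n_layers, n_channels))
coeffs[8:] = 1.0

blended = interpolate_styles(person, garment, coeffs)
```

Keeping one coefficient per layer and channel is what lets a blend take some attributes from one image and the rest from the other, instead of averaging everything uniformly.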

#gan

159 views11:58

echoinside

https://twitter.com/advadnoun/status/1347807444190199808
DALL-E has not been released, but here is what gradient ascent with CLIP produces.

67 views12:36

echoinside

https://arxiv.org/pdf/2010.05334.pdf #gan

FreezeG
* github

Similar to the idea from here in its attempt to separate shape generation from rendering, but here the separation is far less explicit. (Found via Savva)

Inspired by the training footage of FreezeD transfer learning. This is a pseudo translation method because the input image should be projected to the learned latent space first, and then the projected vector is propagated again to generate the target image. Therefore, the performance is limited to the in-domain images of the original GAN. I used StyleGAN2 implementation, and below are some of the results I've got. By also fixing the latent vector of the early layers and manipulating the ones that are fed into the last layers, the rendering style can be controlled separately.

#gan

60 views12:58

echoinside

A bit of politics, but still. Will we really reach the stage where Telegram is the last bastion of free speech and all that?

62 views13:03

echoinside

Forwarded from Sci-Hub (Alexandra Elbakyan)

Did you know?

Yesterday Twitter banned not only Trump's account but also the Sci-Hub account. Ours was banned first; articles even appeared where people complained in the comments that the useful Sci-Hub had been banned while Trump could not be banned for years. A few hours later Trump's account was taken down.

The formal reason for the Sci-Hub ban on Twitter is copyright infringement, although for 9 years this bothered no one. At the time of the block the account had 183 thousand followers, and tweets gathered thousands of retweets and comments, 90% of them in support of Sci-Hub. And now everything earned by back-breaking labor is gone! Perhaps the hidden reason for blocking the project's account was the protests in the US and the intensified hunt for Russian spies against that backdrop. As is known, US authorities suspect me of working for the GRU, and a large poster of Lenin was pinned in the Sci-Hub Twitter account.

60 views13:03

echoinside

More experiments with CLIP:
https://twitter.com/quasimondo/status/1348194907626856449

Twitter

Mario Klingemann

Searching #StyleGAN2 for "This person looks like Shrek". I also just realized that I can use CLIP to constrain the search area by keeping the similarity close to the starting point of the search. https://t.co/XqmCjw5zIA

62 views23:31

echoinside

https://twitter.com/theshawwn/status/1347779418190536704

Twitter

Shawn Presser

CLIP is remarkably accurate at classifying anime. I made labels for every permutation of: "an anime drawing of someone with {long,short,} {light,dark,} {brown,blonde,red} hair and {light,dark} {blue,green,red} eyes {wearing cat ears, wearing a hat}" etc.…
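A small sketch of how such a label set could be enumerated. The option lists are copied from the visible part of the tweet only (it is cut off at "etc.…", so the real set may be larger), and the helper name `build_labels` is hypothetical. Scoring the labels against an image with CLIP is not shown.

```python
from itertools import product

# Options as they appear in the tweet; "" means the slot is left out.
HAIR_LENGTH = ["long", "short", ""]
HAIR_SHADE = ["light", "dark", ""]
HAIR_COLOR = ["brown", "blonde", "red"]
EYE_SHADE = ["light", "dark"]
EYE_COLOR = ["blue", "green", "red"]
ACCESSORY = ["wearing cat ears", "wearing a hat"]

def build_labels():
    """Return one text label per combination of the options above."""
    labels = []
    for length, shade, color, eye_shade, eye_color, accessory in product(
        HAIR_LENGTH, HAIR_SHADE, HAIR_COLOR, EYE_SHADE, EYE_COLOR, ACCESSORY
    ):
        parts = [
            "an anime drawing of someone with",
            length, shade, color, "hair and",
            eye_shade, eye_color, "eyes",
            accessory,
        ]
        # Skip empty slots so the label has no double spaces.
        labels.append(" ".join(p for p in parts if p))
    return labels

labels = build_labels()
```

Each label would then serve as a candidate caption, with the best-matching one taken as the classification.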

76 views23:34

echoinside

For Age estimation
https://twitter.com/metasemantic/status/1348113145609465856

Twitter

Travis Hoppe

How rude! This bot thinks you're old. New experiment with @OpenAI 's CLIP. Model consistently overestimates both real and human-estimated ages by about 15 years, and is apparently *really* rough for some 20-year-olds (WHY?) 1/4

60 viewsedited 23:36

echoinside

Forwarded from Метаверсище и ИИще

Computer vision and computer hearing.
A great idea: feed neural networks not only video but the whole acoustic "picture" as well, for a more accurate reconstruction of a 3D model of the space.
A kind of photo-audio-grammetry.
Researchers at Facebook have developed a neural network that uses visual and audio signals from a short video clip to reconstruct the plan of an entire floor. It can be used for visualizing spaces, planning routes, and developing architectural projects. During filming, various wild sounds are played, and their reflections are then recorded and taken into account.
https://habr.com/ru/news/t/536534/

60 views11:39

echoinside

InMoDeGAN: Interpretable Motion Decomposition Generative Adversarial Network for Video Generation

* project page
* github (coming soon, you know..)

In this work, we introduce an unconditional video generative model, InMoDeGAN, targeted to (a) generate high quality videos, as well as to (b) allow for interpretation of the latent space. For the latter, we place emphasis on interpreting and manipulating motion. Towards this, we decompose motion into semantic sub-spaces, which allow for control of generated samples. We design the architecture of InMoDeGAN-generator in accordance to proposed Linear Motion Decomposition, which carries the assumption that motion can be represented by a dictionary, with related vectors forming an orthogonal basis in the latent space. Each vector in the basis represents a semantic sub-space.
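A toy sketch of the linear decomposition assumption described above: motion as a weighted sum of dictionary vectors that form an orthogonal basis. This is not the InMoDeGAN code. The dictionary here is random and orthonormalized with QR rather than learned, and all function names and sizes are hypothetical.

```python
import numpy as np

def orthonormal_dictionary(dim, n_directions, seed=0):
    """Return n_directions orthonormal vectors of length dim, one per row."""
    rng = np.random.default_rng(seed)
    q, _ = np.linalg.qr(rng.standard_normal((dim, n_directions)))
    return q.T

def compose_motion(dictionary, coeffs):
    """Build a motion code as a linear combination of dictionary rows."""
    return coeffs @ dictionary

def decompose_motion(dictionary, motion):
    """Recover coefficients by projecting onto each orthonormal direction."""
    return dictionary @ motion

dictionary = orthonormal_dictionary(dim=16, n_directions=4)
coeffs = np.array([1.0, -2.0, 0.5, 0.0])
motion = compose_motion(dictionary, coeffs)
recovered = decompose_motion(dictionary, motion)
```

Because the directions are orthogonal, changing one coefficient moves the code along one direction without affecting the projections onto the others, which is what makes per-direction control possible.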

100 views20:28

echoinside

https://toonme.com/

69 views10:30

echoinside

Forwarded from Grisha Sotnikov

RepVGG: Making VGG-style ConvNets Great Again

Funny one

On ImageNet, RepVGG reaches over 80% top-1 accuracy, which is the first time for a plain model, to the best of our knowledge

https://arxiv.org/pdf/2101.03697.pdf
https://github.com/DingXiaoH/RepVGG

GitHub

GitHub - DingXiaoH/RepVGG: RepVGG: Making VGG-style ConvNets Great Again

RepVGG: Making VGG-style ConvNets Great Again. Contribute to DingXiaoH/RepVGG development by creating an account on GitHub.

60 views16:36

echoinside

Forwarded from Grisha Sotnikov

56 views16:36

echoinside

Forwarded from Just links

Meta Pseudo Labels
https://twitter.com/quocleix/status/1349443438698143744
https://arxiv.org/abs/2003.10580

Twitter

Quoc Le

Some nice improvement on ImageNet: 90% top-1 accuracy has been achieved :-) This result is possible by using Meta Pseudo Labels, a semi-supervised learning method, to train EfficientNet-L2. More details here: https://t.co/kiZzT4RNj7

61 views20:29

echoinside

Striding Toward the Minimum
(Source: The Batch)

When you’re training a deep learning model, it can take days for an optimization algorithm to minimize the loss function. A new approach could save time.
What’s new: Juntang Zhuang and colleagues at Yale, University of Illinois at Urbana-Champaign, and University of Central Florida proposed AdaBelief, a more efficient variation on the popular Adam optimizer.
Key insight: The popular optimization methods of stochastic gradient descent (SGD) and Adam sometimes take small steps, requiring more time to reach their destination, when they could take larger ones. Given a small learning rate and a point in a large, steep area of a loss function’s landscape, SGD takes small steps until the slope becomes steeper, while Adam’s steps become smaller as it progresses. In both scenarios, an ideal optimizer would predict that the slope is long and take larger steps.
How it works: AdaBelief adjusts its step size depending on the difference between the current gradient and the average of previous gradients.

Like Adam, AdaBelief moves along a function step by step and calculates an exponential moving average of the gradient, assigning exponentially smaller weights to previous gradients. Also like Adam, at each step, a steeper average gradient generally calls for a larger step size.
Unlike Adam, AdaBelief treats the weighted average as a prediction of the gradient at the next step. If the difference between the prediction and the actual gradient is small, the function’s steepness probably isn’t changing much, and AdaBelief takes a relatively larger step. Conversely, if the difference is large, the landscape is changing, and AdaBelief decreases the step size.
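The update described above can be sketched for a single scalar parameter as follows. This is a minimal illustration, not the authors' reference implementation: the function names, the hyperparameter defaults, and the toy quadratic are assumptions chosen for the example.

```python
import math

def adabelief_step(theta, grad, m, s, t,
                   lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One AdaBelief update for a scalar parameter (t starts at 1)."""
    # Exponential moving average of gradients: the predicted gradient.
    m = beta1 * m + (1 - beta1) * grad
    # Moving average of the squared gap between gradient and prediction.
    s = beta2 * s + (1 - beta2) * (grad - m) ** 2 + eps
    # Bias correction, as in Adam.
    m_hat = m / (1 - beta1 ** t)
    s_hat = s / (1 - beta2 ** t)
    # Small gap gives a small denominator and a larger step; large gap, smaller.
    theta = theta - lr * m_hat / (math.sqrt(s_hat) + eps)
    return theta, m, s

def minimize_quadratic(theta=3.0, steps=500, lr=0.05):
    """Toy check: minimize f(x) = x**2, whose gradient is 2x."""
    m, s = 0.0, 0.0
    for t in range(1, steps + 1):
        theta, m, s = adabelief_step(theta, 2 * theta, m, s, t, lr=lr)
    return theta
```

The only change relative to Adam is the second moving average: Adam tracks the squared gradient, while this sketch tracks the squared difference between the gradient and its moving average.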

Results: The authors provide videos showing that, in experiments on functions with known minimums, AdaBelief was faster than both Adam and SGD with momentum (as shown above). To demonstrate their method’s accuracy, they compared AdaBelief to SGD, Adam, and other adaptive optimizers on tasks including image classification, image generation, and language modeling. AdaBelief basically matched SGD’s accuracy and exceeded that of all other adaptive optimizers. For instance, on ImageNet, AdaBelief increased a ResNet18’s highest top-1 accuracy, or accuracy of its best prediction, to 70.08 percent, on par with SGD’s 70.23 percent and 2 percent better than the best adaptive optimizers.
Why it matters: Faster optimization means faster training, and that means more time to experiment with different models.
We’re thinking: The authors’ video demonstrations suggest that AdaBelief could be a valuable alternative to Adam. However, they don’t supply any numbers that would make for a precise speed comparison. We look forward to the authors of the Deep Learning Optimizer Benchmark Suite, who have evaluated over a dozen optimizers in various tasks, running AdaBelief through its paces.

The Batch | DeepLearning.AI | AI News & Insights

Weekly AI news for engineers, executives, and enthusiasts.

69 viewsedited 10:49

echoinside


65 views10:50

echoinside

Forwarded from Arxiv

- GAN Inversion: A Survey. (arXiv:2101.05278v1 [cs.CV])
http://arxiv.org/abs/2101.05278

61 views08:35

echoinside

Forwarded from эйай ньюз

AI bursts in where it was least expected. Neural networks can now see stereograms. ~~And you can't.~~

I even learned once that stereograms work through depth perception and the offset of the image between the two eyes, but let's agree that the probability of it being pure magic is non-zero.

Paper: https://arxiv.org/pdf/2012.15692.pdf

72 views09:30
