Deep Gravity

Multi-Task #ReinforcementLearning without Interference

While deep reinforcement learning systems have demonstrated impressive results in domains ranging from game playing to robotic control, sample efficiency remains a major challenge, particularly as these algorithms learn individual tasks from scratch. Multi-task and goal-conditioned reinforcement learning have emerged as promising approaches for sharing structure across multiple tasks to enable more efficient learning. However, challenges in optimization have hamstrung such methods from realizing efficiency gains compared to learning tasks independently from scratch. Motivated by these challenges, we develop a general approach that can change the multi-task optimization landscape to alleviate conflicting gradients across tasks. In particular, we introduce two instantiations of this approach, one architectural and one algorithmic, that prevent gradients for different tasks from interfering with one another. On two challenging multi-task RL problems, we find that our approaches lead to greater final performance and learning efficiency in comparison to prior approaches.
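To make the idea of conflicting gradients concrete, here is a minimal sketch in plain Python. It assumes that two task gradients "conflict" when their dot product is negative, and it resolves the conflict by projecting one gradient onto the normal plane of the other. The abstract does not spell out the algorithm, so the projection rule and the names `dot` and `project_conflicting` are illustrative assumptions, not the authors' implementation.

```python
from typing import List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def project_conflicting(g_i: Vector, g_j: Vector) -> Vector:
    """Return g_i with its component along g_j removed when the two conflict.

    Assumption: "conflict" means a negative dot product. Gradients that do
    not conflict are returned unchanged.
    """
    overlap = dot(g_i, g_j)
    norm_sq = dot(g_j, g_j)
    if overlap >= 0.0 or norm_sq == 0.0:
        return list(g_i)
    scale = overlap / norm_sq
    return [x - scale * y for x, y in zip(g_i, g_j)]


# Two hypothetical task gradients that point in partly opposite directions.
g_task_a = [1.0, 0.0]
g_task_b = [-1.0, 1.0]

g_a_fixed = project_conflicting(g_task_a, g_task_b)
g_b_fixed = project_conflicting(g_task_b, g_task_a)

# Sum the de-conflicted gradients to form one shared update direction.
combined = [a + b for a, b in zip(g_a_fixed, g_b_fixed)]
```

After projection, each adjusted gradient is orthogonal to the other task's original gradient, so a step along `combined` no longer moves directly against either task.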

Paper

🔭 @DeepGravity

100 views · 20:02

Deep Gravity

Meta-gradient updates for training return functions for #ReinforcementLearning systems

Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for reinforcement learning. The embodiments described herein apply meta-learning (and in particular, meta-gradient reinforcement learning) to learn an optimum return function G so that the training of the system is improved. This provides a more effective and efficient means of training a reinforcement learning system as the system is able to converge on an optimum set of one or more policy parameters θ more quickly by training the return function G as it goes. In particular, the return function G is made dependent on the one or more policy parameters θ and a meta-objective function J′ is used that is differentiated with respect to the one or more return parameters η to improve the training of the return function G.
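As a rough illustration of a return function with trainable parameters, the sketch below treats the discount factor as the single return parameter η and computes both the return G and its derivative with respect to η. Choosing the discount as η, and the names `discounted_return` and `return_gradient`, are assumptions made for illustration; the abstract does not say which parameters η contains, and this shows only one ingredient of a meta-gradient update, not the full method.

```python
from typing import Sequence


def discounted_return(rewards: Sequence[float], eta: float) -> float:
    """G(eta) = sum_k eta**k * r_k, with eta playing the role of a discount."""
    return sum((eta ** k) * r for k, r in enumerate(rewards))


def return_gradient(rewards: Sequence[float], eta: float) -> float:
    """dG/d(eta) = sum_k k * eta**(k-1) * r_k (the k = 0 term vanishes)."""
    return sum(k * (eta ** (k - 1)) * r for k, r in enumerate(rewards) if k > 0)


rewards = [1.0, 0.0, 2.0]  # hypothetical reward sequence
eta = 0.5

g = discounted_return(rewards, eta)   # 1 + 0 + 0.25 * 2
dg = return_gradient(rewards, eta)    # 0 + 2 * 0.5 * 2

# Finite-difference check that the analytic derivative is consistent.
eps = 1e-6
dg_numeric = (discounted_return(rewards, eta + eps)
              - discounted_return(rewards, eta - eps)) / (2 * eps)
```

A meta-gradient method would chain a derivative like `dg` through the policy update in order to adjust η against the meta-objective J′; that step is omitted here.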

#Google
#DeepMind

Paper

🔭 @DeepGravity

103 views · 20:11

Deep Gravity

#DeepMind’s Dreamer #AI learns from the past to predict the future

Some AI systems achieve goals in challenging environments by drawing on representations of the world informed by past experiences. They generalize these to novel situations, enabling them to complete tasks even in settings they haven’t encountered before. As it turns out, reinforcement learning — a training technique that employs rewards to drive software policies toward goals — is particularly well-suited to learning world models that summarize an agent’s experience, and by extension to facilitating the learning of novel behaviors.

Article

🔭 @DeepGravity

VentureBeat

DeepMind’s Dreamer AI learns from the past to predict the future

In a new preprint research paper, researchers at DeepMind and Google propose Dreamer, an algorithm that learns to predict outcomes from experience.

101 views · 22:17

Deep Gravity

#Fun

🔭 @DeepGravity

720 views · 22:27

Deep Gravity

Improved Few-Shot Visual Classification, by Peyman Bateni,

Few-shot learning is a fundamental task in computer vision that carries the promise of alleviating the need for exhaustively labeled data. Most few-shot learning approaches to date have focused on progressively more complex neural feature extractors and classifier adaptation strategies, as well as the refinement of the task definition itself. In this paper, we explore the hypothesis that a simple class-covariance-based distance metric, namely the Mahalanobis distance, adopted into a state-of-the-art few-shot learning approach (CNAPS) can, in and of itself, lead to a significant performance improvement. We also discover that it is possible to learn adaptive feature extractors that allow useful estimation of the high-dimensional feature covariances required by this metric from surprisingly few samples. The result of our work is a new "Simple CNAPS" architecture which has up to 9.2% fewer trainable parameters than CNAPS and performs up to 6.1% better than state of the art on the standard few-shot image classification benchmark dataset.
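The role of a class-covariance-based metric can be sketched in a few lines of plain Python. The example assumes 2-D features, a per-class sample covariance, and an identity regulariser added so the matrix stays invertible with few samples; these choices and all function names are illustrative assumptions and do not reproduce the Simple CNAPS implementation.

```python
from typing import Dict, List, Optional, Tuple

Point = Tuple[float, float]
Matrix = Tuple[Tuple[float, float], Tuple[float, float]]


def mean(points: List[Point]) -> Point:
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def regularized_covariance(points: List[Point], reg: float = 1.0) -> Matrix:
    """Sample covariance of 2-D points plus reg * identity (assumed regulariser)."""
    mx, my = mean(points)
    denom = max(len(points) - 1, 1)
    sxx = sum((p[0] - mx) ** 2 for p in points) / denom
    syy = sum((p[1] - my) ** 2 for p in points) / denom
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / denom
    return ((sxx + reg, sxy), (sxy, syy + reg))


def mahalanobis_sq(x: Point, mu: Point, cov: Matrix) -> float:
    """Squared Mahalanobis distance (x - mu)^T cov^{-1} (x - mu) in 2-D."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    dx, dy = x[0] - mu[0], x[1] - mu[1]
    # Closed-form inverse of a 2x2 matrix applied to (dx, dy).
    return (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det


def classify(x: Point, support: Dict[str, List[Point]]) -> Optional[str]:
    """Assign x to the class with the smallest squared Mahalanobis distance."""
    best_label, best_dist = None, float("inf")
    for label, points in support.items():
        dist = mahalanobis_sq(x, mean(points), regularized_covariance(points))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


# Hypothetical support set: one class spread along x, one tightly clustered.
support = {
    "wide": [(0.0, 0.0), (4.0, 0.0), (-4.0, 0.0)],
    "tight": [(5.0, 0.0), (5.0, 1.0), (5.0, -1.0)],
}
query = (3.0, 0.0)
label = classify(query, support)
```

In this toy setup the query is closer to the "tight" class mean in Euclidean terms (distance 2 versus 3), yet the Mahalanobis distance assigns it to "wide", because that class varies widely along the x axis.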

Paper

🔭 @DeepGravity

96 views · 22:30

Deep Gravity

A very interesting paper by #Harvard University and #OpenAI

#DeepDoubleDescent: Where Bigger Models and More Data Hurt

Abstract: We show that a variety of modern deep learning tasks exhibit a “double-descent” phenomenon where, as we increase model size,…

A short explanation of this paper

#DeepDoubleDescent

YouTube

🔭 @DeepGravity

YouTube

Deep Double Descent

This video explores a new study on double descent evident in Deep Learning models such as CNNs, ResNets and Transformers. The double descent phenomenon is an...

87 views · 00:14

Deep Gravity

Various #datascience roles

🔭 @DeepGravity

86 views · 00:19

Deep Gravity

Animation: Visualizing #Moore’s Law in Action (1971-2019)

Link

🔭 @DeepGravity

Visual Capitalist

Visualizing Moore’s Law in Action (1971-2019)

Can the predictions from Moore's Law keep up with technological innovation spanning almost 50 years? Watch this stunning animation to find out.
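For a sense of scale, the arithmetic behind that question can be sketched as follows. The snippet assumes the commonly cited form of Moore's Law (a doubling roughly every two years) and a starting point of about 2,300 transistors for the Intel 4004 in 1971; both figures are assumptions brought in for illustration, not numbers taken from the animation.

```python
def moores_law_projection(start_count: int, start_year: int, end_year: int,
                          doubling_years: int = 2) -> int:
    """Project a transistor count forward, doubling every `doubling_years`."""
    doublings = (end_year - start_year) // doubling_years
    return start_count * 2 ** doublings


# 48 years at one doubling every two years gives 24 doublings.
projected = moores_law_projection(2300, 1971, 2019)
```

Under these assumptions the projection for 2019 lands in the tens of billions of transistors per chip.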

94 views · 00:41

Deep Gravity

🔭 @DeepGravity

101 views · 01:06

Deep Gravity

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta #ReinforcementLearning

Abstract: #Meta-reinforcement learning algorithms can enable robots to acquire new skills much more quickly, by leveraging prior experience to learn how to learn. However, much of the current research on meta-reinforcement learning focuses on task distributions that are very narrow. For example, a commonly used meta-reinforcement learning benchmark uses different running velocities for a simulated robot as different tasks. When policies are meta-trained on such narrow task distributions, they cannot possibly generalize to more quickly acquire entirely new tasks. Therefore, if the aim of these methods is to enable faster acquisition of entirely new behaviors, we must evaluate them on task distributions that are sufficiently broad to enable generalization to new behaviors. In this paper, we propose an open-source simulated benchmark for meta-reinforcement learning and multi-task learning consisting of 50 distinct robotic manipulation tasks. Our aim is to make it possible to develop algorithms that generalize to accelerate the acquisition of entirely new, held-out tasks. We evaluate 6 state-of-the-art meta-reinforcement learning and multi-task learning algorithms on these tasks. Surprisingly, while each task and its variations (e.g., with different object positions) can be learned with reasonable success, these algorithms struggle to learn with multiple tasks at the same time, even with as few as ten distinct training tasks. Our analysis and open-source environments pave the way for future research in multi-task learning and meta-learning that can enable meaningful generalization, thereby unlocking the full potential of these methods.

Paper

🔭 @DeepGravity

129 views · 01:08

Deep Gravity

Yoshua #Bengio, Revered Architect of #AI, Has Some Ideas About What to Build Next

Link

🔭 @DeepGravity

IEEE Spectrum: Technology, Engineering, and Science News

Yoshua Bengio, Revered Architect of AI, Has Some Ideas About What to Build Next

The Turing Award winner wants AI systems that can reason, plan, and imagine

141 views · 17:20

Deep Gravity

#Quantum Supremacy Using a Programmable Superconducting Processor

Link

🔭 @DeepGravity

research.google

Quantum Supremacy Using a Programmable Superconducting Processor

Posted by John Martinis, Chief Scientist Quantum Hardware and Sergio Boixo, Chief Scientist Quantum Computing Theory, Google AI Quantum Physicist...

154 views · 17:21

Deep Gravity

120 #AI Predictions For 2020

Link

🔭 @DeepGravity

136 views · 17:25

Deep Gravity

New #TensorFlow courses by #Coursera

#Course

Link

🔭 @DeepGravity

Coursera

TensorFlow: Data and Deployment | Coursera

Learn TensorFlow: Data and Deployment from ...

147 views · 18:23

Deep Gravity

Tune #Hyperparameters for Classification #MachineLearning Algorithms

The seven classification algorithms we will look at are as follows:

Logistic Regression
Ridge Classifier
K-Nearest Neighbors (KNN)
Support Vector Machine (SVM)
Bagged Decision Trees (Bagging)
Random Forest
Stochastic Gradient Boosting
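As a minimal sketch of what tuning a hyperparameter involves, the example below grid-searches the number of neighbors for a K-Nearest Neighbors classifier using cross-validated accuracy. It is hand-rolled in plain Python on a made-up toy dataset; the function names, the fold scheme, and the data are assumptions for illustration and do not reproduce the linked article's code or any library API.

```python
from collections import Counter
from typing import Dict, List, Sequence, Tuple

Sample = Tuple[Tuple[float, float], int]


def knn_predict(train: Sequence[Sample], x: Tuple[float, float], k: int) -> int:
    """Majority vote among the k training points closest to x."""
    def sq_dist(s: Sample) -> float:
        return (s[0][0] - x[0]) ** 2 + (s[0][1] - x[1]) ** 2
    nearest = sorted(train, key=sq_dist)[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def cross_val_accuracy(data: Sequence[Sample], k: int, folds: int) -> float:
    """Mean accuracy over `folds` interleaved train/validation splits."""
    scores = []
    for f in range(folds):
        val = [s for i, s in enumerate(data) if i % folds == f]
        train = [s for i, s in enumerate(data) if i % folds != f]
        hits = sum(knn_predict(train, x, k) == y for x, y in val)
        scores.append(hits / len(val))
    return sum(scores) / folds


def grid_search(data: Sequence[Sample], grid: List[int],
                folds: int = 3) -> Tuple[int, Dict[int, float]]:
    """Score every candidate k and return the best one with all scores."""
    results = {k: cross_val_accuracy(data, k, folds) for k in grid}
    best_k = max(results, key=results.get)
    return best_k, results


# Toy data: two clusters plus one deliberately mislabeled point (the last one).
data: List[Sample] = [
    ((0.0, 0.0), 0), ((5.0, 5.0), 1), ((1.0, 0.0), 0), ((6.0, 5.0), 1),
    ((0.0, 1.0), 0), ((5.0, 6.0), 1), ((1.0, 1.0), 0), ((6.0, 6.0), 1),
    ((0.5, 0.5), 0), ((5.5, 5.5), 1), ((1.0, 0.5), 0), ((6.0, 5.5), 1),
    ((0.6, 0.4), 1),
]

best_k, results = grid_search(data, grid=[1, 3, 5])
```

On this toy data a single neighbor is misled by the mislabeled point, so a larger neighborhood scores better. The same loop structure applies to the other listed algorithms, with each model's own hyperparameters in place of `k`.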

Article

🔭 @DeepGravity

556 views · 18:52

Deep Gravity

Code Faster in #Python with Intelligent Snippets

#Kite is a plugin for your IDE that uses machine learning to give you useful code completions for Python. Start coding faster today.

Kite

🔭 @DeepGravity

Code Faster with Kite

Kite is saying farewell

From 2014 to 2021, Kite was a startup using AI to help developers write code. We have stopped working on Kite, and are no longer supporting the Kite software. Thank you to everyone who used our product, and thank you to our team members and investors who…

472 views · 21:23

Deep Gravity

The Fundamentals of #Tensor Networks

Link

🔭 @DeepGravity

Tensors.net

Intro | Tensors.net

A brief introduction to tensor networks and their applications.

169 views · 20:50

Deep Gravity

#SelfDrivingCar Steering Angle Prediction Based on Image Recognition

Self-driving vehicles have expanded dramatically over the last few years. Udacity has released a dataset containing, among other data, a set of images with the steering angle captured during driving. The Udacity challenge aimed to predict steering angle based on only the provided images. We explore two different models to perform high-quality prediction of steering angles based on images using different deep learning techniques including Transfer Learning, 3D CNN, #LSTM and ResNet. If the Udacity challenge were still ongoing, both of our models would have placed in the top ten of all entries.

Paper

🔭 @DeepGravity

155 views · 20:53

Deep Gravity

#Speech2Face: Learning the Face Behind a Voice

How much can we infer about a person's looks from the way they speak? In this paper, we study the task of reconstructing a facial image of a person from a short audio recording of that person speaking. We design and train a deep neural network to perform this task using millions of natural videos of people speaking from Internet/YouTube. During training, our model learns audiovisual, voice-face correlations that allow it to produce images that capture various physical attributes of the speakers such as age, gender and ethnicity. This is done in a self-supervised manner, by utilizing the natural co-occurrence of faces and speech in Internet videos, without the need to model attributes explicitly. Our reconstructions, obtained directly from audio, reveal the correlations between faces and voices. We evaluate and numerically quantify how--and in what manner--our Speech2Face reconstructions from audio resemble the true face images of the speakers.

Paper

🔭 @DeepGravity

120 views · 20:55

Deep Gravity

Learning human objectives by evaluating hypothetical behaviours

TL;DR: We present a method for training #ReinforcementLearning agents from human feedback in the presence of unknown unsafe states.

#DeepMind

Link

🔭 @DeepGravity

112 views · 20:56

Deep Gravity

At #OpenAI, we’ve used the multiplayer video game #Dota 2 as a research platform for general-purpose AI systems. Our Dota 2 #AI, called OpenAI Five, learned by playing over 10,000 years of games against itself. It demonstrated the ability to achieve expert-level performance, learn human–AI cooperation, and operate at internet scale.

Link

🔭 @DeepGravity

382 views · 20:56
