Why do Random Forest and Gradient Boosted Decision Trees have vastly different optimal max_depth?
Link
🔭 @DeepGravity
Even though both of them are made up of Decision Trees
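As a quick illustration of the usual answer (a sketch, not taken from the article): a Random Forest averages many independently grown, deep, low-bias trees, so bagging is what controls the variance; gradient boosting instead adds shallow, high-bias trees sequentially, each correcting the residuals of the ensemble so far, so small depths usually work best. The dataset and depths below are illustrative assumptions.

```python
# A minimal sketch, not from the article: compare deep trees in a Random Forest
# with shallow trees in gradient boosting on a toy dataset. The depths and
# dataset parameters are illustrative assumptions.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=5000, n_features=20, random_state=0)

# Random Forest: bagging averages out the variance of deep (even unpruned) trees.
rf = RandomForestClassifier(n_estimators=200, max_depth=None, random_state=0)

# Gradient boosting: each shallow tree corrects the residual errors of the
# ensemble so far, so a small max_depth (often 2-6) usually works best.
gb = GradientBoostingClassifier(n_estimators=200, max_depth=3, random_state=0)

for name, model in [("random forest", rf), ("gradient boosting", gb)]:
    score = cross_val_score(model, X, y, cv=5).mean()
    print(f"{name}: {score:.3f}")
```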
Local Policy Optimization for Trajectory-Centric Reinforcement Learning
The goal of this paper is to present a method for simultaneous trajectory and local stabilizing policy optimization to generate local policies for trajectory-centric model-based reinforcement learning (MBRL). This is motivated by the fact that global policy optimization for non-linear systems can be a very challenging problem, both algorithmically and numerically. However, many robotic manipulation tasks are trajectory-centric and thus do not require a global model or policy. Due to inaccuracies in the learned model estimates, an open-loop trajectory optimization process mostly results in very poor performance when used on the real system. Motivated by these problems, we formulate trajectory optimization and local policy synthesis as a single optimization problem, which is then solved simultaneously as an instance of nonlinear programming. We provide analysis of the proposed technique as well as the performance it achieves under some simplifying assumptions.
Paper
🔭 @DeepGravity
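As a rough illustration of the idea (a toy sketch under strong assumptions, not the paper's formulation): the nominal trajectory and the time-varying gains of a local linear policy can be stacked into one decision vector and optimized together as a single nonlinear program. The double-integrator dynamics, cost weights and the perturbation used to score the gains below are all made up for illustration.

```python
# A toy sketch of the general idea (not the paper's formulation): jointly optimize
# a nominal trajectory and time-varying linear feedback gains for a 1D double
# integrator, posed as a single nonlinear program in scipy.
import numpy as np
from scipy.optimize import minimize

T, dt = 15, 0.1
A = np.array([[1.0, dt], [0.0, 1.0]])            # simple known (or learned) dynamics
B = np.array([0.0, dt])
x0, x_goal = np.array([0.0, 0.0]), np.array([1.0, 0.0])

def unpack(z):
    xs = z[:2 * (T + 1)].reshape(T + 1, 2)       # nominal states
    us = z[2 * (T + 1):2 * (T + 1) + T]          # nominal controls
    Ks = z[2 * (T + 1) + T:].reshape(T, 2)       # local feedback gains
    return xs, us, Ks

def objective(z):
    xs, us, Ks = unpack(z)
    nominal = np.sum((xs - x_goal) ** 2) + 0.1 * np.sum(us ** 2)
    # Roll out the local policy u = u_t + K_t (x - x_t) from a perturbed start and
    # penalize deviation from the nominal trajectory: this couples the gains to the
    # trajectory, which is the point of solving both in one program.
    x = x0 + np.array([0.05, 0.0])
    robust = 0.0
    for t in range(T):
        u = us[t] + Ks[t] @ (x - xs[t])
        x = A @ x + B * u
        robust += np.sum((x - xs[t + 1]) ** 2)
    return nominal + robust + 1e-3 * np.sum(Ks ** 2)

def dynamics(z):                                  # equality constraints
    xs, us, Ks = unpack(z)
    res = [xs[0] - x0]
    res += [xs[t + 1] - (A @ xs[t] + B * us[t]) for t in range(T)]
    return np.concatenate(res)

z0 = np.zeros(2 * (T + 1) + T + 2 * T)
sol = minimize(objective, z0, method="SLSQP",
               constraints={"type": "eq", "fun": dynamics})
xs, us, Ks = unpack(sol.x)
print(sol.success, xs[-1])                        # final state should move toward x_goal
```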
Fast is better than free: Revisiting adversarial training
Adversarial training, a method for learning robust deep networks, is typically assumed to be more expensive than traditional training due to the necessity of constructing adversarial examples via a first-order method like projected gradient descent (PGD). In this paper, we make the surprising discovery that it is possible to train empirically robust models using a much weaker and cheaper adversary, an approach that was previously believed to be ineffective, rendering the method no more costly than standard training in practice. Specifically, we show that adversarial training with the fast gradient sign method (FGSM), when combined with random initialization, is as effective as PGD-based training but has significantly lower cost. Furthermore, we show that FGSM adversarial training can be further accelerated by using standard techniques for efficient training of deep networks, allowing us to learn a robust CIFAR10 classifier with 45% robust accuracy to PGD attacks with ϵ=8/255 in 6 minutes, and a robust ImageNet classifier with 43% robust accuracy at ϵ=2/255 in 12 hours, in comparison to past work based on "free" adversarial training which took 10 and 50 hours to reach the same respective thresholds. Finally, we identify a failure mode referred to as "catastrophic overfitting" which may have caused previous attempts to use FGSM adversarial training to fail. All code for reproducing the experiments in this paper as well as pretrained model weights are at this https URL.
Paper
🔭 @DeepGravity
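A minimal PyTorch sketch of the recipe as described in the abstract: a single FGSM step taken from a random start inside the ϵ-ball, followed by a standard update on the perturbed batch. The step sizes, the [0, 1] input range and the omission of the paper's other speed-ups (cyclic learning rates, mixed precision) are assumptions on my part.

```python
# A minimal sketch of FGSM adversarial training with random initialization;
# eps, alpha and the [0, 1] input range are assumptions, not the paper's settings.
import torch
import torch.nn.functional as F

def fgsm_rs_epoch(model, loader, opt, eps=8 / 255, alpha=10 / 255, device="cuda"):
    model.train()
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        # Random initialization inside the eps-ball, then a single FGSM step.
        delta = torch.empty_like(x).uniform_(-eps, eps).requires_grad_(True)
        F.cross_entropy(model(x + delta), y).backward()
        delta = (delta + alpha * delta.grad.sign()).clamp(-eps, eps).detach()
        # Standard training step on the adversarial batch (inputs assumed in [0, 1]).
        opt.zero_grad()
        F.cross_entropy(model((x + delta).clamp(0.0, 1.0)), y).backward()
        opt.step()
```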
#lime
This project is about explaining what machine learning classifiers (or models) are doing. At the moment, we support explaining individual predictions for text classifiers or classifiers that act on tables (numpy arrays of numerical or categorical data) or images, with a package called lime (short for local interpretable model-agnostic explanations). Lime is based on the work presented in this paper (bibtex here for citation). Here is a link to the promo video:
Repo
🔭 @DeepGravity
GitHub - marcotcr/lime: Lime: Explaining the predictions of any machine learning classifier
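A short usage sketch of the package on tabular data; the dataset and classifier are placeholders, and the lime calls follow the documented LimeTabularExplainer API.

```python
# A small usage sketch with lime's tabular explainer; the dataset and classifier
# are placeholders chosen for illustration.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from lime.lime_tabular import LimeTabularExplainer

data = load_breast_cancer()
clf = RandomForestClassifier(random_state=0).fit(data.data, data.target)

explainer = LimeTabularExplainer(
    data.data,
    feature_names=list(data.feature_names),
    class_names=list(data.target_names),
    discretize_continuous=True,
)

# Explain one prediction: lime fits a sparse local linear surrogate around it.
exp = explainer.explain_instance(data.data[0], clf.predict_proba, num_features=5)
print(exp.as_list())   # (feature condition, weight) pairs from the local surrogate
```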
How a Kaggle Grandmaster cheated in $25,000 AI contest with hidden code – and was fired from dream SV job
Link
🔭 @DeepGravity
Pet adoption ML coder apologizes and says desire to be ranked #1 'compromised my judgement'
Microsoft open sources breakthrough optimizations for transformer inference on GPU and CPU
Link
🔭 @DeepGravity
Microsoft has open sourced enhanced versions of transformer inference optimizations into the ONNX Runtime and extended them to work on both GPU and CPU.
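A minimal sketch of the workflow the post is about, i.e. exporting a transformer to ONNX and serving it with ONNX Runtime; the model name, opset and file name are assumptions, and the transformer-specific graph optimizations the post announces are applied inside ONNX Runtime and its tooling rather than shown here.

```python
# A minimal sketch (not Microsoft's benchmark setup): export a transformer to
# ONNX with PyTorch and run it through ONNX Runtime on CPU or GPU.
import torch
import onnxruntime as ort
from transformers import AutoModel, AutoTokenizer

name = "bert-base-uncased"                      # assumed example model
tok = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name, return_dict=False).eval()

enc = tok("ONNX Runtime speeds up transformer inference.", return_tensors="pt")
torch.onnx.export(
    model,
    (enc["input_ids"], enc["attention_mask"]),
    "bert.onnx",
    input_names=["input_ids", "attention_mask"],
    output_names=["last_hidden_state", "pooler_output"],
    dynamic_axes={"input_ids": {0: "batch", 1: "seq"},
                  "attention_mask": {0: "batch", 1: "seq"}},
    opset_version=12,
)

# Swap in CUDAExecutionProvider to run the same graph on GPU.
sess = ort.InferenceSession("bert.onnx", providers=["CPUExecutionProvider"])
out = sess.run(None, {"input_ids": enc["input_ids"].numpy(),
                      "attention_mask": enc["attention_mask"].numpy()})
print(out[0].shape)                             # (1, seq_len, hidden_size)
```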
ML impossible: Train 1 billion samples in 5 minutes on your laptop using Vaex and Scikit-Learn
Link
🔭 @DeepGravity
Make your laptop feel like a supercomputer.
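The core pattern behind the article, sketched without Vaex itself: stream the data in chunks and feed each chunk to a scikit-learn estimator that supports partial_fit, so the full dataset never has to fit in memory. The synthetic chunk generator below stands in for Vaex's lazy, chunked evaluation; all sizes are made up.

```python
# A sketch of out-of-core training with scikit-learn's partial_fit. The chunk
# generator is a stand-in for Vaex's chunked evaluation of a large on-disk file.
import numpy as np
from sklearn.linear_model import SGDClassifier

def chunks(n_chunks=100, chunk_size=100_000, n_features=20, seed=0):
    rng = np.random.default_rng(seed)
    w = rng.normal(size=n_features)                  # hidden weights for toy labels
    for _ in range(n_chunks):
        X = rng.normal(size=(chunk_size, n_features))
        y = (X @ w + 0.1 * rng.normal(size=chunk_size) > 0).astype(int)
        yield X, y

clf = SGDClassifier(loss="log_loss")                 # use loss="log" on older scikit-learn
for X, y in chunks():
    clf.partial_fit(X, y, classes=np.array([0, 1]))  # never holds the full data in memory

X_val, y_val = next(chunks(n_chunks=1, seed=1))
print("held-out chunk accuracy:", clf.score(X_val, y_val))
```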
Fully hardware-implemented memristor convolutional neural network
Abstract
Memristor-enabled neuromorphic computing systems provide a fast and energy-efficient approach to training neural networks [1,2,3,4]. However, convolutional neural networks (CNNs)—one of the most important models for image recognition [5]—have not yet been fully hardware-implemented using memristor crossbars, which are cross-point arrays with a memristor device at each intersection. Moreover, achieving software-comparable results is highly challenging owing to the poor yield, large variation and other non-ideal characteristics of devices [6,7,8,9]. Here we report the fabrication of high-yield, high-performance and uniform memristor crossbar arrays for the implementation of CNNs, which integrate eight 2,048-cell memristor arrays to improve parallel-computing efficiency. In addition, we propose an effective hybrid-training method to adapt to device imperfections and improve the overall system performance. We built a five-layer memristor-based CNN to perform MNIST [10] image recognition, and achieved a high accuracy of more than 96 per cent. In addition to parallel convolutions using different kernels with shared inputs, replication of multiple identical kernels in memristor arrays was demonstrated for processing different inputs in parallel. The memristor-based CNN neuromorphic system has an energy efficiency more than two orders of magnitude greater than that of state-of-the-art graphics-processing units, and is shown to be scalable to larger networks, such as residual neural networks. Our results are expected to enable a viable memristor-based non-von Neumann hardware solution for deep neural networks and edge computing.
Paper
🔭 @DeepGravity
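If I read the hybrid-training idea correctly (ex-situ training in software, transfer of the convolution kernels to imperfect devices, then in-situ training of only the final fully connected layer to compensate), a software analogue looks like the sketch below. The noise model and the tiny CNN are illustrative assumptions, not the paper's memristor hardware.

```python
# A software analogue of the abstract's hybrid-training idea, under the assumption
# that it means: keep the transferred (and now imperfect) conv weights fixed and
# retrain only the last fully connected layer to compensate.
import torch
import torch.nn as nn

class TinyCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 8, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(8, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.fc = nn.Linear(16 * 7 * 7, 10)

    def forward(self, x):
        return self.fc(self.features(x).flatten(1))

model = TinyCNN()  # assume it was already trained in software

# "Transfer" to imperfect devices: perturb the conv weights with multiplicative
# noise standing in for conductance variation, then freeze them.
with torch.no_grad():
    for p in model.features.parameters():
        p.mul_(1 + 0.05 * torch.randn_like(p))
for p in model.features.parameters():
    p.requires_grad_(False)

# Hybrid step: only the final layer is trained (on hardware, this happens in situ).
opt = torch.optim.SGD(model.fc.parameters(), lr=0.01)
x, y = torch.randn(32, 1, 28, 28), torch.randint(0, 10, (32,))  # stand-in batch
loss = nn.functional.cross_entropy(model(x), y)
opt.zero_grad(); loss.backward(); opt.step()
```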