Deep Gravity

A Complete Guide To #Math And #Statistics For #DataScience

Link

Deep (Learning) Gravity, [02.01.20 09:39]
Asymmetric #GAN for Unpaired Image-to-image Translation

The unpaired image-to-image translation problem aims to model the mapping from one domain to another with unpaired training data. Current works like the well-acknowledged Cycle GAN provide a general solution for any two domains through modeling injective mappings with a symmetric structure. However, in situations where the two domains are asymmetric in complexity, i.e., the amount of information in the two domains differs, these approaches suffer from poor generation quality, mapping ambiguity, and model sensitivity. To address these issues, we propose Asymmetric GAN (AsymGAN) to adapt the asymmetric domains by introducing an auxiliary variable (aux) to learn the extra information for transferring from the information-poor domain to the information-rich domain, which improves the performance of state-of-the-art approaches in the following ways. First, aux better balances the information between the two domains, which benefits the quality of generation. Second, the imbalance of information commonly leads to mapping ambiguity; we are able to model one-to-many mappings by tuning aux, and furthermore, our aux is controllable. Third, the training of Cycle GAN can easily make the generator pair sensitive to small disturbances and variations, while our model decouples the ill-conditioned relevance of generators by injecting aux during training. We verify the effectiveness of our proposed method both qualitatively and quantitatively in the asymmetric setting of the label-photo task on the Cityscapes and Helen datasets, and show many applications of asymmetric image translations. In conclusion, our AsymGAN provides a better solution for unpaired image-to-image translation in asymmetric domains.

Paper

🔭 @DeepGravity

DZone

A Complete Guide To Math And Statistics For Data Science

In this article, we provide a comprehensive guide for individuals looking to get started with data science.

71 views, 08:40

Deep Gravity

The intuitions behind #Bayesian Optimization with Gaussian Processes

Link

🔭 @DeepGravity

www.mindfoundry.ai

The Intuitions Behind Bayesian Optimization

Why become a Bayesian Optimization expert? If you're a citizen data scientist it's important to understand the science behind machine learning automations.

55 views, 08:42

Deep Gravity

The Promise of Hierarchical #ReinforcementLearning

Article

🔭 @DeepGravity

The Gradient

The Promise of Hierarchical Reinforcement Learning

This idea of temporal abstraction, once incorporated into reinforcement learning (RL), converts it into *hierarchical* reinforcement learning (HRL).

52 views, 08:42

Deep Gravity

A Year in Review: AI in 2019

Link

🔭 @DeepGravity

A Year in Review: AI in 2019

2019 witnessed a surge in Algorithms, Data, Investments, Research papers, and much more. It was a fast-paced year with advances in research and development of new methods, services and frameworks.

54 views, 08:44

Deep Gravity

How to create #Tensorflow 2 Sequence Dataset from scratch

Link

🔭 @DeepGravity

Medium

How to create Tensorflow 2 Sequence Dataset from scratch

How to create your own Dataset for TensorFlow model training by extending the Sequence class

59 views, 08:45
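
The linked post is about extending the Keras Sequence class. The batch-indexing protocol such a subclass implements can be sketched without TensorFlow installed. This is only an illustration under assumptions: `WindowBatches`, the toy `features` and `labels`, and the batch size are made up here and are not taken from the article. In a real project the class would inherit from `tf.keras.utils.Sequence` and return NumPy arrays rather than lists.

```python
import math


class WindowBatches:
    """Duck-typed stand-in for a Sequence subclass (hypothetical name).

    Assumption: with TensorFlow available, this would subclass
    tf.keras.utils.Sequence and return NumPy arrays instead of lists.
    """

    def __init__(self, features, labels, batch_size):
        self.features = features
        self.labels = labels
        self.batch_size = batch_size

    def __len__(self):
        # Number of batches per epoch; the last batch may be smaller.
        return math.ceil(len(self.features) / self.batch_size)

    def __getitem__(self, index):
        # Return batch number `index` as an (inputs, targets) pair.
        start = index * self.batch_size
        end = start + self.batch_size
        return self.features[start:end], self.labels[start:end]


# Toy data, invented for the example: 10 samples with 2 features each.
features = [[i, i + 1] for i in range(10)]
labels = [i % 2 for i in range(10)]
batches = WindowBatches(features, labels, batch_size=4)

for i in range(len(batches)):
    x, y = batches[i]
    print(i, x, y)
```

The two methods are the whole contract: `__len__` tells the training loop how many batches make up an epoch, and `__getitem__` builds one batch on demand, so the full dataset never has to be materialized at once.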

Deep Gravity

The Best #MachineLearning #Startups To Work For In 2020 Based On Glassdoor

Link

🔭 @DeepGravity

Forbes

The Best Machine Learning Startups To Work For In 2020 Based On Glassdoor

According to Indeed, Machine Learning Engineer job openings grew 344% between 2015 and 2018 and have an average base salary of $146,085.

58 views, 08:46

Deep Gravity

4 #NumPy Tricks Every #Python Beginner should Learn

Link

🔭 @DeepGravity

Medium

4 NumPy Tricks Every Python Beginner should Learn

Tricks for writing readable code

54 views, 08:47

Deep Gravity

Off-Policy Estimation of Long-Term Average Outcomes with Applications to Mobile Health

With the recent advancements in wearables and sensing technology, health scientists are increasingly developing mobile health (mHealth) interventions. In mHealth interventions, mobile devices are used to deliver treatment to individuals as they go about their daily lives, generally designed to impact a near-term, proximal outcome such as stress or physical activity. The mHealth intervention policies, often called Just-in-Time Adaptive Interventions, are decision rules that map a user's context to a particular treatment at each of many time points. The vast majority of current mHealth interventions deploy expert-derived policies. In this paper, we provide an approach for conducting inference about the performance of one or more such policies. In particular, we estimate the performance of an mHealth policy using historical data that are collected under a possibly different policy. Our measure of performance is the average of proximal outcomes (rewards) over a long time period should the particular mHealth policy be followed. We provide a semi-parametric efficient estimator as well as the confidence intervals. This work is motivated by HeartSteps, a mobile health physical activity intervention.

Paper

🔭 @DeepGravity

65 views, edited 09:02
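
The abstract above estimates how a policy would perform from data logged under a different policy. A much simpler stand-in for that idea is a self-normalized importance-sampling average, sketched below. It is not the paper's semi-parametric efficient estimator and it ignores how treatments change later contexts. The contexts, actions, rewards, logging probabilities and the `target_prob` policy are all invented for illustration.

```python
def target_prob(context, action):
    """Hypothetical target policy: probability of `action` given `context`."""
    p_send = 0.8 if context == "stressed" else 0.2
    return p_send if action == "send" else 1.0 - p_send


def weighted_average_reward(logs, policy):
    """Self-normalized importance-sampling estimate of the average reward.

    Each log entry is (context, action, reward, behavior_probability), where
    behavior_probability is the chance the logging policy had of picking
    that action in that context.
    """
    weights = [policy(c, a) / b_prob for c, a, _, b_prob in logs]
    weighted_rewards = [w * entry[2] for w, entry in zip(weights, logs)]
    return sum(weighted_rewards) / sum(weights)


# Made-up historical data collected under a uniform-random logging policy.
logs = [
    ("stressed", "send", 1.0, 0.5),
    ("stressed", "skip", 0.0, 0.5),
    ("calm", "send", 0.2, 0.5),
    ("calm", "skip", 0.4, 0.5),
]

estimate = weighted_average_reward(logs, target_prob)
print(round(estimate, 4))  # about 0.58 for this toy data
```

Each logged reward is reweighted by how much more (or less) likely the target policy was to take the logged action than the logging policy, so decisions the target policy favors count for more in the average.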

Deep Gravity

A New Framework for Query Efficient Active #ImitationLearning

We seek to align agent policy with human expert behavior in a #reinforcementlearning (#RL) setting, without any prior knowledge about dynamics, reward function, and unsafe states. There is a human expert who knows the rewards and unsafe states based on his preference and objective, but querying that human expert is expensive. To address this challenge, we propose a new framework for an imitation learning (IL) algorithm that actively and interactively learns a model of the user's reward function with efficient queries. We build an adversarial generative model of states and a successor representation (SR) model trained over transition experience collected by the learning policy. Our method uses these models to select state-action pairs, asking the user to comment on the optimality or safety, and trains an adversarial neural network to predict the rewards. Different from previous papers, which are almost all based on uncertainty sampling, the key idea is to actively and efficiently select state-action pairs from both on-policy and off-policy experience, by discriminating the queried (expert) and unqueried (generated) data and maximizing the efficiency of value function learning. We call this method adversarial reward query with successor representation. We evaluate the proposed method with a simulated human on a state-based 2D navigation task, robotic control tasks and image-based video games, which have high-dimensional observations and complex state dynamics. The results show that the proposed method significantly outperforms uncertainty-based methods on learning reward models, achieving better query efficiency, where the adversarial discriminator can make the agent learn human behavior more efficiently and the SR can select states which have a stronger impact on the value function. Moreover, the proposed method can also learn to avoid unsafe states when training the reward model.

Paper

🔭 @DeepGravity

61 views, 09:04

Deep Gravity

#MachineLearning from a Continuous Viewpoint

We present a continuous formulation of machine learning, as a problem in the calculus of variations and differential-integral equations, very much in the spirit of classical numerical analysis and statistical physics. We demonstrate that conventional machine learning models and algorithms, such as the random feature model, the shallow neural network model and the residual neural network model, can all be recovered as particular discretizations of different continuous formulations. We also present examples of new models, such as the flow-based random feature model, and new algorithms, such as the smoothed particle method and spectral method, that arise naturally from this continuous formulation. We discuss how the issues of generalization error and implicit regularization can be studied under this framework.

Paper

🔭 @DeepGravity

67 views, 09:05
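
One textbook instance of the "discretization of a continuous formulation" claim in the abstract above is the residual network read as a forward-Euler scheme for an ordinary differential equation. The notation below (layer index l, depth L, step size h, residual function f, parameters θ) is assumed for illustration and is not taken from the paper.

```latex
% Residual network with L layers and step size h = 1/L (assumed notation)
z_{l+1} = z_l + h\, f(z_l, \theta_l), \qquad l = 0, \dots, L-1

% Letting L \to \infty, this is the forward-Euler scheme for the flow
\frac{dz(\tau)}{d\tau} = f\bigl(z(\tau), \theta(\tau)\bigr),
\qquad \tau \in [0, 1], \quad z(0) = x
```

Read this way, the layer parameters become a function θ(τ) of continuous depth, and choosing a different numerical scheme for the same flow yields a different architecture.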

Deep Gravity

Using #ReinforcementLearning to Design a Better Rocket Engine

Article

🔭 @DeepGravity

Medium

Using Reinforcement Learning to Design a Better Rocket Engine

Applying ML techniques to the manufacturing industry and the crucial role ML Product Managers play.

73 views, 09:10

Deep Gravity

World Programs for Model-Based Learning and Planning in Compositional State and Action Spaces

Some of the most important tasks take place in environments which lack cheap and perfect simulators, thus hampering the application of model-free #reinforcementlearning (#RL). While model-based RL aims to learn a dynamics model, in a more general case the learner does not know a priori what the action space is. Here we propose a formalism where the learner induces a world program by learning a dynamics model and the actions in graph-based compositional environments by observing state-state transition examples. Then, the learner can perform RL with the world program as the simulator for complex planning tasks. We highlight a recent application, and propose a challenge for the community to assess world program-based planning.

Paper

🔭 @DeepGravity

79 views, 09:12

Deep Gravity

What is “The Art of Thinking Like a #DataScientist” Workbook and Why It Matters

Article

🔭 @DeepGravity

Datasciencecentral

What is “The Art of Thinking Like a Data Scientist” Workbook and Why It Matters

For over 3 decades, my passion has been to assist organizations leverage the business potential of data and analytics and help them envision where and how data…

90 views, edited 09:12

Deep Gravity

Can #ReinforcementLearning Break Through in 2020

Article

🔭 @DeepGravity

Datasciencecentral

Can Reinforcement Learning Break Through in 2020

Summary: Reinforcement Learning (RL) is going to be critical to achieving our AI/ML technology goals but it has several barriers to overcome. While reliabili…

143 views, 09:13

Deep Gravity

#AI

#Stephen_Hawking

🔭 @DeepGravity

207 views, 09:15

Deep Gravity

#Gartner Top 10 Strategic Technology Trends for 2020

YouTube

Article

🔭 @DeepGravity

YouTube

Gartner Top 10 Strategic Technology Trends for 2020

Blockchain, AI security, distributed cloud, hyperautomation, multiexperience, human augmentation democratization, traceability and transparency, the empowere...

154 views, edited 10:58

Deep Gravity

#ArtificialIntelligence in 2020

This video recaps developments in #AI from 2019!

YouTube

🔭 @DeepGravity

YouTube

Artificial Intelligence in 2020

This video recaps developments in AI from 2019! Happy New Year! Thanks for watching! Please Subscribe! Links from Video Below: The Lottery Ticket Hypothesis:...

126 views, edited 11:11

Deep Gravity

Top minds in #MachineLearning predict where #AI is going in 2020

Link

🔭 @DeepGravity

VentureBeat

Top minds in machine learning predict where AI is going in 2020

Google AI's Jeff Dean, PyTorch's Soumith Chintala, Nvidia ML's Anima Anandkumar, Kidd Lab's Celeste Kidd, and IBM Research's Dario Gil on the future of AI.

102 views, 11:11

Deep Gravity

Long-Term Visitation Value for Deep Exploration in Sparse Reward #ReinforcementLearning

Reinforcement learning with sparse rewards is still an open challenge. Classic methods rely on getting feedback via extrinsic rewards to train the agent, and in situations where this occurs very rarely the agent learns slowly or cannot learn at all. Similarly, if the agent also receives rewards that create suboptimal modes of the objective function, it will likely stop exploring prematurely. More recent methods add auxiliary intrinsic rewards to encourage exploration. However, auxiliary rewards lead to a non-stationary target for the Q-function. In this paper, we present a novel approach that (1) plans exploration actions far into the future by using a long-term visitation count, and (2) decouples exploration and exploitation by learning a separate function assessing the exploration value of the actions. Contrary to existing methods which use models of reward and dynamics, our approach is off-policy and model-free. We further propose new tabular environments for benchmarking exploration in reinforcement learning. Empirical results on classic and novel benchmarks show that the proposed approach outperforms existing methods in environments with sparse rewards, especially in the presence of rewards that create suboptimal modes of the objective function. Results also suggest that our approach scales gracefully with the size of the environment. Source code is available at https://github.com/sparisi/visit-value-explore

Paper

🔭 @DeepGravity

GitHub

GitHub - sparisi/visit-value-explore

Contribute to sparisi/visit-value-explore development by creating an account on GitHub.

107 views, 11:12
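
The abstract above values exploration over a long horizon instead of by the immediate count bonus alone. A toy tabular illustration of why that matters follows. It is not the authors' algorithm (their code is in the repository linked above): it runs value iteration with known dynamics, whereas the paper's method is model-free, and the five-state chain, the bonus formula and the visit counts are all assumptions made up for the example.

```python
import math

N_STATES = 5
LEFT, RIGHT = 0, 1
GAMMA = 0.9


def step(state, action):
    """Deterministic chain: move one cell left or right, clipped at both ends."""
    if action == RIGHT:
        return min(state + 1, N_STATES - 1)
    return max(state - 1, 0)


def bonus(count):
    """Count-based intrinsic reward: larger for rarely tried state-action pairs."""
    return 1.0 / math.sqrt(count + 1)


def exploration_values(counts, sweeps=500):
    """Value iteration on the bonus alone, giving a long-horizon exploration value W."""
    w = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(sweeps):
        w = [
            [bonus(counts[s][a]) + GAMMA * max(w[step(s, a)]) for a in (LEFT, RIGHT)]
            for s in range(N_STATES)
        ]
    return w


# Assumed visit counts: states 0-2 are well explored, states 3-4 never visited.
counts = [[10, 10], [10, 10], [10, 10], [0, 0], [0, 0]]
W = exploration_values(counts)

# In state 1 both actions have the same immediate bonus, yet W differs.
print(bonus(counts[1][LEFT]), bonus(counts[1][RIGHT]))
print(W[1][LEFT], W[1][RIGHT])
```

In state 1 the immediate bonuses tie, so a one-step count bonus cannot choose between the actions. The long-horizon value still prefers moving right, toward the unvisited end of the chain.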

Deep Gravity

Dive into Deep Learning

An interactive #DeepLearning #book with code, math, and discussions, based on the #NumPy interface.

Book

🔭 @DeepGravity

117 views, 11:14

Deep Gravity

#NeuralNetworks 201: All About #Autoencoders

Link

🔭 @DeepGravity

KDnuggets

Neural Networks 201: All About Autoencoders - KDnuggets

Autoencoders can be a very powerful tool for leveraging unlabeled data to solve a variety of problems, such as learning a "feature extractor" that helps build powerful classifiers, finding anomalies, or doing a Missing Value Imputation.
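
The anomaly-finding use mentioned in the preview above can be sketched with a deliberately tiny linear autoencoder in plain Python. This is only an illustration under assumptions: the 2D toy data, the one-dimensional code, the learning rate and the step count are invented here and do not come from the KDnuggets article, and a real model would use a deep learning library and nonlinear layers.

```python
def encode(w, x):
    """Compress a 2D point to a single code value."""
    return w[0] * x[0] + w[1] * x[1]


def decode(v, z):
    """Expand a code value back to a 2D point."""
    return (v[0] * z, v[1] * z)


def reconstruction_error(w, v, x):
    x_hat = decode(v, encode(w, x))
    return (x_hat[0] - x[0]) ** 2 + (x_hat[1] - x[1]) ** 2


def mean_error(w, v, data):
    return sum(reconstruction_error(w, v, x) for x in data) / len(data)


def train(data, lr=0.02, steps=2000):
    """Full-batch gradient descent on the mean squared reconstruction error."""
    w = [0.1, 0.1]  # encoder weights
    v = [0.1, 0.1]  # decoder weights
    n = len(data)
    for _ in range(steps):
        grad_w = [0.0, 0.0]
        grad_v = [0.0, 0.0]
        for x in data:
            z = encode(w, x)
            err = [v[0] * z - x[0], v[1] * z - x[1]]
            back = v[0] * err[0] + v[1] * err[1]  # error signal reaching the code
            for i in range(2):
                grad_v[i] += 2 * err[i] * z / n
                grad_w[i] += 2 * back * x[i] / n
        w = [w[i] - lr * grad_w[i] for i in range(2)]
        v = [v[i] - lr * grad_v[i] for i in range(2)]
    return w, v


# Unlabeled "normal" data, made up for the example: points on the line y = 2x.
normal_data = [(t, 2 * t) for t in (-1.0, -0.5, 0.0, 0.5, 1.0)]

initial_loss = mean_error([0.1, 0.1], [0.1, 0.1], normal_data)
w, v = train(normal_data)
final_loss = mean_error(w, v, normal_data)

normal_score = reconstruction_error(w, v, (0.5, 1.0))    # on the line
anomaly_score = reconstruction_error(w, v, (1.0, -2.0))  # off the line
print(initial_loss, final_loss, normal_score, anomaly_score)
```

No labels are used during training. The model only learns to reproduce points that look like the training data, so a point that breaks the learned pattern reconstructs badly, and that reconstruction error serves as the anomaly score.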