I know the basics of feedforward neural networks and how to train them with the backpropagation algorithm, but I'm looking for an algorithm I can use to train an ANN online with reinforcement learning.

For example, the cart-pole swing-up problem is one I'd like to solve with an ANN. In that case I don't know what should be done to control the pendulum; I only know how close I am to the ideal position. The ANN has to learn from reward and punishment, so supervised learning isn't an option.

Another situation is something like the snake game, where feedback is delayed and limited to goals and anti-goals rather than a continuous reward.

I can think of some algorithms for the first situation, such as hill-climbing or genetic algorithms, but I'm guessing both would be slow. They might also apply to the second scenario, but they would be incredibly slow there and not conducive to online learning.

My question is simple: is there a simple algorithm for training an artificial neural network with reinforcement learning? I'm mainly interested in real-time reward situations, but if an algorithm for goal-based situations is available, even better.

1 Answer

Best answer
Reinforcement learning refers to goal-oriented algorithms that learn how to attain a complex objective (goal), or how to maximize some quantity over many steps; for example, maximizing the points won in a game over many moves.

Some research papers on the topic:

Efficient Reinforcement Learning Through Evolving Neural Network Topologies (2002)
Reinforcement Learning Using Neural Networks, with Applications to Motor Control
Reinforcement Learning Neural Network To The Problem Of Autonomous Mobile Robot Obstacle Avoidance

And some code: Code examples for neural network reinforcement learning.

The papers above are the top search results on Google; you can find more on Google Scholar.

Hope this answer helps.
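As a concrete starting point for the goal-based case in the question, here is a minimal sketch of online Q-learning, one common reinforcement learning method. It is not taken from the papers above, and everything in it is an assumption made for illustration: the five-cell corridor environment, the `step`, `train` and `greedy_policy` helpers, and the constants. To keep it short and dependency-free, the "network" is a single linear layer over a one-hot state input, which makes it equivalent to a lookup table. With a real multi-layer network you would use the same TD error as the output-layer error signal and backpropagate it as usual.

```python
import random

N_STATES = 5          # corridor cells 0..4; cell 4 is the goal (terminal)
ACTIONS = (-1, +1)    # index 0 = step left, index 1 = step right
GAMMA = 0.9           # discount factor
ALPHA = 0.1           # learning rate


def step(state, action_index):
    """Hypothetical toy environment: reward 1.0 only on reaching the goal."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def q_values(weights, state):
    """Linear layer on a one-hot input: the output is just one weight row."""
    return weights[state]


def train(episodes=2000, seed=0):
    rng = random.Random(seed)
    # weights[s][a] plays the role of the network's output weights.
    weights = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Q-learning is off-policy, so a uniformly random behaviour
            # policy is enough to explore this tiny corridor.
            action = rng.randrange(2)
            next_state, reward, done = step(state, action)
            # TD target: reward plus the discounted best next value.
            if done:
                target = reward
            else:
                target = reward + GAMMA * max(q_values(weights, next_state))
            td_error = target - q_values(weights, state)[action]
            # Gradient step on the squared TD error; with a one-hot input
            # only the weight for (state, action) has a non-zero gradient.
            weights[state][action] += ALPHA * td_error
            state = next_state
    return weights


def greedy_policy(weights):
    """Best action index for every non-terminal state."""
    return [max(range(2), key=lambda a: weights[s][a])
            for s in range(N_STATES - 1)]


if __name__ == "__main__":
    learned = train()
    print(greedy_policy(learned))
```

The update happens after every single step, so learning is online, and the only feedback is the delayed goal reward, which the discounting propagates back to earlier states. A continuous-state task such as cart-pole needs a real network instead of one-hot weights, and in practice usually some extra stabilisation, so treat this as an outline of the idea rather than a ready solution.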
