I know about the Gradient Descent & Back-propagation Theorem. What I didn't get is: When and how to use Bias?
Ex - When mapping the AND function, when I use 2 inputs and 1 output, it does not give the correct weights, however, when I use 3 inputs (1 of which is a bias), it gives the correct weights.
Select the correct answer from above options