## Performance comparison of gradient descent (GD) and stochastic gradient descent (SGD) on the MNIST dataset

Does SGD perform better than GD? Read on to see how the two methods compare on the MNIST dataset.

