An excellent overview of the different gradient descent algorithms, and a nice example of content that is available as both a responsive website and a PDF on arXiv: https://ruder.io/optimizing-gradient-descent/