- Learning representations by back-propagating errors
Nature 1986
David Rumelhart, Geoffrey Hinton, Ronald Williams
https://www.iro.umontreal.ca/~vincentp/ift3395/lectures/backprop_old.pdf
- Object Recognition with Gradient-Based Learning
AT&T Shannon Lab 1999
Yann LeCun, Patrick Haffner, Leon Bottou, Yoshua Bengio
http://yann.lecun.com/exdb/publis/pdf/lecun-99.pdf
- Generative Adversarial Nets
Département d’informatique et de recherche opérationnelle, Université de Montréal
Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio
https://arxiv.org/pdf/1406.2661.pdf
- Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros
Berkeley AI Research (BAIR) Laboratory, UC Berkeley
https://arxiv.org/abs/1611.07004
- Reinforcement Learning: A Survey
Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore
Computer Science, Brown University & CMU (1996)
https://arxiv.org/pdf/cs/9605103.pdf
- Policy Gradient Methods for Reinforcement Learning with Function Approximation
Richard S. Sutton, David McAllester, Satinder Singh, Yishay Mansour
AT&T Labs - Research, 180 Park Avenue, Florham Park, NJ 07932
https://papers.nips.cc/paper/1713-policy-gradient-methods-for-reinforcement-learning-with-function-approximation.pdf
- Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller
DeepMind Technologies
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Publications_files/dqn.pdf
- Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih, Mehdi Mirza, Adria Badia, David Silver, Alex Graves, Tim Harley, Timothy Lillicrap, Koray Kavukcuoglu
DeepMind Technologies
https://arxiv.org/abs/1602.01783
- High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman, Philipp Moritz, Sergey Levine, Michael I. Jordan and Pieter Abbeel
Department of Electrical Engineering and Computer Science
University of California, Berkeley
https://arxiv.org/pdf/1506.02438.pdf
- (this week) Conditional Generative Adversarial Nets for Convolutional Face Generation
Jon Gauthier
Stanford University
http://www.foldl.me/uploads/papers/tr-cgans.pdf