In 1977, Barto joined the College of Information and Computer Sciences at the University of Massachusetts Amherst as a postdoctoral research associate, was promoted to associate professor in 1982, and full professor in 1991. He was department chair from 2007 to 2011 and a core faculty member of the Neuroscience and Behavior program.[5]
During this time at UMass, Barto co-directed the Autonomous Learning Laboratory (initially the Adaptive Network Laboratory), which generated several key ideas in reinforcement learning.[5]Richard Sutton, with whom he co-authored the influential book Reinforcement Learning: An Introduction (MIT Press 1998; 2nd edition 2018),[5] was his PhD student.
Reinforcement learning
When Barto started at UMass, he joined a group of researchers trying to explore the behavior of neurons in the human brain as the basis for human intelligence, a concept that had been advanced by computer scientist A. Harry Klopf. Barto was joined by his doctoral student Sutton in using mathematics toward furthering the concept and using it as the basis for artificial intelligence. This concept became known as reinforcement learning and went on to becoming a key part of artificial intelligence techniques.[6]
Barto and Sutton used Markov decision processes (MDP) as the mathematical foundation to explain how agents (algorithmic entities) made decisions when in a stochastic or random environment, receiving rewards at the end of every action. Traditional MDP theory assumed the agents knew all information about the MDPs in their attempt toward maximizing their cumulative rewards. Barto and Sutton's reinforcement learning techniques allowed for both the environment and the rewards to be unknown, and thus allowed for these category of algorithms to be applied to a wide array of problems.[7]
Barto built a lab in UMass Amherst toward developing the ideas on reinforcement learning while Sutton returned to Canada. Reinforcement learning as a topic continued to develop in academic circles until one of its first major real world applications saw Google's AlphaGo program built on this concept defeating the then prevailing human champion.[6] Barto and Sutton have widely been credited and accepted as pioneers of modern reinforcement learning, with the technique itself being foundational to the modern AI boom.[8]
Barto published over one hundred papers or chapters in journals, books, and conference and workshop proceedings. He is co-author with Richard Sutton of the book Reinforcement Learning: An Introduction, MIT Press 1998 (2nd edition 2018), and co-editor with Jennie Si, Warren Powell, and Don Wunch II of the Handbook of Learning and Approximate Dynamic Programming, Wiley-IEEE Press, 2004.[9]
Barto was awarded the UMass Neurosciences Lifetime Achievement Award in 2019, the IEEE Neural Network Society Pioneer Award in 2004,[12] and the IJCAI Award for Research Excellence in 2017. His citation for the latter read: "Professor Barto is recognized for his groundbreaking and impactful research in both the theory and application of reinforcement learning."[2]
In 2025, he received the Turing Award from the Association for Computing Machinery together with his former doctoral student Richard S. Sutton for their work on reinforcement learning; the citation of the award read: "For developing the conceptual and algorithmic foundations of reinforcement learning."[6][13]
↑"Virtual History Interview". International Neural Network Society. January 7, 2022. Retrieved September 6, 2022.
123"Andrew G. Barto". University of Massachusetts Amherst. February 17, 2008. Archived from the original on November 28, 2020. Retrieved October 18, 2020.
↑"Barto elected IEEE fellow". University of Massachusetts Amherst. November 22, 2005. Archived from the original on December 3, 2019. Retrieved December 3, 2019.