Date:
Friday, 24 March, 2023 - 15:00
Speakers: Alfio Borzi, University of Wuezburg
Hosted at: SISSA, International School of Advanced Studies, Trieste, Italy, room 137.
Abstract:
Neural networks are machine learning models for constructing universal function approximation algorithms.
The learning of these networks requires to iteratively solve large nonlinear optimization problems, and gradient methods
play a central role in this framework, which motivates the investigation of techniques for accelerating and robustifying these methods.
On the other hand, for some neural network architectures, the learning process can be interpreted as an optimal control problem
and, also in this case, efficient and robust solution techniques are required.
On the other hand, for some neural network architectures, the learning process can be interpreted as an optimal control problem
and, also in this case, efficient and robust solution techniques are required.
Category:
