publication venue for Regularizing transformers with deep probabilistic layers. 161. 2023 Towards a mathematical framework to inform neural network modelling via polynomial regression. 142:57-72. 2021