A Novel Method for Function Smoothness in Neural Networks
Access rights
openAccess
publishedVersion
A1 Original article in a scientific journal
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for your own personal use. Commercial use is prohibited.
Authors
Date
Language
en
Pages
11
Series
IEEE Access, Volume 10, pp. 75354-75364
Abstract
Existing methods for enforcing function smoothness in neural networks have several limitations: they can make training sensitive to additional hyperparameters, restrict model capacity through their smoothness constraints, impose excessive smoothness even in regions without data, or enforce constraints that are not meaningful, and the smoothness measures they rely on can be computationally expensive. Moreover, Lipschitz continuity, one of the main notions of function smoothness, does not even imply differentiability in theory, let alone continuous differentiability, that is, smoothness. In this paper, we propose a method based on the theoretical definition of the derivative: it ensures that the derivative of the parametrized function tends toward its theoretical value, for the given network parameters, in the vicinity of training samples. The method changes the classifier and its training minimally and introduces no additional hyperparameters. The proposed method is shown to achieve a smoother function in the vicinity of both training and testing samples for all tested datasets, as measured by decreased values of the Frobenius norm of the Jacobian with respect to the inputs. Owing to the correlation between function smoothness and generalization, the method makes classifiers generalize better and achieve higher accuracy than default classifiers on Restricted ImageNet, CIFAR10 and MNIST. Owing to the correlation between function smoothness and adversarial robustness, the method makes classifiers with high-capacity architectures more robust to adversarial samples generated with the PGD attack than default classifiers on the Restricted ImageNet, CIFAR10, Fashion-MNIST and MNIST datasets.
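The abstract's smoothness metric, the Frobenius norm of the network's Jacobian with respect to its inputs, can be illustrated with a short sketch. The tiny two-layer NumPy network and the finite-difference estimator below are hypothetical stand-ins chosen for self-containment; they are not the paper's implementation, which the abstract does not detail.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def forward(x, W1, b1, W2, b2):
    # Hypothetical two-layer network standing in for a classifier
    return W2 @ relu(W1 @ x + b1) + b2

def jacobian_frobenius_norm(f, x, eps=1e-5):
    """Estimate ||J_f(x)||_F with central finite differences.

    Lower values indicate a smoother (less input-sensitive)
    function in the vicinity of the sample x, which is how the
    paper compares the proposed method against default training.
    """
    y0 = f(x)
    J = np.zeros((y0.size, x.size))
    for i in range(x.size):
        dx = np.zeros_like(x)
        dx[i] = eps
        # Column i of the Jacobian: partial derivatives w.r.t. x[i]
        J[:, i] = (f(x + dx) - f(x - dx)) / (2 * eps)
    return np.linalg.norm(J, "fro")

rng = np.random.default_rng(0)
W1, b1 = rng.standard_normal((8, 4)), rng.standard_normal(8)
W2, b2 = rng.standard_normal((3, 8)), rng.standard_normal(3)
x = rng.standard_normal(4)
# One non-negative scalar per sample; averaged over a dataset it
# gives a smoothness score to compare trained classifiers.
print(jacobian_frobenius_norm(lambda v: forward(v, W1, b1, W2, b2), x))
```

In an autograd framework the same quantity would normally be computed exactly from the model's input gradients rather than by finite differences; the estimator here only serves to make the metric concrete.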
Description
Publisher Copyright: © 2013 IEEE.
Other note
Citation
Lindqvist, B 2022, 'A Novel Method for Function Smoothness in Neural Networks', IEEE Access, vol. 10, pp. 75354-75364. https://doi.org/10.1109/ACCESS.2022.3189363