TY - GEN
T1 - Tighter guarantees for the compressive multi-layer perceptron
AU - Kaban, Ata
AU - Thummanusarn, Yamonporn
PY - 2018/11/22
Y1 - 2018/11/22
N2 - We are interested in theoretical guarantees for classic 2-layer feed-forward neural networks with sigmoidal activation functions, having inputs linearly compressed by random projection. Due to the rapid increase in the dimensionality of modern data sets, and the development of novel data acquisition devices in compressed sensing, a proper understanding of the obtainable guarantees is of much practical importance. We start by analysing previous work that attempted to derive a lower bound on the target dimension to ensure low distortion of the outputs under random projection, and we find a disagreement with empirically observed behaviour. We then give a new lower bound on the target dimension that, in contrast with previous work, does not depend on the number of hidden neurons, but only on the Frobenius norm of the first-layer weights; in addition, it holds for a much larger class of random projections. Numerical experiments agree with our findings. Furthermore, we are able to bound the generalisation error of the compressive network in terms of the error and the expected distortion of the optimal network in the original uncompressed class. These results mean that one can provably learn networks with an arbitrarily large number of hidden units from randomly compressed data, as long as there is sufficient regularity in the original learning problem, which our analysis rigorously quantifies.
AB - We are interested in theoretical guarantees for classic 2-layer feed-forward neural networks with sigmoidal activation functions, having inputs linearly compressed by random projection. Due to the rapid increase in the dimensionality of modern data sets, and the development of novel data acquisition devices in compressed sensing, a proper understanding of the obtainable guarantees is of much practical importance. We start by analysing previous work that attempted to derive a lower bound on the target dimension to ensure low distortion of the outputs under random projection, and we find a disagreement with empirically observed behaviour. We then give a new lower bound on the target dimension that, in contrast with previous work, does not depend on the number of hidden neurons, but only on the Frobenius norm of the first-layer weights; in addition, it holds for a much larger class of random projections. Numerical experiments agree with our findings. Furthermore, we are able to bound the generalisation error of the compressive network in terms of the error and the expected distortion of the optimal network in the original uncompressed class. These results mean that one can provably learn networks with an arbitrarily large number of hidden units from randomly compressed data, as long as there is sufficient regularity in the original learning problem, which our analysis rigorously quantifies.
KW - Error analysis
KW - Random projection
KW - Multi-layer perceptron
U2 - 10.1007/978-3-030-04070-3_30
DO - 10.1007/978-3-030-04070-3_30
M3 - Conference contribution
SN - 978-3-030-04069-7
T3 - Lecture Notes in Computer Science
SP - 388
EP - 400
BT - Theory and Practice of Natural Computing
A2 - Fagan, David
A2 - Martín-Vide, Carlos
A2 - O’Neill, Michael
A2 - Vega-Rodríguez, Miguel A.
PB - Springer
T2 - 7th International Conference on the Theory and Practice of Natural Computing (TPNC 2018)
Y2 - 12 December 2018 through 14 December 2018
ER -