PAC learning with approximate predictors

Andrew Turner; Ata Kaban

doi:10.1007/s10994-023-06301-4

PAC learning with approximate predictors

Andrew Turner, Ata Kaban^*

^*Corresponding author for this work

Computer Science

Research output: Contribution to journal › Article › peer-review

12 Downloads (Pure)

Abstract

Approximate learning machines have become popular in the era of small devices, including quantised, factorised, hashed, or otherwise compressed predictors, and the quest to explain and guarantee good generalisation abilities for such methods has just begun. In this paper, we study the role of approximability in learning, both in the full precision and the approximated settings. We do this through a notion of sensitivity of predictors to the action of the approximation operator at hand. We prove upper bounds on the generalisation of such predictors, yielding the following main findings, for any PAC-learnable class and any given approximation operator: 1) We show that under mild conditions, approximable target concepts are learnable from a smaller labelled sample, provided sufficient unlabelled data; 2) We give algorithms that guarantee a good predictor whose approximation also enjoys the same generalisation guarantees; 3) We highlight natural examples of structure in the class of sensitivities, which reduce, and possibly even eliminate the otherwise abundant requirement of additional unlabelled data, and henceforth shed new light onto what makes one problem instance easier to learn than another. These results embed the scope of modern model-compression approaches into the general goal of statistical learning theory, which in return suggests appropriate algorithms through minimising uniform bounds.

Original language	English
Number of pages	40
Journal	Machine Learning
Early online date	8 Feb 2023
DOIs	https://doi.org/10.1007/s10994-023-06301-4
Publication status	E-pub ahead of print - 8 Feb 2023

Keywords

statistical learning
generalisation error bounds
model-compression
approximate learning algorithms

ASJC Scopus subject areas

Computer Science(all)
Artificial Intelligence

Access to Document

10.1007/s10994-023-06301-4Licence: Creative Commons: Attribution (CC BY)

TurnerA2023PAC-learningFinal published version, 2.35 MBLicence: Creative Commons: Attribution (CC BY)

FORGING: Fortuitous Geometries and Compressive Learning
Kaban, A.
Engineering & Physical Science Research Council
9/01/17 → 8/01/23
Project: Research Councils

Cite this

@article{87cabdaecf4e4cdc8cd75f65eaee0ef4,

title = "PAC learning with approximate predictors",

abstract = "Approximate learning machines have become popular in the era of small devices, including quantised, factorised, hashed, or otherwise compressed predictors, and the quest to explain and guarantee good generalisation abilities for such methods has just begun. In this paper, we study the role of approximability in learning, both in the full precision and the approximated settings. We do this through a notion of sensitivity of predictors to the action of the approximation operator at hand. We prove upper bounds on the generalisation of such predictors, yielding the following main findings, for any PAC-learnable class and any given approximation operator: 1) We show that under mild conditions, approximable target concepts are learnable from a smaller labelled sample, provided sufficient unlabelled data; 2) We give algorithms that guarantee a good predictor whose approximation also enjoys the same generalisation guarantees; 3) We highlight natural examples of structure in the class of sensitivities, which reduce, and possibly even eliminate the otherwise abundant requirement of additional unlabelled data, and henceforth shed new light onto what makes one problem instance easier to learn than another. These results embed the scope of modern model-compression approaches into the general goal of statistical learning theory, which in return suggests appropriate algorithms through minimising uniform bounds.",

keywords = "statistical learning, generalisation error bounds, model-compression, approximate learning algorithms",

author = "Andrew Turner and Ata Kaban",

year = "2023",

month = feb,

day = "8",

doi = "10.1007/s10994-023-06301-4",

language = "English",

journal = "Machine Learning",

issn = "0885-6125",

publisher = "Springer",

}

TY - JOUR

T1 - PAC learning with approximate predictors

AU - Turner, Andrew

AU - Kaban, Ata

PY - 2023/2/8

Y1 - 2023/2/8

N2 - Approximate learning machines have become popular in the era of small devices, including quantised, factorised, hashed, or otherwise compressed predictors, and the quest to explain and guarantee good generalisation abilities for such methods has just begun. In this paper, we study the role of approximability in learning, both in the full precision and the approximated settings. We do this through a notion of sensitivity of predictors to the action of the approximation operator at hand. We prove upper bounds on the generalisation of such predictors, yielding the following main findings, for any PAC-learnable class and any given approximation operator: 1) We show that under mild conditions, approximable target concepts are learnable from a smaller labelled sample, provided sufficient unlabelled data; 2) We give algorithms that guarantee a good predictor whose approximation also enjoys the same generalisation guarantees; 3) We highlight natural examples of structure in the class of sensitivities, which reduce, and possibly even eliminate the otherwise abundant requirement of additional unlabelled data, and henceforth shed new light onto what makes one problem instance easier to learn than another. These results embed the scope of modern model-compression approaches into the general goal of statistical learning theory, which in return suggests appropriate algorithms through minimising uniform bounds.

AB - Approximate learning machines have become popular in the era of small devices, including quantised, factorised, hashed, or otherwise compressed predictors, and the quest to explain and guarantee good generalisation abilities for such methods has just begun. In this paper, we study the role of approximability in learning, both in the full precision and the approximated settings. We do this through a notion of sensitivity of predictors to the action of the approximation operator at hand. We prove upper bounds on the generalisation of such predictors, yielding the following main findings, for any PAC-learnable class and any given approximation operator: 1) We show that under mild conditions, approximable target concepts are learnable from a smaller labelled sample, provided sufficient unlabelled data; 2) We give algorithms that guarantee a good predictor whose approximation also enjoys the same generalisation guarantees; 3) We highlight natural examples of structure in the class of sensitivities, which reduce, and possibly even eliminate the otherwise abundant requirement of additional unlabelled data, and henceforth shed new light onto what makes one problem instance easier to learn than another. These results embed the scope of modern model-compression approaches into the general goal of statistical learning theory, which in return suggests appropriate algorithms through minimising uniform bounds.

KW - statistical learning

KW - generalisation error bounds

KW - model-compression

KW - approximate learning algorithms

U2 - 10.1007/s10994-023-06301-4

DO - 10.1007/s10994-023-06301-4

M3 - Article

SN - 0885-6125

JO - Machine Learning

JF - Machine Learning

ER -

PAC learning with approximate predictors

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Fingerprint

Projects

FORGING: Fortuitous Geometries and Compressive Learning

Cite this