Search-based diverse sampling from real-world software product lines

Yi Xiang, Han Huang, Yuren Zhou, Sizhe Li, Chuan Luo, Qingwei Lin, Miqing Li, Xiaowei Yang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

607 Downloads (Pure)

Abstract

Real-world software product lines (SPLs) often encompass enormous valid configurations that are impossible to enumerate. To understand properties of the space formed by all valid configurations, a feasible way is to select a small, valid and representative sample set. Even though a number of sampling strategies have been proposed, they either fail to produce diverse samples with respect to the number of selected features (an important property to characterize behaviors of configurations), or achieve diverse sampling but with limited scalability (the handleable configuration space size is limited to 1013). To resolve this dilemma, we propose a scalable diverse sampling strategy, which uses a distance metric in combination with the novelty search algorithm to produce diverse samples in an incremental way. The distance metric is carefully designed to measure similarities between configurations, and further diversity of a sample set. The novelty search incrementally improves diversity of samples through the search for novel configurations. We evaluate our sampling algorithm on 39 real-world SPLs. It is able to generate the required number of samples for all the SPLs, including those which can not be counted by sharpSAT, a state-of-the-art model counting solver. Moreover, it performs better than or at least competitively to some state-of-the-art samplers with respect to the diversity of the sample sets. Our results suggest that only the proposed sampler (among all tested ones) achieves scalable diverse sampling.
Original languageEnglish
Title of host publication2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)
PublisherIEEE
Pages1945-1957
Number of pages13
DOIs
Publication statusPublished - 20 Jun 2022
Event44th IEEE/ACM International Conference on Software Engineering: ICSE 2022 - Pittsburgh, United States
Duration: 8 May 202227 May 2022

Publication series

NameInternational Conference on Software Engineering. Proceedings
PublisherIEEE
ISSN (Print)0270-5257
ISSN (Electronic)1558-1225

Conference

Conference44th IEEE/ACM International Conference on Software Engineering
Country/TerritoryUnited States
CityPittsburgh
Period8/05/2227/05/22

Keywords

  • Software product lines
  • distance metric
  • diverse sampling
  • novelty search

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Search-based diverse sampling from real-world software product lines'. Together they form a unique fingerprint.

Cite this