TIE algorithm: A layer over clustering-based taxonomy generation for handling an evolving data

Rabia Irfan; Sharifullah Khan; Kashif Rajpoot; Ali Mustafa Qamar

doi:10.1631/FITEE.1700517

Research output: Contribution to journal › Article › peer-review

Abstract

A taxonomy is generated to effectively organize and access data that is large in volume, as a taxonomy is a way of representing the concepts that exist in data. It needs to evolve to reflect changes that occur continuously in the data. Existing automatic taxonomy generation techniques do not handle the evolution of data; therefore, their generated taxonomies do not truly represent the data. The evolution of data can be handled either by regenerating the taxonomy from scratch or by incrementally evolving the taxonomy whenever changes occur in the data. The former approach is not economical in terms of time and resources. The taxonomy incremental evolution (TIE) algorithm, proposed in this paper, is a novel attempt to handle evolving data. It serves as a layer over an existing clustering-based taxonomy generation technique and incrementally evolves an existing taxonomy. The algorithm was evaluated on scholarly articles selected from the computing domain. It was found that the algorithm evolves a taxonomy in a considerably shorter period of time, with better quality per unit time, compared to a taxonomy regenerated from scratch.
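
To make the contrast between regeneration and incremental evolution concrete, the following is a minimal sketch of a generic nearest-node update, not the TIE algorithm itself, whose details are not given in this record. It assumes a flat taxonomy whose nodes are bag-of-words term counts; the names `cosine`, `evolve`, and `threshold` and the `node-N` labels are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def evolve(taxonomy: dict[str, Counter], doc: str, threshold: float = 0.3) -> str:
    """Place one new document into an existing flat taxonomy, in place.

    `taxonomy` maps a node label to the summed term counts of its documents.
    The document joins the most similar node, or founds a new node when no
    existing node is similar enough. Existing nodes are never re-clustered,
    which is what distinguishes this from regenerating from scratch.
    (Illustrative assumption only; not the published TIE procedure.)
    """
    vec = Counter(doc.lower().split())
    best = max(taxonomy, key=lambda k: cosine(taxonomy[k], vec), default=None)
    if best is not None and cosine(taxonomy[best], vec) >= threshold:
        taxonomy[best] += vec  # fold the new document into the matched node
        return best
    label = f"node-{len(taxonomy)}"  # hypothetical naming scheme for new nodes
    taxonomy[label] = vec
    return label
```

Each call touches only the new document and the node summaries, so its cost grows with the number of nodes rather than with the size of the whole collection. That is the general reason an incremental layer can be cheaper than re-clustering everything.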

Original language	English
Pages (from-to)	763–782
Number of pages	20
Journal	Frontiers of Information Technology and Electronic Engineering
Volume	19
Issue number	6
DOIs	https://doi.org/10.1631/FITEE.1700517
Publication status	Published - Jun 2018

Keywords

Taxonomy
Clustering algorithms
Information science
Knowledge management
Machine learning

Access to Document

10.1631/FITEE.1700517

TIE_algorithm
This is a post-peer-review, pre-copyedit version of an article published in Frontiers of Information Technology & Electronic Engineering. The final authenticated version is available online at: http://dx.doi.org/10.1631/FITEE.1700517
Accepted author manuscript, 332 KB
Licence: None: All rights reserved

Cite this

@article{5c44bc040a12474f987a57710babb929,

title = "TIE algorithm: A layer over clustering-based taxonomy generation for handling an evolving data",

abstract = "A taxonomy is generated to effectively organize and access data that is large in volume, as a taxonomy is a way of representing the concepts that exist in data. It needs to evolve to reflect changes that occur continuously in the data. Existing automatic taxonomy generation techniques do not handle the evolution of data; therefore, their generated taxonomies do not truly represent the data. The evolution of data can be handled either by regenerating the taxonomy from scratch or by incrementally evolving the taxonomy whenever changes occur in the data. The former approach is not economical in terms of time and resources. The taxonomy incremental evolution (TIE) algorithm, proposed in this paper, is a novel attempt to handle evolving data. It serves as a layer over an existing clustering-based taxonomy generation technique and incrementally evolves an existing taxonomy. The algorithm was evaluated on scholarly articles selected from the computing domain. It was found that the algorithm evolves a taxonomy in a considerably shorter period of time, with better quality per unit time, compared to a taxonomy regenerated from scratch.",

keywords = "Taxonomy, Clustering algorithms, Information science, Knowledge management, Machine learning",

author = "Rabia Irfan and Sharifullah Khan and Kashif Rajpoot and Qamar, {Ali Mustafa}",

year = "2018",

month = jun,

doi = "10.1631/FITEE.1700517",

language = "English",

volume = "19",

pages = "763--782",

journal = "Frontiers of Information Technology and Electronic Engineering",

issn = "2095-9184",

publisher = "Springer Science + Business Media",

number = "6",

}

TY - JOUR

T1 - TIE algorithm: A layer over clustering-based taxonomy generation for handling an evolving data

AU - Irfan, Rabia

AU - Khan, Sharifullah

AU - Rajpoot, Kashif

AU - Qamar, Ali Mustafa

PY - 2018/6

Y1 - 2018/6

N2 - A taxonomy is generated to effectively organize and access data that is large in volume, as a taxonomy is a way of representing the concepts that exist in data. It needs to evolve to reflect changes that occur continuously in the data. Existing automatic taxonomy generation techniques do not handle the evolution of data; therefore, their generated taxonomies do not truly represent the data. The evolution of data can be handled either by regenerating the taxonomy from scratch or by incrementally evolving the taxonomy whenever changes occur in the data. The former approach is not economical in terms of time and resources. The taxonomy incremental evolution (TIE) algorithm, proposed in this paper, is a novel attempt to handle evolving data. It serves as a layer over an existing clustering-based taxonomy generation technique and incrementally evolves an existing taxonomy. The algorithm was evaluated on scholarly articles selected from the computing domain. It was found that the algorithm evolves a taxonomy in a considerably shorter period of time, with better quality per unit time, compared to a taxonomy regenerated from scratch.

AB - A taxonomy is generated to effectively organize and access data that is large in volume, as a taxonomy is a way of representing the concepts that exist in data. It needs to evolve to reflect changes that occur continuously in the data. Existing automatic taxonomy generation techniques do not handle the evolution of data; therefore, their generated taxonomies do not truly represent the data. The evolution of data can be handled either by regenerating the taxonomy from scratch or by incrementally evolving the taxonomy whenever changes occur in the data. The former approach is not economical in terms of time and resources. The taxonomy incremental evolution (TIE) algorithm, proposed in this paper, is a novel attempt to handle evolving data. It serves as a layer over an existing clustering-based taxonomy generation technique and incrementally evolves an existing taxonomy. The algorithm was evaluated on scholarly articles selected from the computing domain. It was found that the algorithm evolves a taxonomy in a considerably shorter period of time, with better quality per unit time, compared to a taxonomy regenerated from scratch.

KW - Taxonomy

KW - Clustering algorithms

KW - Information science

KW - Knowledge management

KW - Machine learning

U2 - 10.1631/FITEE.1700517

DO - 10.1631/FITEE.1700517

M3 - Article

SN - 2095-9184

VL - 19

SP - 763

EP - 782

JO - Frontiers of Information Technology and Electronic Engineering

JF - Frontiers of Information Technology and Electronic Engineering

IS - 6

ER -
