Hybrid Fruit-Fly Optimization Algorithm with K-Means for Text Document Clustering

Venkatachalam, K.

Authors: Timea Bezdan, Catalin Stoean, Ahmed Al Naamany, Nebojša Bačanin Džakula, Tarik A. Rashid, Miodrag Živković, K. Venkatachalam

Journal: Mathematics

Volume, no.: 9, 16

ISSN: 2227-7390

DOI: 10.3390/math9161929

Pages: 1-19

Link: https://www.mdpi.com/2227-7390/9/16/1929

Abstract:

The rapid growth of the Internet produces massive amounts of text data. Because this data is large in volume and unstructured in format, extracting and analyzing relevant information is very challenging. Text document clustering is a text-mining process that partitions a set of text documents into mutually exclusive clusters such that documents within the same cluster are similar to each other, while documents from different clusters differ in content. One of the biggest challenges in text clustering is partitioning the collection of text data by measuring the relevance of the content in the documents. To address this issue, this work proposes a hybrid swarm intelligence algorithm, combining the fruit-fly optimization algorithm with the K-means algorithm, for text clustering. First, the hybrid fruit-fly optimization algorithm is tested on ten unconstrained CEC2019 benchmark functions. Next, the proposed method is evaluated on six standard benchmark text datasets. The experimental evaluation on both the unconstrained functions and the text documents indicates that the proposed approach is robust and superior to other state-of-the-art methods.

Keywords: machine learning; text document clustering; metaheuristic algorithms; fruit-fly optimization algorithm; K-means
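As a rough illustration of the idea summarized in the abstract, the sketch below clusters a handful of toy documents by combining a swarm-style search over centroid positions with K-means refinement. It is not the authors' hybrid algorithm: the TF-IDF weighting, the uniform random perturbation, the sum-of-squared-errors fitness, and every name and parameter (`tfidf`, `swarm_kmeans`, `flies`, `rounds`, `step`) are assumptions made only for this example.

```python
import math
import random
from collections import Counter

def tfidf(docs):
    """Turn whitespace-tokenized documents into L2-normalized TF-IDF vectors."""
    tokenized = [d.lower().split() for d in docs]
    vocab = sorted({t for doc in tokenized for t in doc})
    df = Counter(t for doc in tokenized for t in set(doc))
    n = len(docs)
    vectors = []
    for doc in tokenized:
        tf = Counter(doc)
        vec = [tf[t] * (math.log((1 + n) / (1 + df[t])) + 1) for t in vocab]
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vectors

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def assign(vectors, centroids):
    """Label each vector with the index of its nearest centroid."""
    return [min(range(len(centroids)), key=lambda j: sq_dist(v, centroids[j]))
            for v in vectors]

def sse(vectors, centroids):
    """Within-cluster sum of squared distances (lower is better)."""
    return sum(min(sq_dist(v, c) for c in centroids) for v in vectors)

def kmeans_step(vectors, centroids):
    """One K-means iteration; an empty cluster keeps its old centroid."""
    labels = assign(vectors, centroids)
    updated = []
    for j, old in enumerate(centroids):
        members = [v for v, lab in zip(vectors, labels) if lab == j]
        if members:
            updated.append([sum(col) / len(members) for col in zip(*members)])
        else:
            updated.append(old)
    return updated

def swarm_kmeans(vectors, k, flies=10, rounds=20, step=0.1, seed=0):
    """Simplified fruit-fly-style search: flies scatter around the swarm
    location, each is refined by one K-means step, and the swarm moves to
    the best fly whenever that fly improves the fitness."""
    rng = random.Random(seed)
    best = [list(v) for v in rng.sample(vectors, k)]  # initial swarm location
    best_fit = sse(vectors, best)
    for _ in range(rounds):
        swarm = []
        for _ in range(flies):
            cand = [[x + rng.uniform(-step, step) for x in c] for c in best]
            cand = kmeans_step(vectors, cand)
            swarm.append((sse(vectors, cand), cand))
        fit, cand = min(swarm, key=lambda item: item[0])
        if fit < best_fit:
            best, best_fit = cand, fit
    return assign(vectors, best), best_fit

docs = [
    "apple banana fruit", "banana fruit juice", "fruit apple juice",
    "engine car wheel", "car wheel road", "road engine car",
]
labels, fit = swarm_kmeans(tfidf(docs), k=2)
print(labels, round(fit, 4))
```

The fixed `seed` makes the run repeatable. On a real corpus, a sparse vector representation and a library K-means implementation would be a more practical starting point than these pure-Python lists.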

Attached files:

Timea Bezdan, Catalin Stoean, Ahmed Al Naamany, Nebojsa Bacanin Dzakula, Tarik A Rashid, Miodrag Zivkovic, K Venkatachalam. 2021 [8401].pdf (size: 551.41 KB, views: 533)