Mining of High-Utility Patterns in Big IoT-based Databases

Jimmy M. T. Wu, Shandong University of Science and Technology
Gautam Srivastava, Brandon University
Jerry C. Lin, Western Norway University of Applied Sciences
Youcef Djenouri, SINTEF Foundation for Scientific and Industrial Research
Min Wei, Shandong University of Science and Technology
Reza M. Parizi, Kennesaw State University
Mohammad S. Khan, East Tennessee State UniversityFollow

Document Type

Article

Publication Date

2-1-2021

Description

When focusing on the general area of data mining, high-utility itemset mining (HUIM) can be defined as an offset of frequent itemset mining (FIM). It is known to emphasize more factors critically, which gives HUIM its intrinsic edge. Due to the flourishing development of the IoT technique, the uncertainty patterns mining is also attractive. Potential high-utility itemset mining (PHUIM) is introduced to reveal valuable patterns in an uncertainty database. Unfortunately, even though the previous methods are all very effective and powerful to mine, the potential high-utility itemsets quickly. These algorithms are not specifically designed for a database with an enormous number of records. In the previous methods, uncertainty transaction datasets would be load in the memory ultimately. Usually, several pre-defined operators would be applied to modify the original dataset to reduce the seeking time for scanning the data. However, it is impracticable to apply the same way in a big-data dataset. In this work, a dataset is assumed to be too big to be loaded directly into memory and be duplicated or modified; then, a MapReduce framework is proposed that can be used to handle these types of situations. One of our main objectives is to attempt to reduce the frequency of dataset scans while still maximizing the parallelization of all processes. Through in-depth experimental results, the proposed Hadoop algorithm is shown to perform strongly to mine all of the potential high-utility itemsets in a big-data dataset and shows excellent performance in a Hadoop computing cluster.

Citation Information

Wu, Jimmy M. T.; Srivastava, Gautam; Lin, Jerry C.; Djenouri, Youcef; Wei, Min; Parizi, Reza M.; and Khan, Mohammad S.. 2021. Mining of High-Utility Patterns in Big IoT-based Databases. Mobile Networks and Applications. Vol.26(1). 216-233. https://doi.org/10.1007/s11036-020-01701-5 ISSN: 1383-469X

Digital Commons @ East Tennessee State University

ETSU Faculty Works

Mining of High-Utility Patterns in Big IoT-based Databases

Document Type

Publication Date

Description

Citation Information

Search

Browse All

Browse Faculty Works

Author Corner

Links

Digital Commons @ East Tennessee State University

ETSU Faculty Works

Mining of High-Utility Patterns in Big IoT-based Databases

Creator(s)

Document Type

Publication Date

Description

Citation Information

Share

Search

Browse All

Browse Faculty Works

Author Corner

Links