Student: Piyush Lakhawat
Advisor: Arun Somani
Title: Exploiting Clustering for Enhanced Utility Itemset Mining
Abstract: Current Utility Itemset Mining (UIM) problem model lacks a key modelling capability of capturing cluster specific patterns in the dataset. Information in transactions fairly representative of a cluster type is more characteristic of them and should not be generalized over the entire data. Subjecting such transactions to the common threshold in the UIM problem leads to information loss. We identify that an implicit use of the cluster structure of data in the UIM problem model will address this limitation. We do this by introducing a new clustering based utility in the definition of the UIM problem model and modifying the definitions of absolute utilities based on it. This enhanced UIM problem model enables the cluster specific patterns to emerge while still mining the inter-cluster patterns and can integrate into all UIM techniques.