A Model-Based Frequency Constraint for Mining Associations from Transaction Data

Computer Science – Databases

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

10.1007/s10618-005-0026-2

Mining frequent itemsets is a popular method for finding associated items in databases. For this method, support, the co-occurrence frequency of the items which form an association, is used as the primary indicator of the associations's significance. A single user-specified support threshold is used to decided if associations should be further investigated. Support has some known problems with rare items, favors shorter itemsets and sometimes produces misleading associations. In this paper we develop a novel model-based frequency constraint as an alternative to a single, user-specified minimum support. The constraint utilizes knowledge of the process generating transaction data by applying a simple stochastic mixture model (the NB model) which allows for transaction data's typically highly skewed item frequency distribution. A user-specified precision threshold is used together with the model to find local frequency thresholds for groups of itemsets. Based on the constraint we develop the notion of NB-frequent itemsets and adapt a mining algorithm to find all NB-frequent itemsets in a database. In experiments with publicly available transaction databases we show that the new constraint provides improvements over a single minimum support threshold and that the precision threshold is more robust and easier to set and interpret by the user.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

A Model-Based Frequency Constraint for Mining Associations from Transaction Data does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with A Model-Based Frequency Constraint for Mining Associations from Transaction Data, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Model-Based Frequency Constraint for Mining Associations from Transaction Data will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-596636

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.