Another Example of Support and Share |
We present a small example database in Table 1 to illustrate the support
and confidence measures. There are five transactions in the database, so
the support of each itemset is measured relative to 5.
|
|
|
A, C, D |
|
B, E |
|
A, B, C, E |
|
B, E |
|
B, D, F |
Table 2 shows the supports of the 6 items in the database. The first
column lists the item, the second lists the number of transactions in which
the item appears, and the third column lists the support of the item. Item
A in the first row, for example, appears in 2 transactions in Table 1 (transactions
T1 and T3). This represents 40% of the 5 total transactions.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Table 3 shows the support for some itemsets derived from the database.
The columns are analogous to those in Table 2. For example, the itemset
{A, B} in the second row appears in only one transaction, transaction T3,
which gives it a support of 20%.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Table 4 shows the confidence measures of several association rules derived
from the itemsets in Table 3. The confidence of 100% for the rule A®
C means that in every transaction in which A appears, C also appears. The
confidence of this rule can be calculated by dividing the number of transactions
in which the itemset {A, C} appears, which is 2 (see Table 3), by the number
of transactions in which the item A appears, also 2 (Table 2).
Association Rule | Confidence conf(X® Y) |
|
|
|
|
|
|
|
|
|
|
|
|
We illustrate the share measures using the database shown in Table 5.
This is the same database as shown in Table 1 except that each item has
a transaction count. For this database, the set of items I = {A,
B, C, D, E, F}, m = 6, and the set of transactions T = {T1,
T2, T3, T4, T5}, n
= 5.
Transaction ID |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Table 6 shows the transactions measure values for each item. From Table 6 we see that the set of transactions associated with the item A is TA = {T1, T3}. The global measure value for item A is:
MV(A) = tmv(A,T1) +tmv(A,T3) = 1+2 = 3.
The remaining global measure values for items B to F are listed in the rightmost column of Table 6. The total item count for the database is:
MV = MV(A) + ... + MV(F) = 20.
|
|
|
||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Table 7 lists several 2-itemsets from the example database, the transactions
which contain each itemset, the transaction measure values of each item
and the local measure value of each itemset. If the itemset X = {A,C}
(row 1 in the table), then X is associated with the set of transactions
TX
= {T1, T3} (see Table 5). The local count
of item A in the itemset X is = tmv(A,T1) +
tmv(A,T3) = 1+2 = 3 as shown in column 3. The local measure
value of the item C in the itemset X is lmv(C,X) =tmv(C,T1)
+ tmv(C,T3) = 2+2 = 4 (column 4). The local measure value
of the itemset X is therefore:
|
Contained in transactions |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|