

based on price will not only fail to motivate the convenience seekers, it runs the risk of steering them toward less profitable products when they would be happy to pay more.

This chapter describes how simple, single-campaign response models can be combined to create a best next offer model that matches campaigns to customers. Collaborative filtering, an approach to grouping customers into like-minded segments that may respond to similar offers, is discussed in Chapter 8.

Data Mining to Choose the Right Place to Advertise

One way of targeting prospects is to look for people who resemble current customers. For instance, through surveys, one nationwide publication determined that its readers have the following characteristics:

58 percent of readers are college educated.

46 percent have professional or executive occupations.

21 percent have household income in excess of $75,000/year.

7 percent have household income in excess of $100,000/year.

Understanding this profile helps the publication in two ways. First, by targeting prospects who match the profile, it can increase the rate of response to its own promotional efforts. Second, this well-educated, high-income readership can be used to sell advertising space in the publication to companies wishing to reach such an audience. Since the theme of this section is targeting prospects, let's look at how the publication used the profile to sharpen the focus of its prospecting efforts. The basic idea is simple. When the publication wishes to advertise on radio, it should look for stations whose listeners match the profile. When it wishes to place "take one" cards on store counters, it should do so in neighborhoods that match the profile. When it wishes to do outbound telemarketing, it should call people who match the profile. The data mining challenge was to come up with a good definition of what it means to match the profile.

Who Fits the Profile?

One way of determining whether a customer fits a profile is to measure the similarity, which we also call distance, between the customer and the profile. Several data mining techniques use this idea of measuring similarity as a distance. Memory-based reasoning, discussed in Chapter 8, is a technique for classifying records based on the classifications of known records that are in the same neighborhood. Automatic cluster detection, the subject of Chapter 11, is another data mining technique that depends on the ability to calculate a distance between two records in order to find clusters of similar records close to each other.

For this profiling example, the purpose is simply to define a distance metric to determine how well prospects fit the profile. The data consists of survey results that represent a snapshot of subscribers at a particular time. What sort of measure makes sense with this data? In particular, what should be done about the fact that the profile is expressed in terms of percentages (58 percent are college educated; 7 percent make over $100,000), whereas an individual either is or is not college educated and either does or does not make more than $100,000?

Consider two survey participants. Amy is college educated, earns $80,000/year, and is a professional. Bob is a high-school graduate earning $50,000/year. Which one is a better match to the readership profile? The answer depends on how the comparison is made. Table 4.1 shows one way to develop a score using only the profile and a simple distance metric.

This table calculates a score based on the proportion of the audience that agrees with each characteristic. For instance, because 58 percent of the readership is college educated, Amy gets a score of 0.58 for this characteristic. Bob, who did not graduate from college, gets a score of 0.42 because the other 42 percent of the readership presumably did not graduate from college. This is continued for each characteristic, and the scores are added together. Amy ends with a score of 2.18 and Bob with the higher score of 2.68. His higher score reflects the fact that he is more similar to the profile of current readers than is Amy.

Table 4.1 Calculating Fitness Scores for Individuals by Comparing Them along Each Demographic Measure

READERSHIP          YES SCORE   NO SCORE   AMY SCORE   BOB SCORE
College educated    0.58        0.42       0.58        0.42
Prof or exec        0.46        0.54       0.46        0.54
Income >$75K        0.21        0.79       0.21        0.79
Income >$100K       0.07        0.93       0.93        0.93
Total                                      2.18        2.68
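The arithmetic behind Table 4.1 can be sketched in a few lines of Python. This is only an illustration of the scoring rule described above, not code from the publication's project: the names `READERSHIP`, `naive_score`, `amy`, and `bob` are hypothetical, and each person is assumed to be encoded as a set of yes/no answers taken from the descriptions of Amy and Bob.

```python
# Share of readers who have each attribute (the YES SCORE column of Table 4.1).
READERSHIP = {
    "college_educated": 0.58,
    "prof_or_exec": 0.46,
    "income_over_75k": 0.21,
    "income_over_100k": 0.07,
}

def naive_score(person, readership=READERSHIP):
    """Add up, attribute by attribute, the share of readers who agree with the person."""
    total = 0.0
    for attribute, yes_share in readership.items():
        # A "yes" earns the share of readers who have the attribute;
        # a "no" earns the share of readers who do not.
        total += yes_share if person[attribute] else 1.0 - yes_share
    return total

# Assumed encoding of the two survey participants.
amy = {"college_educated": True, "prof_or_exec": True,
       "income_over_75k": True, "income_over_100k": False}
bob = {"college_educated": False, "prof_or_exec": False,
       "income_over_75k": False, "income_over_100k": False}

print(f"Amy: {naive_score(amy):.2f}")  # Amy: 2.18
print(f"Bob: {naive_score(bob):.2f}")  # Bob: 2.68
```

Because most readers answer "no" to three of the four questions, a person who answers "no" to everything collects the larger share each time, which is why Bob comes out ahead under this rule.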



The problem with this approach is that while Bob looks more like the profile than Amy does, Amy looks more like the audience the publication has targeted, namely college-educated, higher-income individuals. The success of this targeting is evident from a comparison of the readership profile with the demographic characteristics of the U.S. population as a whole. This suggests a less naive approach to measuring an individual's fit with the publication's audience: take into account the characteristics of the general population in addition to the characteristics of the readership. The approach measures the extent to which a prospect differs from the general population in the same ways that the readership does.

Compared to the population, the readership is better educated, more professional, and better paid. In Table 4.2, the Index columns compare the readership's characteristics to the entire population by dividing the percent of the readership that has a particular attribute by the percent of the population that has it. Now we see that the readership is almost three times as likely to be college educated as the population as a whole. Similarly, readers are only about half as likely not to be college educated. By using the indexes as scores for each characteristic, Amy gets a score of 8.42 (2.86 + 2.40 + 2.21 + 0.95) versus Bob with a score of only 3.02 (0.53 + 0.67 + 0.87 + 0.95). The scores based on indexes correspond much better with the publication's target audience. The new scores make more sense because they now incorporate the additional information about how the target audience differs from the U.S. population as a whole.

Table 4.2 Calculating Scores by Taking the Proportions in the Population into Account

                    YES                              NO
                    READERSHIP   US POP   INDEX      READERSHIP   US POP   INDEX
College educated    58%          20.3%    2.86       42%          79.7%    0.53
Prof or exec        46%          19.2%    2.40       54%          80.8%    0.67
Income >$75K        21%          9.5%     2.21       79%          90.5%    0.87
Income >$100K       7%           2.4%     2.92       93%          97.6%    0.95
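The index-based scoring of Table 4.2 can be sketched the same way. Again this is a minimal illustration under stated assumptions: `READERSHIP`, `US_POPULATION`, `index_score`, `amy`, and `bob` are hypothetical names, the readership shares are the survey figures quoted earlier, and the population shares are the US POP values from the table.

```python
# Share of readers with each attribute (from the readership survey).
READERSHIP = {
    "college_educated": 0.58,
    "prof_or_exec": 0.46,
    "income_over_75k": 0.21,
    "income_over_100k": 0.07,
}

# Share of the U.S. population with each attribute (US POP column of Table 4.2).
US_POPULATION = {
    "college_educated": 0.203,
    "prof_or_exec": 0.192,
    "income_over_75k": 0.095,
    "income_over_100k": 0.024,
}

def index_score(person, readership=READERSHIP, population=US_POPULATION):
    """Sum the readership-to-population index for each of the person's answers."""
    total = 0.0
    for attribute, reader_yes in readership.items():
        pop_yes = population[attribute]
        if person[attribute]:
            # Index for "yes": readers with the attribute vs. population with it.
            total += reader_yes / pop_yes
        else:
            # Index for "no": readers without the attribute vs. population without it.
            total += (1.0 - reader_yes) / (1.0 - pop_yes)
    return total

# Assumed encoding of the two survey participants.
amy = {"college_educated": True, "prof_or_exec": True,
       "income_over_75k": True, "income_over_100k": False}
bob = {"college_educated": False, "prof_or_exec": False,
       "income_over_75k": False, "income_over_100k": False}

print(f"Amy: {index_score(amy):.2f}")  # Amy: 8.42
print(f"Bob: {index_score(bob):.2f}")  # Bob: 3.02
```

The sketch divides unrounded proportions, so individual terms can differ slightly in the third decimal place from the rounded index values printed in the table, but the totals agree to two decimals.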
