[Zlib-devel] Some inflate_fast() statistics

Mark Adler madler at alumni.caltech.edu
Sat Nov 4 23:17:04 EST 2006


Steve and other interested nerds,

Here are some match length frequency counts from typical deflate  
data, generated using the infgen and pngdat utilities that I sent on  
this email list a while back, and standard Unix utilities, thusly:

     infgen < test.tar.gz | grep ^match | awk '{ print $2 }' | sort - 
n | uniq -c

or:

     pngdat < bigpngs/pdp11,70.png | infgen | grep ^match | awk  
'{ print $2 }' | sort -n | uniq -c

mark


"Typical" web pages stored in a tar file (the large number of 258  
matches is explained by the series of nulls in tar files):

gromit% infgen < test.tar.gz | grep ^match | awk '{ print $2 }' |  
sort -n | uniq -c
70632 3
96627 4
66200 5
50093 6
36953 7
29365 8
22928 9
20179 10
16400 11
13893 12
11261 13
8429 14
7812 15
5977 16
5508 17
4923 18
4209 19
3407 20
3562 21
3335 22
2909 23
2559 24
2589 25
2498 26
2275 27
2538 28
2101 29
2238 30
1547 31
1819 32
1258 33
1446 34
1407 35
1115 36
1017 37
1093 38
1046 39
1223 40
1215 41
1282 42
1274 43
977 44
1061 45
978 46
763 47
623 48
635 49
753 50
502 51
656 52
499 53
618 54
471 55
416 56
417 57
414 58
812 59
325 60
333 61
452 62
588 63
627 64
460 65
240 66
319 67
246 68
196 69
282 70
198 71
196 72
306 73
375 74
127 75
199 76
154 77
231 78
173 79
194 80
218 81
274 82
188 83
218 84
159 85
160 86
254 87
188 88
151 89
354 90
178 91
198 92
109 93
100 94
127 95
133 96
139 97
   88 98
112 99
179 100
202 101
105 102
113 103
   98 104
107 105
141 106
159 107
108 108
   93 109
   81 110
   48 111
   42 112
150 113
346 114
183 115
177 116
   96 117
   41 118
   87 119
   41 120
   67 121
104 122
   71 123
   97 124
   50 125
   90 126
   76 127
   61 128
   67 129
   63 130
   85 131
   50 132
140 133
102 134
   89 135
   89 136
   76 137
   59 138
   70 139
   92 140
   76 141
   60 142
   50 143
   63 144
   58 145
   37 146
   48 147
   41 148
   72 149
   67 150
   86 151
   70 152
   80 153
   51 154
   77 155
118 156
   39 157
   43 158
   42 159
   49 160
   17 161
   25 162
   69 163
   26 164
   52 165
   44 166
   52 167
   61 168
   68 169
   37 170
   28 171
   64 172
   29 173
   41 174
   28 175
   34 176
   45 177
   31 178
   41 179
   38 180
   67 181
   35 182
   23 183
   22 184
   25 185
   35 186
   21 187
   34 188
   24 189
   27 190
   37 191
   58 192
   31 193
   26 194
   49 195
   26 196
   25 197
   17 198
   30 199
   31 200
   17 201
   28 202
    6 203
   21 204
   12 205
   12 206
   17 207
   25 208
   41 209
   77 210
   19 211
    8 212
   11 213
   14 214
   20 215
   26 216
   18 217
   26 218
   19 219
   22 220
   15 221
   14 222
   15 223
   21 224
   12 225
   10 226
   27 227
   25 228
   32 229
   11 230
   19 231
   16 232
   32 233
   39 234
   38 235
    7 236
   10 237
   19 238
   16 239
   15 240
   13 241
   22 242
   39 243
    9 244
    8 245
   14 246
    9 247
    7 248
   21 249
   16 250
    6 251
   22 252
   10 253
    9 254
   27 255
   12 256
   51 257
3367 258

A large png image (Steve's favorite):

gromit% pngdat < bigpngs/pdp11,70.png | infgen | grep ^match | awk  
'{ print $2 }' | sort -n | uniq -c
589718 3
1225451 4
705512 5
355213 6
220095 7
170913 8
132174 9
102171 10
85400 11
66028 12
47291 13
36402 14
26569 15
18257 16
13741 17
10086 18
6587 19
4848 20
3663 21
2356 22
1731 23
1499 24
916 25
694 26
577 27
342 28
288 29
233 30
127 31
137 32
120 33
   71 34
   56 35
   46 36
   32 37
   33 38
   31 39
   17 40
   10 41
    9 42
   15 43
    6 44
    9 45
    2 46
    3 47
    4 48
    1 50
    4 51
    1 52
    1 53
    1 59





More information about the Zlib-devel mailing list