The performance of our approach is evaluated using a total of mammograms extracted from the MIAS mammographic database [169]. Among them, show confirmed masses (the ground-truth provided by an expert) while the rest were normal mammograms. We used four different groups for the mass RoIs according to their size. Each group corresponds to the following intervals of mass sizes: , , , . In each interval there were, respectively, , , and masses. Three masses were excluded from the modeling: one for being much larger than the rest, and the other two for being located at the border of the mammogram. In order to evaluate our proposal a direct comparison with algorithms d1 and d2, described in Sections and respectively, is given.
As is shown in Figure , the proposed approach has a better performance compared to both d1 and d2 approaches. Note that algorithm d1 has a tendency to produce a large number of false positives at high sensitivity rates. For instance, at a sensitivity of , d1 has false positive per image in mean, while d2 , and our approach (Eig) .
|
Figure shows the behaviour of the proposed algorithm according to the size of the lesions (note that we have reduced the scale on the x-axis for a better visualization). At lower sensitivities the algorithm is more or less independent of the size of the lesion. However, when sensitivity is around becomes an important factor, being the performance of the algorithm inversely proportional to the size of the mass. This is due to the fact that the proposed algorithm only detects the centre and the bounding square of the mass: if the mass is larger than the bounding square the sensitivity decreases because there are some pixels not considered as a mass. The opposite can also happen, when the bounding square is larger than the mass the sensitivity decreases because there are some pixels being considered as a mass which are not really part of a mass. Note that both problems are more likely to appear in large masses with spicules than in small masses.
Once the mammograms containing masses are detected, ROC curves are obtained measuring the accuracy with which the masses have been detected. The overall performance of the developed approach over the mammograms containing masses resulted in a , while using d1: and using d2: . Thus, the proposed approach obtains a better performance compared to the original algorithms.
Table shows the effect of the lesion size for the different algorithms in terms of mean and standard deviation. Note that the proposed algorithm has a similar performance for the three smallest sizes, while for the largest one the performance decreases. This is due to the shape variability of larger masses. For example, Figure (c) shows an elliptic mass correctly detected, but with a high number of pixels not being part of a mass classified inside the bounding square. Comparing the performance with algorithms d1 and d2, our approach has a similar trend to d1, with a better performance for smaller masses than for larger masses. In contrast, algorithm d2 tends to increase its performance proportionally to the mass size.