In this section we exhaustively evaluate the effect of introducing the breast density into the false positive reduction step. As already explained, this is done by clustering the DDSM RoIs database according to this parameter. The results are obtained by using the same leave-one-out method explained in Section , where RoIs depicted a true mass and the rest were normal, but suspicious tissue.
Figure shows the mean value obtained using the leave-one-out strategy and varying the ratio between both kinds of RoIs. Note that, as expected, the performance of both PCA and 2DPCA approaches decreases as the ratio of RoIs depicting masses decrease. For the PCA approach we obtained for the ratio and for the ratio , while using the 2DPCA approach we obtained and respectively. Again, the 2DPCA approach obtained better performances than the PCA.
The mean for each cluster size at ratio is shown in Table . The overall performance of the system is up to . Moreover, a similar trend to the one mentioned in is observed, with a better classification for larger masses. Thus, comparing the performance of both results we show that considering the breast tissue obtain an improvement of in value.
Figure shows the mean kappa value obtained using the leave-one-out strategy at different ratios and threshold . The same behaviour mentioned for values is repeated, where the performance of both approaches are reduced when increasing the number of normal samples. On the other hand, Figure shows a comparison for the 2DPCA approach with and without taking breast density information into account. Note that considering such parameter the results clearly improves. At ratios , , , and the agreement can be considered as almost perfect, while for ratios and the agreement is substantial. In contrast, without using such information, the agreement for the two latter ratios was only moderate, and only the agreement in ratios and could be considered as almost perfect.
|
Finally, in Table a comparison of the performance of the methods with respect to each BIRADS category is shown. Note that for the PCA-based method, mammograms with lower BIRADS were better classified that mammograms with higher BIRADS. This result seems plausible because it is equivalent to the performance of a human expert, as it is well known that experts radiologists have more difficulties to find masses in dense mammograms than in fatty ones. In contrast, the performance of the 2DPCA-based method is more independent of the breast tissue, although for BIRADS IV its performance decreased.