The performance of the system is evaluated using a total of mammograms, with confirmed masses (the ground-truth provided by an expert) and the rest being normal mammograms.
Both sets of RoIs (the one containing only masses and the other containing masses and normal tissue) were extracted from these mammograms. We used four different groups according to their size. For the first dataset, each group corresponds to the following intervals for mass sizes: , and there were, respectively, , , and masses. For the second set of RoIs these groups were completed with normal, but suspicious, RoIs images for each mass RoI. In the results, algorithms d1, d2 and the algorithm without this false positive reduction step are also included for direct comparison.
Figure shows the FROC curve for our proposal explained in the Chapter (grey line) and the same approach integrated with the proposed false positive reduction algorithm (black line). Note that the inclusion of this step clearly improves the performance of the algorithm: at the same sensitivity, the number of false positives per image is reduced. For instance, one false positive per image is reduced at a sensitivity of . Analyzing in the other direction, the inclusion of the false positive reduction algorithm allows to increase the sensitivity at a given false positive rate. For example, at one false positive per image the sensitivity increases from to .
|
On the other hand, Figure shows the FROC curve for the algorithms d1, d2, and the proposed system including the false positive reduction step (the black line with squares). The difference between the proposed algorithm and both approaches is now clearer than in Figure . For instance, at the same sensitivity analyzed in Section (Sensitivity = ) the mean number of false positive per image is now , which is less compared to the algorithm without the false positive reduction step. This shows the benefits of including this algorithm.
|
We include again the performance of the algorithm detailed for each lesion size in Figure . Note that larger masses are still more difficult to be accurately detected. However, the inclusion of the false positive reduction step allows to detect them at almost false positive per image less than without this step. Moreover, the performance of the three smaller sizes is now more similar than without using the false positive reduction step.
Once the mammograms containing masses are detected, ROC curves are constructed to measure the accuracy in which the masses are detected. The overall performance over the mammograms containing masses resulted in values of and without and with the false positive reduction step, while the results for the both compared approaches were and for algorithms d1 and d2 respectively. Note that the false positive reduction step introduces a penalization term in the accuracy with which the algorithm detects masses. This is due to the elimination of some RoI that were actually representing a true mass.
Table shows the effect of the lesion size for the different algorithms in terms of mean and standard deviation of values. Note that the inclusion of the false positive reduction step in some cases slightly decreases the performance of the proposal. This is due to the above mentioned fact, where a mass which was correctly detected using the proposal, was then considered as normal tissue by the false positive reduction algorithm. When this is not the case, the obtained is increased.