The performance of the system is evaluated using a total of
mammograms,
with confirmed masses (the ground-truth provided
by an expert) and the rest being normal mammograms.
Both sets of RoIs (the one containing only masses and the other
containing masses and normal tissue) were extracted from these
mammograms. We used four different groups according to their size.
For the first dataset, each group corresponds to the following
intervals for mass sizes:
, and there were, respectively,
,
,
and
masses. For the second set of RoIs these groups
were completed with
normal, but suspicious, RoIs images for
each mass RoI. In the results, algorithms d1, d2 and
the algorithm without this false positive reduction step are also
included for direct comparison.
Figure shows the FROC curve for our
proposal explained in the Chapter
(grey line) and the same approach integrated with the proposed
false positive reduction algorithm (black line). Note that the
inclusion of this step clearly improves the performance of the
algorithm: at the same sensitivity, the number of false positives
per image is reduced. For instance, one false positive per image
is reduced at a sensitivity of
. Analyzing in the other
direction, the inclusion of the false positive reduction algorithm
allows to increase the sensitivity at a given false positive rate.
For example, at one false positive per image the sensitivity
increases from
to
.
![]() |
On the other hand, Figure shows
the FROC curve for the algorithms d1, d2, and the
proposed system including the false positive reduction step (the
black line with squares). The difference between the proposed
algorithm and both approaches is now clearer than in
Figure
. For instance, at the same sensitivity
analyzed in Section
(Sensitivity =
)
the mean number of false positive per image is now
, which
is
less compared to the algorithm without the false
positive reduction step. This shows the benefits of including this
algorithm.
![]() |
We include again the performance of the algorithm detailed for
each lesion size in Figure . Note that
larger masses are still more difficult to be accurately detected.
However, the inclusion of the false positive reduction step allows
to detect them at almost
false positive per image less than
without this step. Moreover, the performance of the three smaller
sizes is now more similar than without using the false positive
reduction step.
Once the mammograms containing masses are detected, ROC curves are
constructed to measure the accuracy in which the masses are
detected. The overall performance over the
mammograms
containing masses resulted in
values of
and
without and with the false positive reduction step,
while the results for the both compared approaches were
and
for algorithms d1
and d2 respectively. Note that the false positive reduction
step introduces a penalization term in the accuracy with which the
algorithm detects masses. This is due to the elimination of some
RoI that were actually representing a true mass.
Table shows the effect of the
lesion size for the different algorithms in terms of mean and
standard deviation of
values. Note that the inclusion of the
false positive reduction step in some cases slightly decreases the
performance of the proposal. This is due to the above mentioned
fact, where a mass which was correctly detected using the
proposal, was then considered as normal tissue by the false
positive reduction algorithm. When this is not the case, the
obtained
is increased.