Once the mammograms that contain masses have been detected, the
algorithms have to be capable of precisely identifying the
position and borders of them. This capability is here evaluated
using ROC analysis, with the emphasis on performance with respect
to the different morphological aspects detailed in MIAS
annotations: the lesion shape (circular mass or spiculated one),
lesion size, and breast tissue type (glandular, dense or fatty).
In addition, an evaluation of the
influence of the number of clusters in clustering algorithms is presented.
Lesion Shape Influence
The lesion shape has a strong influence on the performance of the
segmentation algorithms. Table shows the
values of
when segmenting the
mammograms with masses.
In all cases the algorithms show more accurate detection on
circumscribed masses. However, due to the small number of samples
this difference is not significant.
All algorithms are more accurate in detecting circular masses than
spiculated ones. For algorithms
and
this is due to the
fact that both algorithms apply enhancement filters and thin
spicules are likely to be removed. Algorithm
shows a larger
difference which might be caused by the fact that the original
algorithm was applied to Regions of Interest (RoIs), while our
implementation is applied to the whole image. On the other hand,
note that the efficiency of algorithms
and
is hardly
affected by the lesion shape and it can be attributed to the
nature of clustering, which only takes pixel feature similarity
into account independently of neighbouring pixels. The pattern
matching approach (
) loses some accuracy if the lesion is
spiculated, because the proposed templates are circular. Finally,
algorithms
and
were originally developed to detect
spiculated masses, but in our implementation show higher accuracy
for circumscribed masses. These last two algorithms show the
overall best performance on both circumscribed and spiculated
masses.
Lesion Size Influence
The influence of the lesion size on the algorithms accuracy is
summarized in Table . It shows that the
approach works well for small masses, but when the size of
the masses increases, the algorithms performance decreases. This
is due to the fact that as the size of the masses increases, the
variation in their shape also increases. The opposite is true for
algorithms
and
, where performance increases with the
size of the lesion. Both k-Means (
) and FCM (
) tend to
produce homogeneous clusters [78]. We can also see that
the use of filters and statistical approaches (algorithms
,
and
) follow the same behaviour as the clustering-based
methods.
Finally, note that algorithm
increases its performance with
the size of the lesion until a maximum is reached. Then, its
performance decreases. This is due to the use of skeletons. When
the mass is small, its skeleton is less important for the
detection process. If the mass size increases, its skeleton
becomes easier to detect. For large masses the skeleton becomes
more difficult to detect and hence of less use in the detection
process. It should be noted that this decrease in performance for
larger masses is not significant.
Breast Tissue Influence
The accuracy of the
algorithms depending on breast tissue
classification is summarized in Table
.
Note that most of the algorithms have superior performance in
fatty breasts. This is clearly true for the
and a2
algorithms, which reduce their accuracy by
for dense
breasts. The reason for this can be found in the fact that in the
glandular and dense breasts of the MIAS database, the difference
between mass and normal tissue is less clear when compared to
fatty breasts. In addition, most algorithms have better
performance when dealing with fatty breasts when compared to
breasts of increasing density tissue.
The two exceptions to the above rules are algorithms
and
. This is because these algorithms use contour
information as a basis for the detection process and as such have
a better performance when increased intensity changes are present.
Number of Clusters Influence
As indicated when describing the clustering approaches,
over-segmentation of the image results in improved mass detection.
Table shows the result of
segmenting the image using algorithms
and
but with
different numbers of initial clusters, pointing out that the
performance of the algorithms increases with the increase in
initial set of seeds. However, there is a point where the increase
becomes small and the
value is stable. However, this point
is not the same for all mammograms. In addition, the segmentation
time also increases with the number of initial seed points. In
general, an initial set of
seeds gives the best trade-off
between processing time and detection performance.