next up previous contents
Next: Used Mammograms Containing Masses Up: Málaga Database Previous: Málaga Database   Contents

Database Characteristics

The mammograms are stored in a raw format, where each pixel is represented by two bytes (the grey level of each pixel is represented using $ 12$ bits). The size of each mammogram is variable, but it can be known from the overlay file.

Each case is stored using the following nomenclature: XXXXAB.raw, where XXXX is a number representing the case, A indicates if the mammograms provides from a right breast (``R'') or from a left one (``L''). The B can be ``C'' or ``O'', which respectively imply that the mammogram provides a CC or a MLO view.

The overlays are stored in a tif file, which the same name of the case, but with a constant suffix (_lb). Thus, looking for the size of these files, is possible to know the size of the raw files.

The most interesting fact of this database is that the overlay files are composed by the opinion of six different radiologists. Thus, each overlay file has only seven grey-levels (0 means normal pixels, and from $ 1$ to $ 6$ there is the opinion of the different radiologists), where $ 1$ means that only one radiologist has marked the pixel as belonging to a mass, $ 2$ that two radiologists marked the pixel as mass, and so on. Thus, linearly equalizing the images, the centre of the masses are the most brighter regions of the mass, and then, gradually, grey-level values decrease surrounding them.


next up previous contents
Next: Used Mammograms Containing Masses Up: Málaga Database Previous: Málaga Database   Contents
Arnau Oliver 2008-06-17