Histograms (from scratch)

Computing the Histogram

A histogram is nothing but a function that takes as input each of the gray levels present in an image, and offers as output the number of pixels that each of those values share.

from numpy import *
import scipy
import matplotlib.pyplot

image=scipy.misc.lena()

Hist = image.flatten().tolist()
grayscales  = uniq(Hist)
frequencies = [Hist.count(x) for x in grayscales]

matplotlib.pyplot.figure()
matplotlib.pyplot.bar(grayscales,frequencies,color='g',edgecolor='k')
matplotlib.pyplot.savefig('/Users/blanco/Desktop/histogram.png')

Of course, there already is a simple (and much faster!) command in sage that will take care of this computation for us: matplotlib.pyplot.hist.

Histogram Equalization

We alter now the value of each grayscale of the original image, in such a way that the final histogram is as flat as possible, and spreads out over the entire range of the gray levels. We accomplish this by the following procedure:

Assume $f \colon \{ 1, \dotsc, N\} \times \{ 1, \dotsc, M\} \to \{ 0, \dotsc, 255\}$ is a $N \times M$ image with 256 gray-scales. We are looking for a function $\mathcal{E}\colon \{ 0, \dotsc, 255\} \to \{ 0, \dotsc, 255\}$ that performs the histogram equalization: $g(\boldsymbol{x}) = \mathcal{E} \big( f( \boldsymbol{x}) \big)$ .

The way we accomplish it is by:

Collecting first the histogram of each gray-scale $k \in \{ 0, \dotsc, 255\}:$ $H(k) = \# \{ \boldsymbol{x} : f(\boldsymbol{x}) = k \}$
Computing the probability of finding a pixel with each gray-scale in the given image: $p(k) = \frac{1}{NM} H(k).$
Computing the cumulative-density function for each gray-scale:
$P(k) = \frac{1}{NM} \displaystyle{\sum_{j=0}^k} H(j).$
The histogram equalization $\mathcal{E}$ simply takes the propability density function for the values in the image $f$ and multiplies them by the cumulative density function of the values in the image that we seek:
$\mathcal{E}(k) = \frac{1}{NMp(k)} \displaystyle{\sum_{j=0}^k} H(k).$

Note that this procedure reduces the number of gray-scales in an image.

cumsumfreqs = cumsum(frequencies)

histeq=zeros(image.shape)
for index, value in enumerate(grayscales):
    histeq+=cumsumfreqs[index]*(image==value)

histeq/=prod(img.shape)
histeq*=len(grayscales)

As proof of concept, let us compare original with its histogram-equalized version:


Original	Histogram-equalization

Note the obvious sharpening of details, albeit the apparent presence of noise where before the image seemed flat. This noise—that we can appreciate for example in the hat or walls—actually reveals a richer texture of the materials photographed. These tectures were not observable directly in the original. Note as well the accentuation of the brightest areas, producing an image with better contrast. I am sure the reader will find some more qualitative improvement in the histogram-equalization version of the image. Working on a set of dark images will reveal even more surprising benefits of this technique.

References

The Image Processing Handbook, Sixth Edition

Comments (1) Trackbacks (0) Leave a comment Trackback

Allen W. Smith, Ph.D.

August 20, 2014 at 8:04 pm

Reply

Should equation 4 be H(k) or H(j)? (And I note in the above that, by the graph, the top value is not being altered, only the bottom one.) Good point about it actually winding up less values – particularly critical with only 255 different ones to start with!

No trackbacks yet.

Francisco Blanco-Silva