Writing Programs to Process Images

Wesley E. Snyder; Hairong Qi

doi:10.1017/9781316882641.005

2 - Writing Programs to Process Images

from Part I - Preliminaries

Published online by Cambridge University Press: 25 October 2017

Wesley E. Snyder and

Hairong Qi

Show author details

Wesley E. Snyder: Affiliation:
North Carolina State University
Hairong Qi: Affiliation:
University of Tennessee

Book contents

Get access

Summary

Computer Science is not about computers any more than astronomy is about telescopes.

– E.W. Dijkstra

Introduction

One may take two approaches to writing software for image analysis, depending on what one is required to optimize. One may write in a style that optimizes/minimizes programmer time, or one may write to minimize computer time. In this course, computer time will not be a concern (at least not usually), but your time will be far more valuable. For that reason, we want to follow a programming philosophy that produces correct, operational code in a minimal amount of programmer time. The programming assignments in this book are specified to be written in C or C++, rather than in MATLAB or JAVA. This is a conscious and deliberate decision. MATLAB in particular hides many of the details of data structures and data manipulation from the user. Most of the time, that's a good thing. However, in the course of teaching variations of this course for many years, the authors have found that many of those details are precisely the details that students need to grasp in order to effectively understand what image processing (particularly at the pixel level) is all about.

In this book, at least initially, we want the students to write code that works at the pixel level, so they come to understand what the computer is really doing. Later in the course, the student will slowly move up in levels of abstraction. We point the reader to [2.2, 2.1] for texts that emphasize the use of MATLAB.

Basic Programming Structure for Image Processing

Images may be thought of as two- or three-dimensional arrays. They are usually processed pixel-by-pixel in a raster scan. In order to manipulate an image, two- or three-nested for-loops is the most commonly used programming structure, as shown in Figures 2.1 and 2.2.

In these examples, we use two or three integers (row, col, and frame) as the indices to the row, column, and frame of the image. By increasing row, col, and frame with a step one, we are actually scanning the image pixel-wise from left to right, top to bottom, frame by frame.

Type: Chapter
Information: Fundamentals of Computer Vision , pp. 11 - 15

DOI: https://doi.org/10.1017/9781316882641.005 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2017

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book purchase

Temporarily unavailable

References

[2.1] P., Corke. Robotics, Vision and Control: Fundamental Algorithms in MATLA B. Springer, 2011.

[2.2] R., Gonzalez, R., Woods, and L., Eddins. Digital Image Processing Using MATLAB. McGraw- Hill, 2nd edition, 2016.