Theo Gevers and Arnold W. M. Smeulders
In this chapter, we present an overview on the theory, techniques, and applications of content-based image retrieval. We choose patterns of use, image domains, and computation as the pivotal building blocks of our survey. A graphical overview of the content-based image retrieval scheme is given in Figure 8.1. Derived from this scheme, we follow the data as they flow through the computational process (see Figure 8.3), with the conventions indicated in Figure 8.2. In all of this chapter, we follow the review in [155] closely.
We focus on still images and leave video retrieval as a separate topic. Video retrieval could be considered a broader topic than image retrieval, as video is more than a set of isolated images. However, video retrieval could also be considered simpler than image retrieval, since, in addition to pictorial information, video contains supplementary information such as motion, spatial constraints, and time constraints (e.g., video discloses its objects more easily, as many points corresponding to one object move together and are spatially coherent in time). In still pictures, the user's narrative expression of intention is in image selection, object description, and composition. Video, in addition, has the linear timeline as an important information cue to assist the narrative structure.