Computers are indispensable for the analysis of large amounts of data, for tasks that require complex computation, or for the extraction of quantitative information. On the other hand, the human visual cortex is an excellent image analysis apparatus, especially for extracting higher-level information, and for many applications — including medicine, security, and remote sensing — human analysts still cannot be replaced by computers. For this reason, many important image analysis tools such as edge detectors and neural networks are inspired by human visual perception models.
Digital Image Analysis or Computer Image Analysis is when a computer or electrical device automatically studies an image to obtain useful information from it. Note that the device is often a computer but may also be an electrical circuit, a digital camera or a mobile phone. It involves the fields of computer or machine vision, and medical imaging, and makes heavy use of pattern recognition, digital geometry, and signal processing. This field of computer science developed in the 1950s at academic institutions such as the MIT A.I. Lab, originally as a branch of artificial intelligence and robotics.
There are many different techniques used in automatically analysing images. Each technique may be useful for a small range of tasks, however there still aren't any known methods of image analysis that are generic enough for wide ranges of tasks, compared to the abilities of a human's image analysing capabilities. Examples of image analysis techniques in different fields include:
Since the early 2010s, deep learning methods have substantially advanced the field of image analysis. In 2012, a deep convolutional neural network (CNN) known as AlexNet achieved a significant reduction in error rates on the ImageNet large-scale image classification benchmark, demonstrating the effectiveness of deep learning for visual recognition tasks.[3] Subsequent architectures such as ResNet introduced residual connections that enabled training of much deeper networks, further improving accuracy across image analysis tasks.[4]
Real-time object detection became practical with frameworks such as YOLO (You Only Look Once), which unified detection and classification into a single network pass.[5] In 2020, the Vision Transformer (ViT) demonstrated that transformer architectures, originally developed for natural language processing, could achieve competitive results on image classification when applied directly to sequences of image patches.[6]
More recently, foundation models trained on large-scale datasets have enabled zero-shot generalisation across image analysis tasks. The Segment Anything Model (SAM), trained on over one billion masks, can segment arbitrary objects in images without task-specific fine-tuning.[7] These advances have made image analysis techniques increasingly accessible through browser-based tools and open-source implementations.[8]
Applications
The applications of digital image analysis are continuously expanding through all areas of science and industry, including:
anatomy, allows for precise measurements, visualization, and statistical analysis of anatomical structures.[9]
remote sensing, such as detecting intruders in a house, and producing land cover/land use maps.[10][11]
robotics, such as to avoid steering into an obstacle.
security, such as detecting a person's eye color or hair color.
Object-based
Image segmentation during the object base image analysis.
Object-based image analysis (OBIA) involves two typical processes, segmentation and classification. Segmentation helps to group pixels into homogeneous objects. The objects typically correspond to individual features of interest, although over-segmentation or under-segmentation is very likely. Classification then can be performed at object levels, using various statistics of the objects as features in the classifier. Statistics can include geometry, context and texture of image objects. Over-segmentation is often preferred over under-segmentation when classifying high-resolution images.[12]
Object-based image analysis has been applied in many fields, such as cell biology, medicine, earth sciences, and remote sensing. For example, it can detect changes of cellular shapes in the process of cell differentiation.;[13] it has also been widely used in the mapping community to generate land cover.[12][14]
When applied to earth images, OBIA is known as geographic object-based image analysis (GEOBIA), defined as "a sub-discipline of geoinformation science devoted to (...) partitioning remote sensing (RS) imagery into meaningful image-objects, and assessing their characteristics through spatial, spectral and temporal scale".[15][14] The international GEOBIA conference has been held biannually since 2006.[16]
↑Solomon, C.J., Breckon, T.P. (2010). Fundamentals of Digital Image Processing: A Practical Approach with Examples in Matlab. Wiley-Blackwell. doi:10.1002/9780470689776. ISBN978-0470844731.{{cite book}}: CS1 maint: multiple names: authors list (link)
↑Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. (2012). "ImageNet classification with deep convolutional neural networks". Advances in Neural Information Processing Systems. 25: 1097–1105. doi:10.1145/3065386.
↑He, Kaiming; Zhang, Xiangyu; Ren, Shaoqing; Sun, Jian (2016). Deep Residual Learning for Image Recognition. CVPR 2016. doi:10.1109/CVPR.2016.90.
↑Redmon, Joseph; Divvala, Santosh; Girshick, Ross; Farhadi, Ali (2016). You Only Look Once: Unified, Real-Time Object Detection. CVPR 2016. arXiv:1506.02640. doi:10.1109/CVPR.2016.91.
↑Dosovitskiy, Alexey; etal. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ICLR 2021. arXiv:2010.11929.
↑G.J. Hay & G. Castilla: Geographic Object-Based Image Analysis (GEOBIA): A new name for a new discipline. In: T. Blaschke, S. Lang & G. Hay (eds.): Object-Based Image Analysis – Spatial Concepts for Knowledge-Driven Remote Sensing Applications. Lecture Notes in Geoinformation and Cartography, 18. Springer, Berlin/Heidelberg, Germany: 75-89 (2008)
Quantitative Image Analysis of Microstructures by H.E. Exner & H.P. Hougardy, DGM Informationsgesellschaft mbH, ISBN3-88355-132-5 (1988).
"Metallographic and Materialographic Specimen Preparation, Light Microscopy, Image Analysis and Hardness Testing", Kay Geels in collaboration with Struers A/S, ASTM International 2006.