Automatic image annotation

Automatic image annotation (also known as automatic image tagging or linguistic indexing) is the process by which a computer system automatically assigns metadata in the form of captioning or keywords to a digital image. This application of computer vision techniques is used in image retrieval systems to organize and locate images of interest from a database.

This method can be regarded as a type of multi-class image classification with a very large number of classes - as large as the vocabulary size. Typically, image analysis in the form of extracted feature vectors and the training annotation words are used by machine learning techniques to attempt to automatically apply annotations to new images. The first methods learned the correlations between image features and training annotations, then techniques were developed using machine translation to try to translate the textual vocabulary with the 'visual vocabulary', or clustered regions known as blobs. Work following these efforts have included classification approaches, relevance models and so on.

The advantages of automatic image annotation versus content-based image retrieval (CBIR) are that queries can be more naturally specified by the user.[1] CBIR generally (at present) requires users to search by image concepts such as color and texture, or finding example queries. Certain image features in example images may override the concept that the user is really focusing on. The traditional methods of image retrieval such as those used by libraries have relied on manually annotated images, which is expensive and time-consuming, especially given the large and constantly growing image databases in existence.

Automatic Image Annotation Software

SuperAnnotate

SuperAnnotate is an end-to-end platform for computer vision engineers and annotation teams to annotate, manage, train, and ultimately automate computer vision pipelines.

Automation: The platform allows three distinct types of automation both on labeling and quality assurance levels. The automation can be done through transfer learning, active learning[2] and mislabel detection.[3] Through the established link between the data annotation projects and Neural Network environment, one has the capacity to train custom models, perform manual corrections and iterate, all within the same platform, consequently increasing the speed and the accuracy of each new annotation task. The platform also allows you to select the most relevant frames from the large set of images which will help to reach the highest recognition accuracy with the limited dataset. Apart from the annotation automation itself, SuperAnnotate allows the elimination of data noise by automating the detection of mislabeled training samples. The platform is specifically built to unify and automate the entire data annotation pipeline.

API Integrations: The platform comes with a built in Python SDK that automates project setup and distribution, team management, and scaling for larger projects. The SDK includes a variety of data transfers functions, annotation converters, functions for data manipulations of images, annotations etc.[4] It also allows CV engineers to run training, compare multiple training results, automatically find risky annotations etc.[5]

Labelbox

Labelbox is a training data platform[6] used primarily for computer vision applications to provide labeled data to data scientists and machine learning engineers in their development of machine learning models.

The platform claims to offer automated labeling for image annotation by relying on predictions from an existing machine learning model to pre-label image data.[7] The product also offers a Python SDK.[8]

References

SuperAnnotate (2020-09-30), AnnotationSoftware/active_learning, retrieved 2020-11-17
SuperAnnotate (2020-09-17), AnnotationSoftware/qa-automation, retrieved 2020-11-17
SuperAnnotate (2020-09-17), AnnotationSoftware/superannotate-python-sdk, retrieved 2020-11-17
"SuperAnnotate Desktop". opencv.org. Retrieved 2020-11-17.
"Labelbox: The training data platform for AI teams". labelbox.com. Retrieved 2020-12-16.
"Labelbox: Use your own model to accelerate labeling". labelbox.com. Retrieved 2020-12-16.
Labelbox, labelbox: Labelbox Python API, retrieved 2020-12-16

Datta, Ritendra; Dhiraj Joshi; Jia Li; James Z. Wang (2008). "Image Retrieval: Ideas, Influences, and Trends of the New Age". ACM Computing Surveys. 40 (2): 1–60. doi:10.1145/1348246.1348248. S2CID 7060187.
Nicolas Hervé; Nozha Boujemaa (2007). "Image annotation : which approach for realistic databases ?" (PDF). ACM International Conference on Image and Video Retrieval. Archived from the original (PDF) on 2011-05-20.
M Inoue (2004). "On the need for annotation-based image retrieval" (PDF). Workshop on Information Retrieval in Context. pp. 44–46. Archived from the original (PDF) on 2014-08-08.

Automatic image annotation