Ontology

In the context of data labeling, an ontology is used to define and classify the concepts, objects, and relationships that are relevant to a certain project or domain. According to the Oxford dictionary, we can say that ontology is a “set of concepts and categories in a subject area or domain that shows their properties and the relations between them.“. This is why we can use ontology as a synonym of class hierarchy. 

We use ontologies in data labeling to ensure that the labels applied to the data are consistent, accurate, and meaningful. Let’s take as example a machine learning task to clarify images of animals. This might include definitions and relationships for concepts such as “cat,” “dog,” and “animal”. In this example, an ontology could be defining that the label “cat” is related to “animal”, “dog” is related to “animal”, but “animal” is not necessarily associated with “cat” or “dog”.

We see, then, that ontology in the data labeling context is an operational term to assign classes and possibilities of labeling. Thus, the scientific definition of ontology and the one used in this context are slightly distinct.

Other companies may use ontology to describe annotation guides, but there is a difference between the two concepts. Ontologies are present in the annotation guide, but the last is a more general term. That is, the labeling instructions can include a class hierarchy, but especially includes how to distinguish the classes and attributes during the labeling process. 

Finally, it’s the experts in the domain that manually ontologies create and maintain ontologies. Great data labeling companies will help you to reduce inconsistency of your class hierarchy. We need a well-defined ontology, with quality manual annotation, to ensure the quality and reliability of your labeled data.

Synonyms: Class hierarchy

Related terms: Typology; Labeling instructions