Glossary

Zero-shot Learning

Training models to recognize unseen categories during training, leveraging knowledge transfer and generalization.

Definition

Zero-shot Learning (ZSL) is an advanced machine learning technique where a model learns to correctly identify and categorize objects, concepts, or entities it has never explicitly seen during training. This approach relies on transferring and generalizing knowledge from seen to unseen categories by leveraging semantic relationships and attributes shared across different classes. In zero-shot learning, models are typically trained with a rich set of features or attributes that describe each class, along with the relationships between these classes.

The model then uses this learned knowledge structure to infer the properties and categories of new, unseen instances. Zero-shot learning is particularly useful in scenarios where it is impractical to collect labeled data for every possible category, and it highlights the model's ability to leverage abstract, high-level understanding to make inferences about new or rare categories.

Examples / Use Cases

In natural language processing, a zero-shot learning model might be trained to perform text classification tasks on a set of known topics and then applied to classify texts in completely new topics based on the semantic understanding of words and topics. For instance, if trained on sports and politics articles, it might classify an unseen "technology" article by recognizing related words and concepts.

In image recognition, a model trained on a dataset of animal images with attributes like "has feathers," "can fly," and "has beak" could potentially identify an unseen bird species in an image by associating these attributes with the concept of birds.

In e-commerce, a zero-shot recommendation system could suggest products in categories never before encountered, based on user preferences and product attributes learned from other categories. These examples demonstrate the power of zero-shot learning in enabling models to apply learned knowledge to novel situations, expanding their applicability and utility in dynamic, real-world environments.

Related Terms

Machine Learning

← Back to Glossary