Image Classification

Classify images into predefined categories using Vision Transformer (ViT) models. The model returns the top predicted labels with confidence scores.

For full API reference (classifyImage(), options, result types, and custom providers), see the Core Vision guide.

See it in action

Try Smart Gallery for a working demo.

Recommended Models

Model	Size	Categories	Use Case
`Xenova/vit-base-patch16-224`	~86MB	1000 ImageNet classes	General image classification

ImageNet Classes

ViT models trained on ImageNet classify into 1000 categories including animals, vehicles, food, and everyday objects. For classifying into custom categories, use Zero-Shot Image Classification with CLIP.

Showcase Apps

App	Description	Links
Smart Gallery	Auto-classify gallery photos by content type	Demo · Source

Next Steps

Core Vision API

Full API reference including classifyImage().

Zero-Shot Image

Classify images into custom categories without training.

Object Detection

Detect and locate objects in images.