Free Open-Vocabulary Object Detection

Detect any object with natural language prompts — powered by SAM 3, 100% free

Drag & drop an image here, or browse

Supports: JPEG, PNG, WebP, BMP (max 10MB)

What objects to detect?

Separate multiple prompts with commas

Show advanced options

What is Open-Vocabulary Object Detection?

Open-vocabulary object detection is an advanced AI technology that identifies and segments objects in images using natural language descriptions. Unlike traditional detection models limited to fixed classes, SAM 3 (Segment Anything Model 3) can detect any object you describe — from "person wearing a red jacket" to "coffee cup on desk". Simply describe what you're looking for, and the model will find and segment it.

Supported File Types

Images: JPEG, PNG, WebP, BMP
Maximum file size: 10MB

For best results, use clear, high-resolution images with good lighting.

How it works

Upload an image by dragging and dropping or clicking to browse.
Describe what you want to detect using natural language (e.g., "person with hat, red car, dog").
Click "Detect Objects" to process your image.
View and download the annotated image with segmentation masks.

Privacy & Data Handling

Your uploaded images are processed securely and are not stored permanently. Images are deleted immediately after processing. We do not use your data for training purposes. For more details, see our Privacy Policy.

API Access

Need to integrate object detection into your application? Check out our Object Detection API documentation for programmatic access with additional features.