Simple Object Detection Action
The Simple Object Detection Action detects objects in a still image and sends the object information to the destination channel.
By default, the action counts detected objects for the following 6 default classes: Person, Car, Bus, Truck, Cat, and Dog. In addition, all detected objects (including other classes) are included in the objects array.
You can optionally configure Target Object Classes to specify which object classes to include in the detection results. When configured, the action filters detections so that only the specified object classes are included in the objects array, and the output still returns count fields only for the 6 default classes.
Configuration
In the Action Condition field, specify the conditions under which the action runs. Conditions can reference the values (Message or Context) that the event source sends to the channel.
Configuring Simple Object Detection Action
The Image URL field supports expressions.
- Image URL: Specify the URL of the still image to be used for simple object detection.
  - Example expression: ${event.payload.presignedUrls.get} (to perform simple object detection on a file uploaded to Soracom Harvest Files).
- Target Object Classes: Specify the object classes to include in the detection results. If no selection is made, all supported object classes will be targeted for detection. You can select multiple object classes. For a full list, refer to Supported Object Classes.
Output
Set up how the action's output data will be processed. Refer to Enable Republishing of Action Output for more details.
Output Data of the Action
By default, the output data includes the count for each of the 6 default classes and an objects array containing all detected objects. The objects array includes objects from both the default classes and other supported classes. The following example shows a person and a cup detected in the image:
{
  "person": 1,
  "car": 0,
  "bus": 0,
  "truck": 0,
  "cat": 0,
  "dog": 0,
  "objects": [
    {
      "location": [
        0.42561606168746946,
        0.3883926073710124,
        0.5880238652229309,
        0.8953371047973633
      ],
      "score": 0.6473677158355713,
      "label": "person",
      "ts": "2024-06-17T05:28:24Z"
    },
    {
      "location": [
        0.3454461991786957,
        0.6387763867775599,
        0.39976382851600645,
        0.739234521985054
      ],
      "score": 0.5285048484802246,
      "label": "cup",
      "ts": "2024-06-17T05:28:24Z"
    }
  ]
}
The count fields are provided only for the 6 default classes. Other detected object classes are included in the objects array but are not counted in separate fields.
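Because classes outside the 6 defaults have no count field, a consumer that needs those counts has to derive them from the objects array. The following Python sketch shows one way to do that. The function name count_labels and the shortened sample payload are illustrative assumptions, not part of the action itself.

```python
import json
from collections import Counter

# Sample payload shaped like the example output above (values shortened for readability).
SAMPLE_OUTPUT = """
{
  "person": 1, "car": 0, "bus": 0, "truck": 0, "cat": 0, "dog": 0,
  "objects": [
    {"location": [0.43, 0.39, 0.59, 0.90], "score": 0.65,
     "label": "person", "ts": "2024-06-17T05:28:24Z"},
    {"location": [0.35, 0.64, 0.40, 0.74], "score": 0.53,
     "label": "cup", "ts": "2024-06-17T05:28:24Z"}
  ]
}
"""


def count_labels(output: dict) -> Counter:
    """Count every label in the objects array, default class or not."""
    return Counter(obj["label"] for obj in output.get("objects", []))


counts = count_labels(json.loads(SAMPLE_OUTPUT))
print(counts["cup"])     # "cup" has no dedicated count field in the output
print(counts["person"])  # should agree with the top-level "person" field
```

A Counter returns 0 for labels that were not detected, so lookups for absent classes do not raise an error.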
When Target Object Classes is configured (e.g., truck), the action filters the detection results so that only objects belonging to the specified object classes are included in the objects array. The response still contains count fields only for the 6 default classes. These counts reflect detections of those default classes within the filtered results. For example, when a truck is detected:
{
  "person": 0,
  "car": 0,
  "bus": 0,
  "truck": 1,
  "cat": 0,
  "dog": 0,
  "objects": [
    {
      "location": [
        0.04083414,
        0.21190794,
        0.4107799,
        0.8861815
      ],
      "score": 0.91171336,
      "label": "truck",
      "ts": "2026-03-16T23:39:13Z"
    }
  ]
}
When no objects matching the specified object classes are detected:
{
  "person": 0,
  "car": 0,
  "bus": 0,
  "truck": 0,
  "cat": 0,
  "dog": 0,
  "objects": []
}
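The relationship between the filtered objects array and the count fields can be illustrated with a small local emulation in Python. This is only a sketch of the behavior described above, not the service's implementation. The names apply_target_classes and DEFAULT_CLASSES are hypothetical, and the sample detections are invented for illustration.

```python
# The 6 default classes that always appear as count fields in the output.
DEFAULT_CLASSES = ("person", "car", "bus", "truck", "cat", "dog")


def apply_target_classes(objects, target_classes):
    """Emulate the documented filtering on a list of detected objects.

    An empty target_classes means no selection was made, so every object is kept.
    """
    if target_classes:
        kept = [o for o in objects if o["label"] in target_classes]
    else:
        kept = list(objects)

    # Count fields exist only for the default classes and reflect the filtered results.
    result = {name: 0 for name in DEFAULT_CLASSES}
    for o in kept:
        if o["label"] in result:
            result[o["label"]] += 1
    result["objects"] = kept
    return result


# Hypothetical detections, reduced to the fields this sketch needs.
detections = [
    {"label": "person", "score": 0.65},
    {"label": "truck", "score": 0.91},
    {"label": "cup", "score": 0.53},
]

only_truck = apply_target_classes(detections, {"truck"})
only_cup = apply_target_classes(detections, {"cup"})
no_match = apply_target_classes(detections, {"dog"})
print(only_truck["truck"], len(only_truck["objects"]))
```

In this emulation, targeting only cup yields zero for all six count fields while the objects array still contains the cup, which mirrors the rule that non-default classes are never counted in separate fields.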
The output data includes the following attributes:
| Field | Description |
|---|---|
| person | Number of detected people. |
| car | Number of detected cars. |
| bus | Number of detected buses. |
| truck | Number of detected trucks. |
| cat | Number of detected cats. |
| dog | Number of detected dogs. |
| objects[] | Information about the objects detected by the Simple Object Detection Action. |
| objects[].location | The position of the detected object within the still image. |
| objects[].score | Confidence score indicating the accuracy of the detected object. |
| objects[].label | Detected object label. |
| objects[].ts | Date and time when the object was detected. |
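For downstream processing, it can help to turn location and ts into typed values. The sketch below assumes that location holds [x_min, y_min, x_max, y_max] normalized to the range 0 to 1. Neither the order nor the normalization is stated in this document, so verify both against your own images before relying on this. The helper names to_pixel_box and parse_ts are hypothetical.

```python
from datetime import datetime, timezone


def to_pixel_box(location, width, height):
    """Scale a normalized box to pixel coordinates for an image of the given size."""
    # ASSUMPTION: location = [x_min, y_min, x_max, y_max], each value in 0..1.
    x_min, y_min, x_max, y_max = location
    return (
        round(x_min * width),
        round(y_min * height),
        round(x_max * width),
        round(y_max * height),
    )


def parse_ts(ts):
    """Parse a timestamp such as 2024-06-17T05:28:24Z into a timezone-aware UTC datetime."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


box = to_pixel_box([0.25, 0.5, 0.75, 1.0], width=640, height=480)
when = parse_ts("2024-06-17T05:28:24Z")
print(box, when.isoformat())
```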
Supported Object Classes
| Class Label | ID | Class Label | ID |
|---|---|---|---|
| Airplane | airplane | Microwave | microwave |
| Apple | apple | Motorcycle | motorcycle |
| Backpack | backpack | Mouse | mouse |
| Banana | banana | Orange | orange |
| Baseball Bat | baseball bat | Oven | oven |
| Baseball Glove | baseball glove | Parking Meter | parking meter |
| Bear | bear | Person | person |
| Bed | bed | Pizza | pizza |
| Bench | bench | Potted Plant | potted plant |
| Bicycle | bicycle | Refrigerator | refrigerator |
| Bird | bird | Remote | remote |
| Boat | boat | Sandwich | sandwich |
| Book | book | Scissors | scissors |
| Bottle | bottle | Sheep | sheep |
| Bowl | bowl | Sink | sink |
| Broccoli | broccoli | Skateboard | skateboard |
| Bus | bus | Skis | skis |
| Cake | cake | Snowboard | snowboard |
| Car | car | Spoon | spoon |
| Carrot | carrot | Sports Ball | sports ball |
| Cat | cat | Stop Sign | stop sign |
| Cell Phone | cell phone | Suitcase | suitcase |
| Chair | chair | Surfboard | surfboard |
| Clock | clock | Teddy Bear | teddy bear |
| Couch | couch | Tennis Racket | tennis racket |
| Cow | cow | Tie | tie |
| Cup | cup | Toaster | toaster |
| Dining Table | dining table | Toilet | toilet |
| Dog | dog | Toothbrush | toothbrush |
| Donut | donut | Traffic Light | traffic light |
| Elephant | elephant | Train | train |
| Fire Hydrant | fire hydrant | Truck | truck |
| Fork | fork | TV | tv |
| Frisbee | frisbee | Umbrella | umbrella |
| Giraffe | giraffe | Vase | vase |
| Hair Drier | hair drier | Wine Glass | wine glass |
| Handbag | handbag | Zebra | zebra |
| Horse | horse | | |
| Hot Dog | hot dog | | |
| Keyboard | keyboard | | |
| Kite | kite | | |
| Knife | knife | | |
| Laptop | laptop | | |