Computer Vision May 2026 12 min read

Polygon Annotation: When Polygons Beat Bounding Boxes (and When They Don't)

Polygons sit in the awkward middle of computer-vision annotation — cheaper than segmentation, more expensive than a box, and constantly mis-scoped in both directions. This is the practical guide to when polygons are the right tool, what they cost, and the per-vertex trap that wrecks budgets.

Most teams who land on a polygon brief came to it from one of two directions. Either they tried to train an object detector on bounding boxes and the model kept including background as “part of the object” — fashion AI is the textbook case. Or they spec'd full pixel segmentation, paid double, and realised six weeks in that polygon-level fidelity would've been enough.

This guide is the version we wish landed in those briefs on day one — what a polygon actually is, when it's the right tool, the vertex-count discipline that decides whether your project comes in on budget, and how polygon QA actually works. Honest, opinionated, no vendor-deck energy.

What a Polygon Actually Is

A polygon is an ordered list of (x, y) vertices that, joined in sequence, trace the outline of an object. A 12-vertex polygon around a car is twelve points the annotator placed along the car's silhouette. The model learns “everything inside this closed shape is the object” — which, unlike a bounding box, doesn't drag a halo of background into the training signal.

The trade-off is straightforward. A box is four numbers and takes seconds. A polygon is N numbers and takes minutes. The model benefit is real, but only when the object actually has irregular shape — putting a polygon around a rectangular shipping container is paying for accuracy you don't need.

The 30% Rule: When To Reach For a Polygon

Honest test we apply on every incoming brief — if a bounding box around the object would be more than 30% background, a polygon will train a noticeably better model. Less than 30%, the box is fine and you save the money.

For the box-vs-polygon end of this decision, we go deep in the bounding box guide. For the polygon-vs-segmentation end, see the image segmentation guide.

The Vertex-Count Discipline (Where Polygon Projects Live or Die)

The single biggest determinant of polygon project cost and quality is vertex discipline. Two failure modes show up on almost every incoming dataset audit:

The fix is a vertex budget written into the annotation spec, per class. Cars at 3/4 view — 12–20 vertices. Pedestrians — 16–24. Bananas — 8–12. Crisp-edge logos — corner-driven, no smoothing fluff. Polygons that fall outside the band get flagged in QA and re-traced. Boring, effective, the difference between a project that ships and one that runs over.

Polygon vs Box vs Segmentation: The Honest Cost Comparison

Rough per-image cost ratios we see on production projects (varies with class density and complexity):

The point isn't the exact ratios — they shift with class density and image complexity. The point is that polygon is the right tool for a specific middle band, and projects that drift into polygon territory when boxes were fine, or that drift into segmentation when polygons were enough, are the two cheapest ways to over-pay. Match the tool to the question.

Formats and Tools

COCO JSON is the dominant format — polygons stored as ordered (x, y) vertex lists, indexed against image IDs and category labels. GeoJSON is the geospatial cousin, common for satellite and drone work. Pascal VOC XML with polygon extensions exists in older pipelines.

On the tooling side, polygon-native viewers (CVAT, Label Studio, the commercial suites) all support the basic draw-and-edit. The features that actually matter for production speed — magnetic edge snapping, AI-assisted boundary suggestion, vertex-budget warnings, and bulk class re-assignment — vary widely between tools. Pick on those, not on the polished demo.

Quality: IoU, mAP, and the Vertex-Distance Metric Most People Skip

Standard polygon QA tracks IoU against gold-standard polygons (threshold 0.7–0.8 for production work), per-class mAP on a held-out set, and inter-annotator agreement on the gold. The metric worth adding that most vendors skip — mean vertex-distance error from the gold polygon. IoU alone can mask sloppy tracing where the polygon happens to enclose the right area but the boundary is wandering. Vertex-distance catches that. General framework lives in the annotation QA playbook.

Need polygons done properly?

Free 50-image polygon pilot — vertex-budgeted, per-class accuracy reporting, COCO / GeoJSON output. 48-hour turnaround.

See our polygon annotation service

Where Polygons Get Used in Production

Related Reading

Free Sample · 24-48 hours

Get a 50-image polygon pilot in 48 hours

Send a representative sample — we'll deliver vertex-budgeted polygons in COCO or GeoJSON with per-class accuracy on the gold set.

No commitment. NDA available on request. We respond within 24 hours, often the same day for Gulf-region inquiries.

Neel Bennett

AI Annotation Specialist at AI Taggers

Neel has over 8 years of experience in AI training data and machine learning operations. He specializes in helping enterprises build high-quality datasets for computer vision and NLP applications across healthcare, automotive, and retail industries.

Connect on LinkedIn