In 2025, the data annotation industry was a $4.1 billion market. In 2026, it is larger and more fragmented. LLM training data demand has reshaped the top end of the market, pulling specialised evaluators — lawyers, doctors, software engineers, native Arabic speakers — into annotation workflows at rates the industry had not previously had to accommodate. Meanwhile, the bottom end of the market has been commoditised further, with crowdsourced platforms pushing simple task rates to near-zero for low-complexity workloads.
The consequence is a pricing landscape where the spread between the cheapest and most expensive annotation for what looks like "the same task" is now 20–50×. Understanding why requires understanding what actually drives annotation cost — and where vendor quotes routinely understate what a production project will actually cost. All prices in this post are expressed in Australian dollars (AUD) and reflect production-quality annotation from credible vendors, not crowdsourced minimum-viable work.
What Actually Drives Annotation Cost
Before looking at task-level pricing, it helps to understand the four variables that move annotation cost most:
- Annotator scarcity. Simple bounding boxes on consumer images can be produced by any literate person with brief training. Emirati Arabic conversational annotation, board-certified radiology segmentation, and Arabic RLHF preference ranking cannot. The supply constraint on specialist annotators sets a floor on per-unit cost that volume cannot fully overcome.
- Task complexity and instructions density. A binary classification with two possible labels and three edge-case examples in the guidelines is fast. Named entity recognition across 12 entity types with 40 pages of annotation guidelines covering financial terminology, nested entities, and multi-token spans is slow. Annotation speed (units per hour) is the primary per-unit cost driver for labour cost.
- QA discipline. Single-pass annotation with automated format checking costs roughly half as much as dual-annotation with adjudication and batch-level inter-annotator agreement measurement. The quality delta is substantial — and the right choice depends entirely on what your model is doing with the data.
- Data sensitivity and compliance overhead. Annotating pseudonymised customer support transcripts from a retail app is straightforward. Annotating clinical discharge summaries for an FDA 21 CFR Part 11-regulated medical AI adds consent verification, access logging, audit trail documentation, and secure data handling overhead that has real cost. GDPR or Australian Privacy Act compliance for European or Australian user data adds similar overhead.
These four variables combine differently across task types, which is why task-type pricing tables are a starting point, not a quote. The numbers below reflect production-grade annotation at reputable vendors — not offshore crowdsourced work with 40% accuracy on edge cases.
Computer Vision Annotation: 2026 Price Ranges
Computer vision annotation is the most mature segment of the market, with the widest spread between simple and complex tasks.
Bounding Box Annotation
- Consumer goods / e-commerce (≤5 objects per image): AUD 0.04–0.12 per image
- Urban street scenes with mixed object classes (10–20 objects): AUD 0.40–1.20 per image
- ADAS driving scenes (20–50 objects, partial occlusion): AUD 1.20–4.00 per image
Segmentation Annotation
- Polygon / instance segmentation, simple objects: AUD 0.25–0.90 per image
- Semantic segmentation, multi-class urban driving: AUD 1.50–5.00 per image
- Medical imaging segmentation (organ, lesion, tumour boundary): AUD 3.00–12.00 per image
Classification and Attribute Tagging
- Binary or 3-class image classification: AUD 0.02–0.06 per image
- Multi-attribute product tagging (10–20 attributes): AUD 0.10–0.35 per image
- Medical image classification with specialist review: AUD 2.00–8.00 per image
For image annotation on e-commerce or retail AI, our data annotation for e-commerce guide covers what quality attributes are actually worth paying the premium for. For large-scale image and video projects, our pricing page covers volume-tier structure.
NLP and Text Annotation: Per-Unit Costs by Complexity
Text annotation pricing varies enormously by task structure. A binary sentiment label on a tweet is a fundamentally different economic task from a discourse-level annotation of a 2,000-word legal document.
Classification Tasks
- Binary or 3-class text classification (intent, sentiment, topic): AUD 0.04–0.10 per text unit
- Multi-class classification with 10–20 labels: AUD 0.08–0.22 per unit
- Hierarchical taxonomy classification (100+ classes): AUD 0.15–0.50 per unit
Named Entity Recognition and Span Labelling
- NER with 4–6 standard entity types (person, org, location, date): AUD 0.06–0.15 per sentence
- NER with 10–15 domain-specific entity types: AUD 0.12–0.30 per sentence
- Clinical NER (medications, dosages, diagnoses, procedures) with clinical review: AUD 0.30–0.80 per sentence
Document-Level Tasks
- Verbatim transcription, English audio: AUD 0.80–1.80 per audio minute
- Document classification and key-field extraction (invoices, forms): AUD 0.40–1.20 per document page
- Relation extraction and coreference in legal or scientific text: AUD 0.50–1.50 per document page
The quality measurement overhead for NLP tasks often exceeds that for image tasks because inter-annotator agreement on span-level and relation-level tasks is structurally harder to achieve. Understanding how to measure this accurately — and what κ thresholds are actually defensible — is covered in our Cohen's kappa guide for annotation teams.
LLM Training Data: SFT Pairs and RLHF Comparisons
The fastest-growing annotation cost category in 2026 is LLM training data. Demand for supervised fine-tuning (SFT) pairs and RLHF preference datasets has pushed rates significantly higher than 2025, and the specialist premium for domain-specific evaluators has widened.
- SFT instruction-response pairs, general-purpose English: AUD 8–25 per pair. Rate is driven by response quality requirements — a pair where the evaluator must write a 300-word, factually accurate response with accurate citations costs more than a 50-word factual answer to a simple question.
- SFT pairs requiring domain expertise (legal, medical, finance): AUD 25–75 per pair. The evaluator must hold relevant professional credentials and demonstrate domain accuracy in the response content. Supply of qualified evaluators in niche domains is the primary cost driver.
- RLHF preference comparisons, general English: AUD 15–50 per comparison task. Rates have risen approximately 20–35% since Q1 2025 as LLM developers scaled RLHF pipelines faster than evaluator supply grew.
- RLHF preference comparisons, domain-specialised: AUD 40–120 per comparison. Legal reasoning, medical advice safety, and complex coding tasks sit at the high end.
For a detailed treatment of how to design RLHF preference datasets and avoid the structural mistakes that make expensive RLHF data useless, see our RLHF data collection guide.
Want a Real Quote for Your Project?
We price transparently — per-unit rates, QA overhead, and guideline development included in scope. No hidden rework costs.
Medical Annotation: The Credentialled Work Floor
Medical annotation sits in its own pricing tier because the annotator credential requirement is non-negotiable for any dataset heading towards a regulatory pathway or clinical deployment. The market is not price-competitive in the way general annotation is — there are simply not enough fellowship-trained radiologists, board-certified ophthalmologists, or licensed clinical coders willing to do annotation work at commodity rates.
- Radiology image annotation (bounding boxes, segmentation, grading): AUD 4.00–18.00 per image depending on modality (CT, MRI, plain film) and complexity tier. Multi-structure organ segmentation on contrast CT runs at the high end.
- Ophthalmology retinal image grading (diabetic retinopathy, AMD, glaucoma): AUD 5.00–15.00 per image. ICDR-aligned grading from fellowship-trained annotators with dual-grader adjudication is standard for FDA submission datasets. See our ophthalmology AI annotation guide for the full grading protocol breakdown.
- Clinical NLP (EHR text, discharge summaries, clinical notes): AUD 0.30–0.80 per sentence for NER; AUD 1.20–3.50 per document page for structured extraction with clinical review sign-off.
- Pathology annotation (histopathology whole-slide or digital pathology): AUD 8.00–30.00+ per annotated region of interest, depending on tissue type, stain, and the level of pathologist involvement in annotation versus QA review.
FDA 21 CFR Part 11-aligned annotation adds provenance logging, access control documentation, and audit trail requirements that add approximately 20–35% to base annotation cost on any clinical AI project. For medical AI projects, our clinical expert annotation service and financial document annotation service both include compliance documentation as standard deliverables.
Multilingual and Arabic Annotation: The Scarcity Premium
Language annotation pricing is anchored to annotator supply. For high-resource languages (French, German, Spanish, Mandarin), the premium over English is modest — typically 1.2–1.8×. For lower-resource languages and dialects where qualified annotator supply is genuinely constrained, the premium is structural.
- Modern Standard Arabic (MSA) text annotation: AUD 0.08–0.18 per unit (classification, NER), approximately 1.5–2× the English rate. MSA-qualified annotator supply is adequate for most project scales but prices have risen since 2024 as Arabic LLM demand grew.
- Dialect-specific Arabic annotation (Khaleeji, Egyptian, Levantine): AUD 0.15–0.40 per unit — 2–3× English. Native-dialect annotators with annotation training are a constrained resource in each dialect community. Khaleeji (Gulf Arabic) is the tightest market, Egyptian the most available.
- Arabic conversational annotation for chatbot / dialogue training: AUD 10–30 per dialogue depending on dialect specificity and domain. Emirati Arabic government service dialogues sit at the high end of this range.
- Hebrew text annotation: AUD 0.14–0.32 per unit. Supply is highly constrained relative to demand from Israeli fintech and government AI projects. For Hebrew-specific annotation, see our Hebrew data annotation service.
- Turkish text annotation: AUD 0.10–0.25 per unit. Supply is more available than Arabic dialects or Hebrew, but agglutinative morphology makes Turkish annotation slower and therefore more expensive per-unit than the per-sentence count suggests.
For Arabic annotation specifically, the premium is justified by what you get wrong if you cut corners. An MSA-trained annotator on Khaleeji dialect data will mislabel sentiment, miss idiomatic meaning, and fail on dialect-specific vocabulary in ways that are invisible until you run your model in production on real users. For the data discipline required for dialect-specific annotation, see our native speaker annotator service and the Arabic sentiment analysis annotation guide.
LiDAR, 3D, and Video: The Premium Tier
LiDAR and 3D annotation is the highest per-unit cost segment in production annotation, combining specialist tool skills with the slowest annotation speed of any task type.
- LiDAR cuboid annotation, simple scenes (warehouse, parking): AUD 0.80–2.50 per frame. Low object density, stable viewpoints, good sensor coverage.
- LiDAR cuboid annotation, ADAS driving scenes: AUD 4.00–12.00 per frame. High object density, weather degradation, partial occlusion, multi-object tracking across frames.
- Multi-sensor fusion annotation (LiDAR + camera, cuboid + 2D box alignment): AUD 8.00–20.00 per frame. The correlation task between sensor modalities is the most time-intensive per-frame operation in AV annotation.
- Video annotation (action recognition, tracking, keypoint across frames): AUD 2.00–8.00 per minute of video for action and tracking tasks. Sport performance AI annotation using pose estimation across broadcast footage runs AUD 4.00–10.00 per minute.
QA Costs, Volume Tiers, and What Quotes Leave Out
The gap between a vendor quote and the total project cost typically lives in four categories:
- Guideline development and pilot iteration. Production-quality annotation guidelines require a pilot phase of 200–500 items, IAA measurement, guideline revision, and re-pilot before full production begins. This cycle typically costs AUD 2,000–8,000 depending on task complexity and is often excluded from quotes that assume guidelines are provided by the client.
- QA overhead. Automated format checking and single-pass sampling adds 10–15% to annotation cost. Dual annotation with full-batch adjudication — the standard for any regulatory-pathway or high-stakes deployment — adds 55–70% to the base annotation cost. Quotes that lead with per-unit rates rarely include QA overhead transparently.
- Data preparation. PII scrubbing, pseudonymisation, format conversion, and data validation before annotation begins is real labour. Depending on data source quality, expect AUD 1,000–5,000 per project in data preparation overhead that does not show up in per-unit rates.
- Rework cycles. Annotation projects that do not include pilot-phase IAA measurement almost always include rework. Rework on 10% of a 100,000-item project at median task rates is a five-figure cost that disappears from early quotes and reappears in change orders.
Volume discounts are real but segmented. Thresholds where meaningful per-unit reductions typically activate: 50,000+ items (5–15% reduction), 200,000+ items (15–25% reduction), 1,000,000+ items (25–40% reduction). Simple, homogeneous tasks see the largest discounts at high volume. Specialist credentialled tasks retain higher per-unit floors even at scale because annotator scarcity cannot be overcome by volume alone.
The structural implication is straightforward: a vendor quoting AUD 0.06 per image for a task that genuinely requires dual annotation, pilot IAA measurement, and data preparation is either quoting cheap crowdsourced work (with corresponding quality) or will deliver the same quote with change order additions that bring the true cost to AUD 0.10–0.15 per image. Understanding what your project actually needs before comparing quotes is the prerequisite to any meaningful vendor evaluation. For a framework on writing annotation requirements that make vendor quotes comparable, see our annotation guidelines writing guide.
Related Reading
- → Cohen's kappa and IAA metrics guide — measuring quality before it costs you in rework
- → RLHF data collection guide — preference dataset design for LLM training
- → How to choose a data annotation company — vendor selection beyond the per-unit quote
- → AI Taggers pricing tiers
- → Arabic data labelling service
FAQ
What does bounding box annotation cost per image in 2026?
Simple bounding box annotation on consumer-goods or product images with five or fewer objects runs AUD 0.04–0.12 per image from quality vendors. ADAS driving scenes with 20–50 mixed objects and partial occlusion run AUD 1.20–4.00 per image. Medical imaging segmentation with specialist review sits at AUD 3.00–12.00 per image. The spread reflects annotator credential requirements and QA discipline, not vendor margin.
How much does Arabic annotation cost compared to English?
MSA annotation runs approximately 1.5–2× English rates. Dialect-specific Arabic — Khaleeji, Egyptian, Levantine — runs 2–3× English. Emirati Arabic conversational annotation runs AUD 10–30 per dialogue. Arabic RLHF preference pairs run AUD 30–80 per comparison. These premiums reflect genuine annotator scarcity, not arbitrary pricing.
What does RLHF preference annotation cost in 2026?
General-purpose English RLHF preference comparisons run AUD 15–50 per task. Domain-specialised comparisons (legal, medical, finance) run AUD 40–120 per comparison. SFT instruction-response pairs run AUD 8–25 for general English and AUD 25–75 for domain-specialist pairs. Rates are up 20–35% from 2025 due to evaluator supply constraints.
Why is medical annotation so much more expensive?
Medical annotation requires credentialled annotators — board-certified radiologists, fellowship-trained specialists — whose supply is constrained by professional qualification, not annotation market dynamics. FDA 21 CFR Part 11 compliance adds audit trail documentation and access control overhead. Dual-reader adjudication is standard for regulatory-pathway datasets. These are structural costs, not premiums that vendor negotiation can eliminate.
At what volume do annotation discounts meaningfully reduce cost?
Volume discounts typically activate at 50,000+ items (5–15% reduction), 200,000+ items (15–25%), and 1,000,000+ items (25–40%). Simple tasks see the largest proportional discounts. Specialist credentialled tasks retain higher per-unit floors regardless of volume — the annotator supply constraint sets a floor that volume cannot fully overcome.
What costs do vendor quotes typically leave out?
The most commonly excluded costs: guideline development and pilot iteration (AUD 2,000–8,000), QA overhead (10–70% on top of per-unit annotation cost depending on QA discipline), data preparation including pseudonymisation and format conversion, and rework cycles from undetected guideline ambiguity. A quote covering only the per-unit rate will understate total project cost by 40–80% on most real projects.
Get a Transparent Quote for Your Annotation Project
Per-unit rates, QA overhead, and guideline development included in scope. No surprise change orders.
Neel Bennett
AI Annotation Specialist at AI Taggers
Neel has over 8 years of experience in AI training data and machine learning operations. He specializes in helping enterprises build high-quality datasets for computer vision and NLP applications across healthcare, automotive, and retail industries.
Connect on LinkedIn