How Barcode Localization Works

Note on Terminology: “Localization” in this context refers to finding the physical location (position, bounding box) of a barcode within an image - not language localization (i18n). This is standard computer vision terminology.

This article explains the technical details of how Quagga2 locates and decodes barcodes in images. Understanding this can help you optimize performance and troubleshoot issues.

Overview

Quagga2 uses a two-stage process:

  1. Barcode Locator (blue box in the images below) - Finds regions that look like barcodes
  2. Barcode Decoder (red line in the images below) - Reads the actual barcode data

This approach is based on the paper Locating and decoding EAN-13 barcodes from images captured by digital cameras by Chai and Hock, with adaptations and modifications for running in web browsers.

Stage 1: Barcode Locator

The locator finds patterns that look like barcodes. A barcode is typically characterized by many parallel bars of varying width that share a common orientation.

Step 1: Creating a Binary Image

The first step is converting the color image to binary (black and white). Instead of using a simple threshold (e.g., everything below 127 is black), Quagga2 uses Otsu’s method, which adapts to lighting changes across the image.

Binary Image

Otsu’s method analyzes the image histogram and automatically separates foreground (barcode) from background, even with uneven lighting.
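For illustration, here is a minimal sketch of Otsu's method (not Quagga2's actual implementation, which differs in detail): it scans every candidate threshold and keeps the one that maximizes the between-class variance of the histogram.

```javascript
// Sketch of Otsu's method: choose the threshold that maximizes
// between-class variance of the grayscale histogram.
function otsuThreshold(gray) {
  // Build a 256-bin histogram of pixel intensities (0-255).
  const hist = new Array(256).fill(0);
  for (const v of gray) hist[v]++;

  const total = gray.length;
  let sumAll = 0;
  for (let i = 0; i < 256; i++) sumAll += i * hist[i];

  let sumBack = 0, weightBack = 0;
  let bestVariance = -1, bestThreshold = 0;

  for (let t = 0; t < 256; t++) {
    weightBack += hist[t];           // pixels at or below t (background)
    if (weightBack === 0) continue;
    const weightFore = total - weightBack;
    if (weightFore === 0) break;

    sumBack += t * hist[t];
    const meanBack = sumBack / weightBack;
    const meanFore = (sumAll - sumBack) / weightFore;

    // Between-class variance for this candidate threshold.
    const variance = weightBack * weightFore * (meanBack - meanFore) ** 2;
    if (variance > bestVariance) {
      bestVariance = variance;
      bestThreshold = t;
    }
  }
  return bestThreshold;
}
```

Because the split point comes from the image's own histogram, the same code handles both bright and dim images without hand-tuned constants.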

Step 2: Slicing into a Grid

The binary image is divided into a 20×15 grid (assuming 4:3 aspect ratio). Each cell is analyzed independently to determine if it contains barcode-like patterns.

Step 3: Extract Skeleton

Each cell undergoes skeletonization - reducing bars to their centerline (1px width). This is done through iterative erosion and dilation.

Skeleton Image

The skeleton clearly shows where parallel lines exist, making it easier to identify barcode regions.
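The erosion/dilation loop can be sketched with the classic morphological skeleton, where each pass keeps the pixels that an "opening" (erosion followed by dilation) would remove. This is illustrative only; Quagga2's implementation is an optimized variant of the same idea.

```javascript
// Erode: a pixel survives only if it and its 4-neighbors are set.
function erode(img, w, h) {
  const out = new Uint8Array(w * h);
  for (let y = 1; y < h - 1; y++)
    for (let x = 1; x < w - 1; x++)
      out[y * w + x] =
        img[y * w + x] &
        img[y * w + x - 1] & img[y * w + x + 1] &
        img[(y - 1) * w + x] & img[(y + 1) * w + x];
  return out;
}

// Dilate: a pixel is set if it or any 4-neighbor is set.
function dilate(img, w, h) {
  const out = new Uint8Array(w * h);
  for (let y = 1; y < h - 1; y++)
    for (let x = 1; x < w - 1; x++)
      out[y * w + x] =
        img[y * w + x] |
        img[y * w + x - 1] | img[y * w + x + 1] |
        img[(y - 1) * w + x] | img[(y + 1) * w + x];
  return out;
}

// skeleton = union over n of: erode^n(img) minus open(erode^n(img)),
// where open(x) = dilate(erode(x)).
function skeletonize(img, w, h) {
  let current = Uint8Array.from(img);
  const skel = new Uint8Array(w * h);
  while (current.some(v => v)) {
    const eroded = erode(current, w, h);
    const opened = dilate(eroded, w, h);
    // Pixels removed by opening lie on the centerline: keep them.
    for (let i = 0; i < skel.length; i++)
      skel[i] |= current[i] & ~opened[i] & 1;
    current = eroded;
  }
  return skel;
}
```

Running this on a 3-pixel-wide bar leaves (roughly) its 1-pixel centerline, which is exactly the property the locator exploits.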

Step 4: Component Labeling

Using connected-component labeling, each line in the skeletonized image is separated into individual components. This is done with a fast algorithm based on the paper “A Linear-Time Component-Labeling Algorithm Using Contour Tracing Technique” by Fu Chang et al.

Component Labeling

Each color represents a distinct labeled component (line). Notice how each cell is processed independently.

Here are zoomed examples of two cells:

Component labeling - barcode lines Good: Parallel lines indicate a possible barcode

Component labeling - text Bad: Random components indicate noise/text
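The contour-tracing algorithm itself is fairly involved; the simpler flood-fill sketch below produces the same labeling and illustrates the concept. (Illustrative only; Quagga2 uses the faster Chang et al. algorithm.)

```javascript
// Connected-component labeling via iterative flood fill (8-connectivity).
// Returns a label per pixel (0 = background) and the component count.
function labelComponents(img, w, h) {
  const labels = new Int32Array(w * h);
  let nextLabel = 0;
  for (let i = 0; i < w * h; i++) {
    if (img[i] === 0 || labels[i] !== 0) continue;
    nextLabel++;
    const stack = [i];
    labels[i] = nextLabel;
    while (stack.length) {
      const p = stack.pop();
      const x = p % w, y = (p / w) | 0;
      // Visit all 8 neighbors of the current pixel.
      for (let dy = -1; dy <= 1; dy++) {
        for (let dx = -1; dx <= 1; dx++) {
          const nx = x + dx, ny = y + dy;
          if (nx < 0 || ny < 0 || nx >= w || ny >= h) continue;
          const n = ny * w + nx;
          if (img[n] === 1 && labels[n] === 0) {
            labels[n] = nextLabel;
            stack.push(n);
          }
        }
      }
    }
  }
  return { labels, count: nextLabel };
}
```

Each skeleton line ends up with its own label, so the later steps can treat every line as a separate shape.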

Step 5: Determining Orientation

For each component, Quagga2 calculates its orientation using central image moments. This is a mathematical technique to extract the angle of a shape.

The orientation (θ) is calculated as:

θ = ½ · arctan( 2μ₁₁ / (μ₂₀ − μ₀₂) )

Where μ (mu) are central moments calculated from raw moments (M):

μ₂₀ = M₂₀/M₀₀ − x̄²
μ₀₂ = M₀₂/M₀₀ − ȳ²
μ₁₁ = M₁₁/M₀₀ − x̄·ȳ

The centroid (x̄, ȳ) is calculated from raw moments:

x̄ = M₁₀/M₀₀   ȳ = M₀₁/M₀₀

And the raw moments (M) are computed as:

M_pq = Σ_x Σ_y x^p · y^q · I(x, y)

Where I(x,y) is the pixel value at position (x,y) - either 0 or 1 in a binary image.

Don’t worry if the math looks intimidating - the key insight is that these formulas calculate which direction each line is pointing.
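As a concrete illustration, the moment formulas can be evaluated directly over a component's foreground pixels (a sketch, not Quagga2's internal code):

```javascript
// Orientation of a binary component from its image moments.
// points: array of [x, y] foreground pixel coordinates.
function orientation(points) {
  // Raw moments M_pq = sum of x^p * y^q over all foreground pixels.
  let m00 = 0, m10 = 0, m01 = 0, m11 = 0, m20 = 0, m02 = 0;
  for (const [x, y] of points) {
    m00 += 1; m10 += x; m01 += y;
    m11 += x * y; m20 += x * x; m02 += y * y;
  }
  // Centroid.
  const xBar = m10 / m00, yBar = m01 / m00;
  // Central moments (normalized by m00).
  const mu11 = m11 / m00 - xBar * yBar;
  const mu20 = m20 / m00 - xBar * xBar;
  const mu02 = m02 / m00 - yBar * yBar;
  // Orientation of the principal axis, in radians.
  return 0.5 * Math.atan2(2 * mu11, mu20 - mu02);
}
```

For a perfectly diagonal line of pixels this returns π/4 (45°), and for a horizontal line it returns 0 - the "direction each line is pointing" that the next step compares across components.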

Step 6: Determining Cell Quality

Cells are evaluated based on how parallel their lines are:

  1. Filter out noise: Discard cells with fewer than 2 components, or components smaller than 6 pixels
  2. Cluster angles: Group similar angles together
  3. Select dominant cluster: Pick the cluster with the most members
  4. Quality threshold: Only accept cells where ≥75% of components share the same angle

Cells that pass this test are called patches; each patch carries its position in the grid together with the dominant angle of its components:

Found patches

Yellow boxes show patches that were classified as possible barcode areas. Note some false positives (text regions).
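The quality test above can be sketched as a greedy 1-D clustering over component angles. The thresholds mirror the steps listed; the `clusterTolerance` parameter is a hypothetical stand-in for Quagga2's actual clustering criterion.

```javascript
// Accept a cell as a patch only if the dominant angle cluster
// holds at least 75% of its components.
function isBarcodePatch(angles, clusterTolerance = 0.1) {
  if (angles.length < 2) return false; // fewer than 2 components: noise

  // Greedy clustering: each angle joins the first cluster whose
  // running mean is within the tolerance, else starts a new cluster.
  const clusters = [];
  for (const a of angles) {
    const hit = clusters.find(c => Math.abs(c.sum / c.n - a) <= clusterTolerance);
    if (hit) { hit.sum += a; hit.n++; }
    else clusters.push({ sum: a, n: 1 });
  }

  // Quality threshold: dominant cluster must hold >= 75% of components.
  const dominant = Math.max(...clusters.map(c => c.n));
  return dominant / angles.length >= 0.75;
}
```

A cell full of near-parallel lines passes easily, while text produces scattered angles that never form a dominant cluster.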

Step 7: Finding Connected Cells

Patches are grouped together if they’re neighbors with similar orientation (within 5% angle difference). This is done using recursive component labeling.

Connected patch labels

Each color represents a distinct group. Sometimes adjacent patches have different colors due to angle differences exceeding the 5% threshold.

Step 8: Selecting Groups

Groups are sorted by size (number of patches) and only the largest groups are kept - these are most likely to be actual barcodes.

Remaining patch labels

Small groups and false positives have been filtered out.

Step 9: Create Bounding Box

For each group, a minimum bounding box is calculated:

  1. Calculate average angle of all patches in the group
  2. Rotate all patches by this angle
  3. Find outermost corners (min/max x and y)
  4. Create bounding box
  5. Rotate box back to original orientation

Rotated cells with box Patches rotated to horizontal, bounding box calculated

Bounding box Final bounding box rotated back to match barcode orientation

The bounding box now precisely outlines the barcode, including its rotation and scale. This information is passed to the decoder.
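The five steps above can be sketched as follows (function and parameter names are illustrative, not Quagga2's API):

```javascript
// Rotate a point around the origin by the given angle (radians).
function rotatePoint([x, y], angle) {
  const c = Math.cos(angle), s = Math.sin(angle);
  return [x * c - y * s, x * s + y * c];
}

// corners: [x, y] corner points of all patches in the group.
// avgAngle: average patch angle computed in step 1.
function orientedBoundingBox(corners, avgAngle) {
  // Step 2: rotate all patch corners so the group lies horizontal.
  const flat = corners.map(p => rotatePoint(p, -avgAngle));
  // Step 3: find the outermost coordinates.
  const xs = flat.map(p => p[0]), ys = flat.map(p => p[1]);
  const minX = Math.min(...xs), maxX = Math.max(...xs);
  const minY = Math.min(...ys), maxY = Math.max(...ys);
  // Steps 4-5: build the axis-aligned box, then rotate it back
  // to the barcode's original orientation.
  return [
    [minX, minY], [maxX, minY], [maxX, maxY], [minX, maxY],
  ].map(p => rotatePoint(p, avgAngle));
}
```

Rotating to horizontal first is what makes a simple min/max sweep produce a *tight* box around a tilted barcode.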

Stage 2: Barcode Decoder

With the bounding box and orientation known, the decoder:

  1. Samples pixel intensities along scan lines within the box
  2. Detects transitions from black to white (edges of bars)
  3. Calculates bar widths
  4. Matches patterns against the selected barcode format(s)
  5. Validates checksums
  6. Returns the decoded data
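Steps 2-3 amount to a run-length pass over one scan line. Here is a simplified sketch (the real decoder is considerably more elaborate, with sub-pixel edge detection and per-format pattern tables):

```javascript
// Convert one sampled scan line of intensities into a sequence of
// bar/space widths by detecting dark/light transitions.
function barWidths(scanline, threshold = 127) {
  const widths = [];
  let currentDark = scanline[0] < threshold; // true = inside a dark bar
  let run = 0;
  for (const v of scanline) {
    const dark = v < threshold;
    if (dark === currentDark) {
      run++;                 // still inside the same bar/space
    } else {
      widths.push(run);      // transition: emit the finished run
      currentDark = dark;
      run = 1;
    }
  }
  widths.push(run);          // emit the final run
  return widths;             // alternating bar/space widths
}
```

Pattern matching then compares the *ratios* of these widths against the format's code tables, which is why the decoder tolerates scale differences.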

Why This Approach?

Unlike simpler barcode scanners that require the barcode to be horizontal, centered, and at a fixed scale, Quagga2's localization algorithm is invariant to rotation and scale. It can find and decode barcodes anywhere in the frame, at any angle, and over a wide range of distances.

This makes it much more practical for real-world camera scanning where users can’t always position the camera perfectly.

Performance Considerations

The localization algorithm is computationally intensive. Key factors affecting performance include the input image resolution, the configured patch size, whether half-sampling is enabled, and the number of web workers.

See Optimize Performance for practical tips.
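As a sketch of how these factors map onto Quagga2's configuration (values are illustrative; consult the Quagga2 README for the authoritative option list and defaults):

```javascript
// Illustrative Quagga2 configuration touching the performance knobs above.
Quagga.init({
  inputStream: {
    type: "LiveStream",
    constraints: { width: 640, height: 480 }, // lower resolution = faster locator
  },
  locator: {
    patchSize: "medium",  // grid cell size used by the locator
    halfSample: true,     // locate on a downscaled image (large speedup)
  },
  numOfWorkers: 2,        // web workers for parallel processing
  decoder: {
    readers: ["ean_reader"], // fewer enabled readers = less work per scan
  },
}, function (err) {
  if (err) { console.error(err); return; }
  Quagga.start();
});
```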

Source Code

The localization algorithm is implemented in the locator module of the Quagga2 source repository.

Contributions and improvements are welcome!