A Viam module that implements a Vision service for point cloud classification using machine learning models. This module enables robots to classify 3D scenes and objects by processing point cloud data from depth cameras.
The viam-labs:pointcloud-classification:classifier Vision service captures point cloud data from a camera, preprocesses it (normalization, sampling), and runs inference through an ML model to produce classification results with confidence scores.
- Automatic Model Adaptation: Extracts model requirements from ML model metadata (input shape, feature count, class labels)
- Flexible Feature Support: Works with models expecting XYZ coordinates only, XYZ+RGB, or XYZ+RGB+normals
- Point Cloud Preprocessing: Normalization to unit sphere and configurable sampling methods
- Seamless Integration: Implements the standard Viam Vision service API for easy integration with existing Viam robots
- Python 3.11 or higher
- A Viam robot configuration with:
  - A camera component that supports point cloud capture
  - An ML model service configured with a point cloud classification model
Add this module to your robot configuration through the Viam app:
- Navigate to your robot's config page
- Click the Services tab
- Click Create service
- Search for `viam-labs:pointcloud-classification:classifier`
- Click Add module
- Configure the service attributes (see Configuration section)
- Clone this repository:

  ```bash
  git clone <repository-url>
  cd pointcloud-classification
  ```

- Run the setup script to install dependencies:

  ```bash
  ./setup.sh
  ```

  This will install uv (if needed), create a virtual environment, and sync dependencies.
Add the classifier as a Vision service in your robot configuration:
```json
{
  "name": "my_classifier",
  "type": "vision",
  "namespace": "rdk",
  "model": "viam-labs:pointcloud-classification:classifier",
  "attributes": {
    "mlmodel_name": "my_pointcloud_model",
    "camera_name": "my_depth_camera",
    "sampling_method": "random"
  }
}
```

| Name | Type | Required | Description |
|---|---|---|---|
| `mlmodel_name` | string | Yes | Name of the ML model service to use for inference |
| `camera_name` | string | No | Default camera to use when not specified in method calls |
| `sampling_method` | string | No | Sampling method: `"random"`, `"voxel"`, or `"fps"` (default: `"random"`) |
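For reference, the defaults and validation rules described in the table above can be expressed as a small helper. This is an illustrative sketch, not the module's actual code; the function name `resolve_attributes` is hypothetical:

```python
VALID_SAMPLING = {"random", "voxel", "fps"}

def resolve_attributes(attrs: dict) -> dict:
    """Apply defaults and validate the service attributes from the table above."""
    if "mlmodel_name" not in attrs:
        raise ValueError("mlmodel_name is required")
    sampling = attrs.get("sampling_method", "random")  # default per the table
    if sampling not in VALID_SAMPLING:
        raise ValueError(f"unknown sampling_method: {sampling}")
    return {
        "mlmodel_name": attrs["mlmodel_name"],
        "camera_name": attrs.get("camera_name"),  # optional, may stay None
        "sampling_method": sampling,
    }

cfg = resolve_attributes({"mlmodel_name": "my_pointcloud_model"})
```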
```python
from viam.robot.client import RobotClient
from viam.services.vision import VisionClient

async def classify_scene():
    # Connect to robot
    robot = await RobotClient.at_address('<robot-address>')

    # Get the vision service
    classifier = VisionClient.from_robot(robot, "my_classifier")

    # Get classifications from camera
    classifications = await classifier.get_classifications_from_camera(
        camera_name="my_depth_camera",
        count=5
    )

    # Print results
    for c in classifications:
        print(f"{c.class_name}: {c.confidence:.2%}")

    await robot.close()
```

```python
# Get both image and classifications in one call
result = await classifier.capture_all_from_camera(
    camera_name="my_depth_camera",
    return_image=True,
    return_classifications=True,
    extra={"count": 3}
)

if result.image:
    print(f"Captured image: {result.image.width}x{result.image.height}")

for c in result.classifications:
    print(f"{c.class_name}: {c.confidence:.2%}")
```

Your ML model should meet these requirements:
Input tensor:

- Shape: `[N, F]` or `[1, N, F]`, where:
  - `N` = number of points (fixed)
  - `F` = 3 (XYZ), 6 (XYZ+RGB), or 9 (XYZ+RGB+normals)

Output tensor:

- Shape: `[num_classes]` or `[1, num_classes]`
- Format: classification logits (converted to probabilities via softmax)
Include class labels in the output metadata's extra field:
```json
{
  "labels": "/path/to/labels.txt"
}
```

where labels.txt contains one class name per line.
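As a sanity check, the logits-to-probabilities conversion and the labels-file format described above can be sketched as follows. The label names are hypothetical examples, and this is an illustration rather than the module's actual implementation:

```python
import numpy as np

def softmax(logits: np.ndarray) -> np.ndarray:
    """Convert raw classification logits to probabilities."""
    z = logits - logits.max()  # subtract max for numerical stability
    exp = np.exp(z)
    return exp / exp.sum()

def load_labels(path: str) -> list[str]:
    """Read one class name per line from a labels file."""
    with open(path) as f:
        return [line.strip() for line in f if line.strip()]

# Pair each label with its probability and sort by confidence.
logits = np.array([2.0, 0.5, -1.0])
labels = ["chair", "table", "lamp"]  # hypothetical class names
probs = softmax(logits)
results = sorted(zip(labels, probs), key=lambda r: -r[1])
```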
```bash
./run.sh
```

This runs the module locally using the virtual environment. The module will be available through the Viam SDK module registry.

```bash
./build.sh
```

This uses PyInstaller to create a standalone executable with all dependencies bundled, then packages it as dist/archive.tar.gz for Viam module deployment.

```bash
uv run pytest
```

The module is structured as follows:
- src/main.py: Entry point that runs the module using `Module.run_from_registry()`
- src/models/classifier.py: Core Vision service implementation
  - Implements `get_classifications_from_camera()` for point cloud classification
  - Implements `capture_all_from_camera()` for combined image and classification capture
  - Handles point cloud preprocessing (normalization, sampling, feature extraction)
  - Manages ML model inference and result conversion
- Point Cloud Capture: Retrieves point cloud from camera
- Feature Extraction: Extracts XYZ, RGB (if needed), and normals (if needed)
- Sampling: Resamples to match model's expected point count
- Normalization: Normalizes XYZ coordinates to unit sphere
- Inference: Runs ML model inference
- Post-processing: Converts logits to classifications with confidence scores
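The sampling and normalization steps above can be sketched with NumPy. This is a simplified illustration, not the module's actual implementation; it assumes random sampling to a fixed point count and XYZ-only features:

```python
import numpy as np

def preprocess(points: np.ndarray, n_points: int) -> np.ndarray:
    """Resample XYZ points to a fixed count, then center and scale
    them into the unit sphere."""
    # Random sampling to the model's expected point count (sample with
    # replacement if the cloud has fewer points than the model expects).
    idx = np.random.choice(len(points), n_points, replace=len(points) < n_points)
    sampled = points[idx]

    # Normalize: center at the centroid, then scale so the farthest
    # point lies on the unit sphere.
    centered = sampled - sampled.mean(axis=0)
    scale = np.linalg.norm(centered, axis=1).max()
    return centered / scale if scale > 0 else centered

cloud = np.random.rand(5000, 3)                   # stand-in for a camera point cloud
model_input = preprocess(cloud, 1024)[None, ...]  # add batch dim -> [1, N, 3]
```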
- linux/amd64
- linux/arm64
- darwin/arm64
- windows/amd64
This module provides the following Vision service:
- `viam-labs:pointcloud-classification:classifier`: Vision service for point cloud classification using ML models
See LICENSE file for details.
Contributions are welcome! Please open an issue or pull request.