Adding a Model¶

Step-by-step guide to adding a new model architecture to BirdNET-STM32.

Overview¶

Models are registered via the model registry in birdnet_stm32/models/__init__.py. The registry maps architecture names (e.g., "dscnn") to builder functions that return a compiled Keras model.

Steps¶

1. Create the model module¶

Create birdnet_stm32/models/your_model.py with a builder function:

"""Your model architecture for BirdNET-STM32."""

import tensorflow as tf
from tensorflow.keras import layers, Model

from birdnet_stm32.models.blocks import _make_divisible


def build_your_model(
    input_shape: tuple[int, ...],
    num_classes: int,
    alpha: float = 1.0,
    **kwargs,
) -> Model:
    """Build your model.

    Args:
        input_shape: Input tensor shape (H, W, C).
        num_classes: Number of output classes.
        alpha: Width multiplier for channel counts.

    Returns:
        Compiled Keras model.
    """
    inputs = layers.Input(shape=input_shape)
    # ... your architecture ...
    outputs = layers.Dense(num_classes, activation="sigmoid")(x)
    return Model(inputs, outputs, name="your_model")

2. Register the model¶

In birdnet_stm32/models/__init__.py, register your builder:

from birdnet_stm32.models.your_model import build_your_model

_MODEL_REGISTRY["your_model"] = build_your_model

Now build_model("your_model", ...) will dispatch to your builder.

3. N6 compatibility constraints¶

Your model must satisfy these constraints:

Constraint	Requirement
Channel alignment	All channel counts must be multiples of 8
Supported ops	Only ops in the N6 supported set
Activation memory	Intermediate activations must fit in NPU SRAM
Output activation	Use `sigmoid` (not `softmax`) for multi-label classification
No FakeQuant ops	Model must be standard Keras — no QAT artifacts

Use _make_divisible(channels, 8) from birdnet_stm32/models/blocks.py for all channel counts.

4. Support scaling knobs¶

The training CLI exposes --alpha (width) and --depth_multiplier (depth). Your builder should accept and honor these parameters for consistency with the rest of the pipeline.

5. Add the model profiler check¶

Run birdnet_stm32/models/profiler.py on your model to verify:

from birdnet_stm32.models.profiler import profile_model, check_n6_compatibility

model = build_your_model(input_shape=(64, 256, 1), num_classes=100)
profile_model(model)       # Per-layer MACs, params, activation memory
check_n6_compatibility(model)  # Flags unsupported ops

6. Add tests¶

Create tests/test_your_model.py with:

Shape test: output is [B, num_classes]
Channel alignment test: all conv layers have channels divisible by 8
Scaling test: different alpha values produce different model sizes
Quantization smoke test: model converts to TFLite without errors

7. Update documentation¶

Add a section to docs/dev/model.md
Add the model to the registry table in the API reference
Update docs/index.md if the model is a significant addition

Reference: DS-CNN builder¶

See birdnet_stm32/models/dscnn.py for the reference implementation. Key patterns to follow:

Use _make_divisible() for all channel computations
Support residual connections when stride=1 and channels match
Use ReLU6 activation (better quantization than unbounded ReLU)
Add BatchNormalization after every convolution
Use GlobalAveragePooling2D → Dropout → Dense(sigmoid) as the head