Filter and Merge Detections

Filter raw bird call detections by confidence threshold and merge them into song segments — without re-running inference. This script is the post-processing companion to detect-birds --no-merge, allowing you to test different confidence thresholds instantly on the same raw detection file.

Usage Synopsis¶

Linux / macOSWindows (PowerShell)Windows (CMD)

python src/evaluation/filter_and_merge_detections.py \
    --raw-detections results \
    --conf 0.25

python src/evaluation/filter_and_merge_detections.py `
    --raw-detections results `
    --conf 0.25

python src/evaluation/filter_and_merge_detections.py ^
    --raw-detections results ^
    --conf 0.25

Parameters¶

Parameter	Type / Default	Required?	Description
`--raw-detections`	`PATH` / `results`	No	Raw detections file or `results/` directory from `detect_birds.py --no-merge`. Default resolves to `raw_detections.json` (follows `results/.active_run`).
`--conf`	`FLOAT` / `0.2`	No	Confidence threshold for filtering (0.0–1.0). All raw detections with `confidence < threshold` are discarded before merging.
`--song-gap`	`FLOAT` / from JSON or `0.1`	No	Maximum temporal gap in seconds between two detections of the same species that are still merged into one song segment. When omitted, the value stored in the JSON's `model_config.song_gap_threshold` is used. Falls back to `0.1` if not present.
`--output-path`	`PATH` / `results`	No	Output directory for result files. Each format writes a fixed descriptive filename inside this directory (see Output Files). The directory is created automatically if it does not exist.
`--output-format`	`CHOICE [...]` / `json-with-algorithm-metadata simplified-csv`	No	One or more output formats (space-separated). Accepts `json-with-algorithm-metadata`, `simplified-csv`, `xeno-canto-annota-json`, `raven-selection-table`, or `all`.

Parameter Deep-Dives¶

`--conf` — Confidence Threshold¶

This threshold is applied to the raw per-clip detections before song merging. Because it operates on raw clips (not merged songs), it is directly comparable to the --conf value you would have passed to detect_birds.py.

Why filter raw clips and not merged songs?

The filter-then-merge order is the same workflow used internally by the app. Filtering merged songs by their average or max confidence gives different — and generally worse — results than filtering the underlying raw clips first and then merging the survivors.

`--song-gap` — Song Gap Threshold¶

Controls how aggressively surviving detections are fused into continuous song segments after filtering. If you leave it unset, the script reuses the song_gap_threshold value from the JSON's model_config block (set when running detect_birds.py), ensuring consistency between detection and post-processing runs.

Value	Effect
omitted	Reads from JSON `model_config.song_gap_threshold`, falls back to `0.1`
`0.1`	Merge detections ≤ 0.1 s apart (recommended default)
`0.5`	Merge detections ≤ 0.5 s apart (moderate)
`2.0`	Merge detections ≤ 2 s apart (aggressive, may over-merge)

`--output-format` — Output Formats¶

Accepts one or more format names separated by spaces. Specify all to write every format in one run. For full schema documentation of each format see Detection Output Formats.

Output Files¶

When run with --output-path results/merged, the following files are written depending on --output-format:

results/merged/with_algorithm_metadata.json   ← json-with-algorithm-metadata
results/merged/simplified.csv                 ← simplified-csv
results/merged/xeno-canto-annota.json         ← xeno-canto-annota-json
results/merged/raven_selection_table.txt      ← raven-selection-table

For the structure of each output file see Detection Output Formats.

Examples¶

Basic filter and merge¶

CommandExpected Output

python src/evaluation/filter_and_merge_detections.py \
    --raw-detections results \
    --conf 0.35 \
    --output-path results/filtered_0.35

Loading detections from: results/raw_detections.json
Loaded 4823 total detections
Confidence range: 0.001 - 0.998
Mean confidence: 0.183

Filtered at conf>=0.35, merged (song_gap=0.1s) -> 23 segments

================================================================================
FILTERING SUMMARY
================================================================================
Confidence threshold: 0.35
Original detections: 4823
Merged segments: 23

All formats + explicit song-gap¶

CommandExpected Output

python src/evaluation/filter_and_merge_detections.py \
    --raw-detections results \
    --conf 0.25 \
    --song-gap 0.1 \
    --output-format all \
    --output-path results/filtered_0.25

Filtered at conf>=0.25, merged (song_gap=0.1s) -> 31 segments
Saved filtered detections to JSON: results/filtered_0.25/with_algorithm_metadata.json
Saved filtered detections to CSV: results/filtered_0.25/simplified.csv
Saved filtered detections to Xeno-Canto Annota-JSON: results/filtered_0.25/xeno-canto-annota.json
Saved filtered detections to Raven Selection Table: results/filtered_0.25/raven_selection_table.txt

Multiple formats¶

CommandExpected Output

python src/evaluation/filter_and_merge_detections.py \
    --raw-detections results \
    --conf 0.4 \
    --output-format json-with-algorithm-metadata simplified-csv \
    --output-path results/filtered_0.4

Filtered at conf>=0.40, merged (song_gap=0.1s) -> 18 segments
Saved filtered detections to JSON: results/filtered_0.4/with_algorithm_metadata.json
Saved filtered detections to CSV: results/filtered_0.4/simplified.csv

Typical Workflow¶

This script sits at Step 3 of the standard evaluation pipeline:

Step 1  detect_birds.py --conf 0.001 --no-merge    →  raw_detections.json
Step 2  f_beta_score_analysis.py                   →  optimal_thresholds.csv
Step 3  filter_and_merge_detections.py --conf 0.35 →  simplified.csv
Step 4  confusion_matrix_analysis.py               →  confusion_matrix/

When to Use This Tool

After F-beta analysis: Apply the optimal threshold found by f_beta_score_analysis.py without re-running inference.
For rapid threshold experiments: Test multiple thresholds on the same raw file in seconds.
To generate CSV files: Produce ground-truth-compatible CSV files for external evaluation tools or confusion_matrix_analysis.py.
For custom thresholds: Apply any threshold that falls outside your original --conf-range.

Filter and Merge Detections

Usage Synopsis¶

Parameters¶

Parameter Deep-Dives¶

--conf — Confidence Threshold¶

--song-gap — Song Gap Threshold¶

--output-format — Output Formats¶

Output Files¶

Examples¶

Basic filter and merge¶

All formats + explicit song-gap¶

Multiple formats¶

Typical Workflow¶

`--conf` — Confidence Threshold¶

`--song-gap` — Song Gap Threshold¶

`--output-format` — Output Formats¶