Pipeline Configuration: Mean Volume Backscattering Strength on AWS

Pipeline Configuration: Mean Volume Backscattering Strength on AWS#

In this section, we will provide you with the pipeline configuration that we’ll be using for our MVBS processing. The configuration is presented in YAML format, which is a structured and human-readable way to define settings for data processing.

Here’s the configuration we’ll be using:

active_recipe: standard 
use_local_dask: true
n_workers: 3
pipeline:
- recipe_name: standard 
  stages: 
  - name: echodataflow_open_raw 
    module: echodataflow.stages.subflows.open_raw 
    options: 
      save_raw_file: true
      use_raw_offline: true 
      use_offline: true 
  - name: echodataflow_combine_echodata
    module: echodataflow.stages.subflows.combine_echodata
    options:
      use_offline: true
  - name: echodataflow_compute_Sv
    module: echodataflow.stages.subflows.compute_Sv
    options:
      use_offline: true
  - name: echodataflow_compute_MVBS
    module: echodataflow.stages.subflows.compute_MVBS
    options:
      use_offline: true
    external_params:
      range_meter_bin: 20 
      ping_time_bin: 20S

Note: For a more comprehensive understanding of each option and its functionality, you can refer to the Pipeline documentation.

Keep in mind that in this example, we’ll be setting up a local Dask Cluster with 3 workers for parallel processing. This configuration will enable us to efficiently process our data for MVBS analysis. To turn it off, toggle use_local_dask to false.

Feel free to explore and modify the configuration to understand better.