Configuration files

AutoBernese uses configuration files to integrate with Bernese, have customisable shared settings and defaults, as well as campaign-specific settings.

The active configuration files are read into AutoBernese in the following order, each file overriding the previous, when merging is possible: 1) core functionality and default settings, 2) common user-defined settings and defaults, and 3) campaign-specific settings, created anew or from a template based on a previous campaign. The campaign-specific settings are only read, when working with a campaign.

In addition, AutoBernese also has a template system that enables users to set up re-usable campaign settings for common campaign types and use these templates to create new campaigns.

The different types of configuration are shown in the table below:

Configuration	File	Location	Purpose
Core	`core.yaml`	Inside the package	Integrate with activated Bernese environment and provide core and default settings.
Common	`autobernese.yaml`	AutoBernese runtime directory	Contain common data sources, campaign-creation setup and station sitelog settings.
Campaign	`campaign.yaml`	Campaign directory	Campaign-specific environment, data sources and actions
Campaign template	`<campaign-type>.yaml`	`templates` directory in AutoBernese runtime directory	Have pre-set campaign configuration for campaigns of the same type.

The configuration files are in YAML format, and how AutoBernese use it is explained in the examples given below together with the required and permitted content of each file.

Core configuration¶

The core configuration file makes the integration with Bernese possible, and establishes the location of the AutoBernese runtime directory for log data and files shared by the users of the given Bernese installation. It is built in to the package, and the user is not supposed to edit this file. Its contents are explained here, since the two user-edited configuration types are meant to utilise YAML features to re-use values in this file.

The overall structure of the core configuration file looks like this.

Main sections of the core configuration

bsw_env: {}
bsw_files: {}
env: ''
runtime: {}
campaign: {}

Each section is explained below.

BSW environment variables¶

Inside the bsw_env section are key-value pairs whose values are the names of relevant environment variables set by the LOADGPS.setvar script. The extracted values are made available for re-use in other parts of the configuration.

# Bernese GNSS Software [BSW] environment variables available after 'source'ing
# the shell script `LOADGPS.setvar` in the root of the installation directory.
bsw_env:

  # Installation directory
  C: &C !ENV C

  # Bernese documentation files
  DOC: &DOC !ENV DOC

  # Built-in Bernese program panels
  PAN: &PAN !ENV PAN

  # Global model files used by Bernese programs
  MODEL: &MODEL !ENV MODEL

  # Global configuration file used by Bernese programs
  CONFIG: &CONFIG !ENV CONFIG

  # DATAPOOL directory
  D: &D !ENV D

  # CAMPAIGN directory
  P: &P !ENV P

  # SAVEDISK directory
  S: &S !ENV S

  # User directory
  U: &U !ENV U

  # Temporary directory
  T: &T !ENV T

The section is a YAML mapping whose items have values that are dynamically generated, when the document is parsed. As defined here, the key C for instance, will have the string inside the the environment variable $C (or ${C} in the Perl syntax used by Bernese).

Starting from the right, the environment variable is grabbed by using the YAML tag !ENV before the string C. When parsed as a YAML document, this invokes a special constructor that takes the string C as an argument and returns the value of the environment variable of that name.

The additional YAML syntax &C defines a YAML anchor (also named C) which functions as a variable that can referenced and thus reused later in the document.

Files used by AutoBernese¶

The bsw_files section has entries for Bernese files used by AutoBernese. These file paths are derived by referencing the dynamically-loaded environment variables in the previous section.

# Specific files in the Bernese environment that we need to access
bsw_files:

  # Release-information file
  release_info: !Path [*DOC, RELEASE.TXT]

  # Input file with the list of existing Bernese campaigns
  campaign_menu: !Path [*PAN, MENU_CMP.INP]

Runtime settings¶

The env and runtime sections are settings that AutoBernese use to create and maintain a directory autobernese for its runtime files (listed below). It also defines names of configuration sections that one may override in the common configuration file autobernese.yaml or, for some of them, the campaign-specific configuration file; both described further below.

# We define the environment root directory as the one containing the BSW
# installation. It is assumed to be a directory that each user can can write to.
env: &env !Parent [*C]

# AutoBernese runtime environment
runtime:

  # Root directory for runtime files
  ab: &ab !Path [*env, autobernese]

  # Keyword arguments to the Python logging module
  logging:
    filename: !Path [*ab, autobernese.log]
    # Note the field `{user}` which is replaced at runtime with Python's getpass.getuser()
    format: '%(asctime)s | {user} | %(levelname)s | %(name)s | %(message)s'
    datefmt: '%Y-%m-%d %H:%M:%S'
    style: '%'
    level: DEBUG

  # Campaign-configuration template directory
  campaign_templates: !Path [*ab, templates]

  # Filename for the common configuration
  common_config: !Path [*ab, autobernese.yaml]

  # Sections that can be added to or overridden in the core configuration
  sections_to_override:
  - metadata
  - environment
  - sources
  - tasks
  - station
  - clean
  - troposphere
  - campaign

As there may be more than one Bernese installation on a system, the AutoBernese runtime directory is, currently, set to be next to the activated Bernese installation directory BERN54.

Bernese environmemt

The directory containing the AutoBernese runtime directory and the Bernese installation directory is referred to in this context, variously, as the Bernese environment, the activated environment or runtime environment.

The files in the AutoBernese runtime directory include:

The logfile autobernese.log
The common configuration file autobernese.yaml with possible overrides for the built-in configuration file.
Campaign-configuration templates inside the sub directory templates

Settings overridable¶

The campaign section determines the default directory structure of a new campaign. This section is overridable by the common configuration.

# Default content for the above sections_to_override. These sections can be
# overriden by the user in the general configuration file autobernese.yaml or in
# the campaign configuration (including the template).

campaign:
  directories:
  - name: ATM
  - name: BPE
  - name: GEN
    files:
    - !Path [*CONFIG, OBSERV.SEL]
    - !Path [*PAN, SESSIONS.SES]
  - name: GRD
  - name: OBS
  - name: ORB
  - name: ORX
  - name: OUT
  - name: RAW
  - name: SOL
  - name: STA

Common configuration¶

The common configuration file is for functionality that does not require a Bernese campaign or which is common to all campaigns. To use common settings, put them inside a file called autobernese.yaml in the AutoBernese runtime directory. The settings work as defaults for campaigns, unless they are overridden in the campaign-specific configuration file.

The AutoBernese runtime directory

/path/to/environment
├── autobernese
│   ├── autobernese.log  # Log file with user-separable entries
│   ├── autobernese.yaml # Users can (and should) add this to configure
│   │                    # campaign setup and external-data download
│   │
│   └── (...)
│
├── BERN54
└── (...)

With this file, users may configure the following:

The directory structure of a new Bernese campaign
Directory structure for troposphere-model data
Common sources of external data
Settings for generating a STA-file from station site-log files

Campaign-directory structure for new campaigns¶

Campaign-creation settings in autobernese.yaml
campaign:
  directories:
  - name: ATM
  - name: BPE
  - name: GEN
    files:
    - !Path [*CONFIG, OBSERV.SEL]
    - !Path [*PAN, SESSIONS.SES]
  - name: GRD
  - name: OBS
  - name: ORB
  - name: ORX
  - name: OUT
  - name: RAW
  - name: SOL
  - name: STA

Troposphere data directory structure¶

Troposphere-data settings in autobernese.yaml
troposphere:
  ipath: &TROPOSPHERE_IPATH !Path [*D, VMF3, '1x1_OP_H', '{date.year}']
  opath: &TROPOSPHERE_OPATH !Path [*D, VMF3, '1x1_OP_GRD', '{date.year}']

Data-source specification and local directory structure¶

Data source management settings in autobernese.yaml
sources:

- identifier: EUREF54_20_STA
  description: EUREF STA file from epncb
  url: ftp://epncb.oma.be/pub/station/general/EUREF54_20.STA
  destination: !Path [*D, REF54]
  max_age: 1

- identifier: BSW_MODEL
  description: BSW Model data
  url: ftp://ftp.aiub.unibe.ch/BSWUSER54/MODEL/
  destination: *MODEL
  filenames: ['*']
  max_age: 1

- identifier: BSW_CONFIG
  description: BSW Configuration data
  url: ftp://ftp.aiub.unibe.ch/BSWUSER54/CONFIG/
  destination: *CONFIG
  filenames: ['*']
  max_age: 1

- identifier: BSW_REF
  description: Universal and BSW-specific antenna files
  url: ftp://ftp.aiub.unibe.ch/BSWUSER54/REF/
  destination: !Path [*D, REF54]
  filenames:
  - ANTENNA_I20.PCV
  - I20.ATX
  - FES2014b.BLQ

  - IGB14.CRD
  - IGB14.FIX
  - IGB14.PSD
  - IGB14.VEL

  - IGS14.CRD
  - IGS14.FIX
  - IGS14.PSD
  - IGS14.VEL

  - IGS20.CRD
  - IGS20.FIX
  - IGS20.PSD
  - IGS20.VEL

  - IGB14_R.CRD
  - IGB14_R.VEL

  - IGS14_R.CRD
  - IGS14_R.VEL

  - IGS20_R.CRD
  - IGS20_R.VEL
  max_age: 1

- identifier: VMF3
  description: Troposphere mapping function (VMF3/grid/1x1/operational)
  url: https://vmf.geo.tuwien.ac.at/trop_products/GRID/1x1/VMF3/VMF3_OP/{date.year}/VMF3_{date.year}{date.month:02d}{date.day:02d}.H{hour}
  destination: *TROPOSPHERE_IPATH
  parameters:
    date: !DateRange { beg: 2024-06-01, end: 2024-06-01, extend_end_by: 1 }
    hour: ['00', '06', '12', '18']

Station site-log files to use for STA-file creation¶

Station site-log data to STA-file settings in autobernese.yaml
station:
  sitelogs: !Path [*D, sitelogs, '*.log']
  individually_calibrated: []
  output_sta_file: !Path [*D, station, sitelogs.STA]

Campaign configuration¶

Campaigns differ by their purpose, time interval, data needed and actions required. To run a Bernese campaign, AutoBernese requires a configuration file campaign.yaml in the root of the campaign directory.

Location of a campaign-specific configuration

/path/to/CAMPAIGN54
├── EXAMPLE
│   ├── (...)
│   └── campaign.yaml
│
└── (...)

It stores all information about the campaign, defines what data sources to retrieve and what actions [tasks] to perform. It also allows one to (re-)define environment variables which can be helpful, if a given campaign uses a special directory for the Bernese .PCF-files it needs.

One can build the campaign configuration file manually, e.g. by copying the example in this manual. But AutoBernese is made to automatically create a Bernese campaign from a pre-defined configuration template. AutoBernese comes with an empty, default template to allow basic campaign-creation.

Creating a new campaign configuration¶

A campaign-specific configuration file is created by AutoBernese by combining information gathered at the campaign creation time with a campaign-configuration template. The combination of these metadata and the template go into a campaign-specific configuration file campaign.yaml which is added to root of the newly-created campaign directory.

The gathered data are stored in a metadata section at the top. Its purpose is to specify the YAML anchors used in the template part of the campaign-configuration template file. The metadata are also used when displaying verbose information about the created campaign, when using AutoBernese to list all the campaigns in the environment.

Sections in a campaign-specific configuration file

# Required
metadata: {}

# Optional, but needed to make use of autobernese
environment: {}
tasks: []
sources: []
clean: []

It is also possible to use AutoBernese on existing campaigns by, manually, adding a campaign.yaml file to the campaign-directory root. The quick-start section provides an example of such a file, which is prepared to work with the EXAMPLE campaign that comes bundled with Bernese 5.4 (works for releases 2022-10-23 and 2023-10-16).

For reference, you can se the contents of that file by unfolding the section below.

Unfold to see an AutoBernese configuration for the pre-created EXAMPLE campaign

As it is only the presence of the configuration file that enables AutoBernese to do its work, existing campaigns can be run with AutoBernese by adding a campaign-configuration file to those campaign directories.

Below is an example of the campaign-configuration template campaign.yaml, which, when put into the EXAMPLE campaign directory /path/to/CAMPAIGN54/EXAMPLE enables AutoBernese to use this existing campaign:

EXAMPLE campaign configuration

metadata:
  version: &version 0.5.0
  username: &username USERNAME
  created: &created 2024-04-08
  template: &template example
  campaign: &campaign EXAMPLE
  beg: &beg 2019-02-13
  end: &end 2019-02-14

custom:
  dates: &dates
  - 2019-02-13
  - 2019-02-14
  - 2020-06-27
  - 2020-06-28
  - 2021-04-05
  - 2021-04-06

tasks:

- identifier: PPP
  description: Precise-Point Positioning
  run: RunBPE
  arguments:
    pcf_file: PPP
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: 'PPP_{date.doy:0>3d}0'
    status: 'PPP_{date.doy:0>3d}0.RUN'
    taskid: 'PPP_{date.doy:0>3d}0'
  parameters:
    date: dates

- identifier: RNX2SNX
  description: RINEX to SINEX
  run: RunBPE
  arguments:
    pcf_file: RNX2SNX
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: 'RNX2SNX_{date.doy:0>3d}0'
    status: 'RNX2SNX_{date.doy:0>3d}0.RUN'
    taskid: 'R2S_{date.doy:0>3d}0'
  parameters:
    date: dates

- identifier: BASTST
  description: |-
    Baseline by baseline processing for trouble shooting
  run: RunBPE
  arguments:
    pcf_file: BASTST
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: 'BASTST_{date.doy:0>3d}0'
    status: 'BASTST_{date.doy:0>3d}0.RUN'
    taskid: 'BASTST_{date.doy:0>3d}0'
  parameters:
    date: dates

- identifier: CLKDET
  description: |-
    Zero-difference network solution providing clock corrections
  run: RunBPE
  arguments:
    pcf_file: CLKDET
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: 'CLKDET_{date.doy:0>3d}0'
    status: 'CLKDET_{date.doy:0>3d}0.RUN'
    taskid: 'CLKDET_{date.doy:0>3d}0'
  parameters:
    date: dates

- identifier: IONDET
  description: |-
    Zero-difference network solution providing station-wise, regional, or global
    ionosphere maps and the related biases
  run: RunBPE
  arguments:
    pcf_file: IONDET
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: 'IONDET_{date.doy:0>3d}0'
    status: 'IONDET_{date.doy:0>3d}0.RUN'
    taskid: 'IONDET_{date.doy:0>3d}0'
  parameters:
    date: dates

- identifier: LEOPOD
  description: |-
    Precise Orbit Determination for a Low Earth Orbiting Satellites based on
    on-board GPS-measurements with phase ambiguity resolution
  run: RunBPE
  arguments:
    pcf_file: LEOPOD
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: 'LEOPOD_{date.doy:0>3d}0'
    status: 'LEOPOD_{date.doy:0>3d}0.RUN'
    taskid: 'LEOPOD_{date.doy:0>3d}0'
  parameters:
    date: !DateRange {beg: *beg, end: *end}

- identifier: SLRVAL
  description: |-
    Validation of an existing GNSS or LEO orbit using SLR measurements
  run: RunBPE
  arguments:
    pcf_file: SLRVAL
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: 'SLRVAL_{date.doy:0>3d}0'
    status: 'SLRVAL_{date.doy:0>3d}0.RUN'
    taskid: 'SLRVAL_{date.doy:0>3d}0'
  parameters:
    date: dates

- identifier: ITRF
  description: |-
    Derives a coordinate and linear velocity approximation from the ITRF
    solution containing non-linear PSD corrections
  run: RunBPE
  arguments:
    pcf_file: ITRF
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: 'ITRF_{date.doy:0>3d}0'
    status: 'ITRF_{date.doy:0>3d}0.RUN'
    taskid: 'ITRF_{date.doy:0>3d}0'
  parameters:
    date: dates

sources:

- identifier: ITRF14
  description: IERS data needed for the EXAMPLE campaign
  url: https://datacenter.iers.org/products/reference-systems/terrestrial/itrf/itrf2014/
  filenames:
  - ITRF2014-IGS-TRF.SNX.gz  # 1.4 GB
  - ITRF2014-psd-gnss.dat  # 38 KB
  destination: !Path [*D, ITRF14]

What follows is a description of the contents in each main section of the campaign-specific configuration file.

The `metadata` section¶

The metadata section is there, because the data written here can be referred to in the rest of the document using the YAML anchors [the words starting with &...] prefixing the values for each key in the section. The keys are seen in the task lists, which is explained below.

Campaign metadata

metadata:
  version: &version 0.3.0
  username: &username USERNAME
  created: &created 2024-04-08
  template: &template example
  campaign: &campaign EXAMPLE
  beg: &beg 2019-02-13
  end: &end 2019-02-14

The metadata section contains data about the campaign. The first four items in the mapping contain the context in which the campaign was created. version refers to the AutoBernese version, username is the user that created the campaign, created is the campaign-creation time, and template is the filename without suffix for the campaign template that was used to create the [in this case example.yaml].

The last three items are shortcuts available, primarily, for the tasks in the tasks section. The string value in campaign is the name of the directory in the campaign directory containing the Bernese campaign. The YAML anchor &campaign can be resolved in other places in the YAML document to re-use the string value, in this case EXAMPLE. The same is the case for the items beg and end which denote the beginning and end date [both included] that the campaign covers.

Key idea

This is particularly useful for specifying a BPE task to be run for the campaign, since the campaign name, thanks to the YAML specification, need not be repeated explicitly, but can be written once. Similarly, as illustrated, one may use the beginning and end dates to define the beginning and end date for which a given task needs to be run.

The `environment` section¶

Adding an environment section to a campaign configuration, you are able to set or update environment variables for the given campaign at runtime. The variables are set after the campaign configuration has been loaded, so it affects only actions invoked by AutoBernese afterwards which rely on these variables.

For Bernese, campaign-specific input can be defined in a single location from which values are created/updated at runtime, 1) dynamically, changing directories, before running Bernese, and 2) propagating input data to .PCF files and scripts using them.

This addresses three issues:

Reproducibility

By default, each user has their own Bernese-user environment (the directory $U) with .PCF files and options set for their work. These files require effort to maintain, and different users doing the same calculations, might have diverging settings in their user-environment. This can be a problem.

Settings should come from a single source of truth if one wants to maintain a high-quality workflow and avoid human errors.
Adaptability

The .PCF files in a Bernese-user environment also have variables for easier re-use throughout the file. This makes them easier to maintain and adapt for campaigns of the same type. The values of these variables can be set from existing environment variables, which means that users can avoid changing the .PCF file itself and thus use the same file across different campaigns. This adaptability is great.

But as with the need for reproducibility, having a single source of settings reduces maintenance effort and improves quality assurrance, by needing fewer mechanisms for changing those settings, dynamically.
Separation of concerns

Having all one's critical .PCF files and scripts in one directory, and maybe having some .PCF files use the same OPT-directories for their input, can be good for reproducibility and adaptability.

But the maintenance effort increases, when complexity of the file organisation and functionality dependence increases. The overview may be lost and/or require a specialist to see the consequences of changes in shared .INP files and scripts. Testing also becomes harder with increasing complexity. If there is no testing done to see what these consequences might be, it is not easy to ascertain the quality of the results that come out.

Similarly, a user might share a common workflow with colleagues in other organisations. Having the workflow stored separate from one's own user-environment allows for a much cleaner way to maintain and test this workflow, but also to share it, e.g. using distributed version control systems such as Git or simply copying from a shared FTP server.

The environment section solves these issues in the following ways:

A user should be able to specify what directory to use as for a given campaign in order to separate workflows between different campaign types, thus decreasing maintenaince effort and allow them to, easily, share workflows with other organisations.

The example below shows how to do this, by changing the variable $U to point to another directory.
A user should be able to change or add environment variables as needed for different campaigns, thus having a single source from which to control the variable input in e.g. .PCF files, and in turn make use of these files for different campaigns easier.

The example below shows how a new environment variable AB_PATH_CAMPAIGN_REFDIR is defined and added during runtime. Its value, a path to the given campaign, is thus available to any software component run by AutoBernese using this as input.

Custom environment variables for a campaign

environment:
- variable: U
  # Use a campaign-specific directory with only the needed PCF files and settings
  value: !Path [*U, .., campaign_type]

# Set a variable that can be used e.g. inside the the campaign-specific PCF files.
- variable: AB_PATH_CAMPAIGN_REFDIR
  value: !Path [*D, REF54, *campaign]

Is this safe?

Answer: Yes.

The user may set/change their environment variables before execution of any command, so the mutation of any variable is already possible.

Redirection is a powerful thing and can be done at many levels, and one of the main powers of Bernese GNSS Software is the use of environment variables to define where things are located. AutoBernese is designed to take advantage of this without forcing the user to have to re-structure their entire workflow setup, but, instead, make it easier to use the existing abstractions and centralise control at the campaign-level, not as a requirement, but as a liberating alternative.

The `tasks` section¶

When running a campaign using the ab campaign run command, AutoBernese looks in the tasks section of the configuration file, loads the selected task definitions and runs the tasks they define in the order they are defined.

The following is an example of a single task definition which illustrates the features implemented for our primary usecase: It instructs AutoBernese to run the Bernese Processing Engine for each day in the given campaign. The repetitive pattern is encoded once as a template, and the actual sessions run are based on a set of parameters, here a single one, the date for which to run the BPE:

Example of a task definition

tasks:

- identifier: BPE_PPP
  description: Run the BPE using PPP.PCF
  run: RunBPE
  arguments:
    pcf_file: PPP
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: 'PPP_{date.doy:0>3d}0'
    status: 'PPP_{date.doy:0>3d}0.RUN'
    taskid: PPP
  parameters:
    date: !DateRange {beg: *beg, end: *end}

As seen, the campaign-argument gets its value by referencing the YAML alias *campaign which in turn is defined in the metadata section of the configuration file.

Arguments year, session, sysout and status use the Python format-string syntax to access the four-digit year or day-of-year [doy] of the parameter date.

As mentioned, the dates that date refers to are the ones covering the campaign. The sequence is generated at runtime, when the YAML aliases *beg and *end are sent to a custom constructor in AutoBernese by using the custom YAML tag !DateRange in front of the mapping.

Execution order¶

In general, task definitions can be seen as separate steps, and the tasks they define are sub-steps. The order in which task definitions are written in the configuration file, is the order in which they will be executed. The obvious reason for this is the need for imposing the logical structure of the workflow to the system performing the tasks.

The task-definition tasks are essentially running the same function with different arguments. As mentioned, tasks are executed sequentially, in the order they are defined. A task thus has a fixed thing to run, with only the actual arguments being varied. The argument permutations are generated so that the top-most parameter being used changes the slowest, as it is only going through its possible values once. The next parameter goes through its values twice, and so on to the bottom-most parameter which goes through all its values as many times as there are parameters above it plus one.

In practise this means that one should consider the order of parameters, when writing a task definition.

Example

In the tasks below, the only difference is in the order of entries in the parameter section.

The two examples may do the same thing, but the order of the parameter permutations will be different, and thus the order in which each call to to the underlying API function.

tasks:

- identifier: EXAMPLE_1
  description: Non-existing dummy task
  run: SomeExample
  arguments:
    arg1: '{date.year}'
    arg2: '{station}'
  parameters:
    date: [1990-01-01, 2000-01-01]
    station: [aaaa, bbbb, cccc]

- identifier: EXAMPLE_2
  description: Non-existing dummy task
  run: SomeExample
  arguments:
    arg1: '{date.year}'
    arg2: '{station}'
  parameters:
    station: [aaaa, bbbb, cccc]
    date: [1990-01-01, 2000-01-01]

For the task EXAMPLE_1, the permutations will be:

Permutation order for EXAMPLE_1

[
  {"arg1": "1990", "arg2": "aaaa"},
  {"arg1": "1990", "arg2": "bbbb"},
  {"arg1": "1990", "arg2": "cccc"},
  {"arg1": "2000", "arg2": "aaaa"},
  {"arg1": "2000", "arg2": "bbbb"},
  {"arg1": "2000", "arg2": "cccc"},
]

And for the task EXAMPLE_2, the permutations come in this order:

Permutation order for EXAMPLE_2

[
  {"arg1": "1990", "arg2": "aaaa"},
  {"arg1": "2000", "arg2": "aaaa"},
  {"arg1": "1990", "arg2": "bbbb"},
  {"arg1": "2000", "arg2": "bbbb"},
  {"arg1": "1990", "arg2": "cccc"},
  {"arg1": "2000", "arg2": "cccc"},
]

The ordering in the permutation lists above is just to show, what is held fixed first, second, and so on, when the permutations are build by the system.

If the order of execution for a task definition's tasks does not matter, it is possible to have them run, asynchronously, by adding the keyword asynchronous: True (the default is False).

This option is aimed at computationally inexpensive tasks that mostly perform (rather, wait for) input/output during execution. If using this option to speed up execution, care should be taken, when certain tasks still need to be performed in a determined order.

When set to true, tasks resolved by a given task definition are not guaranteed to run in the order defined by the task definition parameters. However, note that the order of task definitions is always kept the same as defined in the configuration.

Example

Task definitions must be organised, carefully, if the order of certain tasks are important.

For example, it might be tempting to combine all BPE tasks in the following manner:

Example of a task definition

tasks:

- identifier: BPE
description: Run all PCF files
run: RunBPE
arguments:
    pcf_file: '{pcf_file}'
    campaign: *campaign
    year: '{date.year}'
    session: '{date.doy:0>3d}0'
    sysout: '{pcf_file}_{date.doy:0>3d}0'
    status: '{pcf_file}_{date.doy:0>3d}0.RUN'
    taskid: '{pcf_file}'
parameters:
    date: !DateRange {beg: *beg, end: *end}
    pcf_file: [PRODUCE_RESULTS, CONSUME_RESULTS]
asynchronous: True

But, since all tasks in this task definition are executed, asynchronously, the event loop running the tasks does not know the necessary order. To obtain this order, while still running, asynchronously, one would need to have two asynchronous task definitions in the supposed order.

The task definition¶

As seen in the above example, a task definition is a mapping that in a compact way may define a larger set of tasks. It names an API-level function to run, the arguments needed and possible parameters used in the argument mapping. When loaded, the task definition expands to a set of concrete tasks that are run with the run sub-command. So far, tasks are run, sequentially.

A task definition needs a unique identifier, a description, something to run as a task, arguments (if needed) for the task and parameters (if specified in the arguments section) that will be used to create a set of concrete tasks based on the task definition. If no parameters are used, the arguments provided are used to perform a single task. If no arguments are needed, the API-level function is run as a single task with no arguments given to the function.

Requirements:

The identifier must be without whitespace. It is used, when selecting only specific tasks definitions. (The concrete tasks may still be several.)
The description can be long, but short descriptions are easier to read when printed to the terminal.
run needs a key to a mapping, currently, hard-coded in AutoBernese; or a reference to the API-level Python function that you want to run.
dispatch_with needs a value in the same way as the run part. If not present, the arguments are sent directly to the API-level function refered to by run. When defined, the arguments are first sent through the function pointed to by dispatch_with to be pre-processed or to convert their input to match the signature of the API-level function.

The point of the dispatch function is to make independent API-level calls move up to a corresponding task so that the task runner may handle the delegation of ressources at a finer level.
Keyword arguments inside the arguments section contain arguments readable by the function in run or, if used, dispatch_with.
Any keyword arguments using string templates, must have a key of the same name in the parameters section. The value must be a sequence or iterable of values that the parameter may take inside any of the argument string template formats.
asynchronous is not needed, but when set it must be either True or False (the latter being the default). It determines the scheduling of tasks at the task-definition level.

More examples using built-in run and dispatch_with shortcuts

CompressCompress (wildcards)SFTPUploadSitelogs2STAFileBuildVMFCheckVMF

This example uses Compress which points to a function that takes a concrete path. If the filenames are fully specified, the task definition only needs the run key.

Example of a task definition

tasks:

- identifier: GZIP
  description: Compress results using gzip
  run: Compress
  arguments:
    fname: !PathStr [*P, *campaign, '{filename}']
  parameters:
    filename:
    - /OUT/MY.PRC
    - /SOL/MY.SNX
    - /ATM/MY.TRO

If the filenames are given with wildcards, the dispatch_with key must be set to DispatchCompress which is a shortcut to a function that resolves any wildcards in the given paths.

Example of a task definition

tasks:

- identifier: GZIP
  description: Compress results using gzip
  run: Compress
  dispatch_with: DispatchCompress
  arguments:
    fname: !PathStr [*P, *campaign, '{filename}']
  parameters:
    filename:
    - /OUT/*.PRC
    - /SOL/*.SNX
    - /ATM/*.TRO

This example uses SFTPUpload which points to a function that takes the list of files and the single remote directory and transfers them in a batch operation to the server.

The remote directory tree is created, before the files are transferred.

Example of a task definition

tasks:

- identifier: SFTP_TO_COLLABORATION
  description: Upload campaign results to collaboration directory
  run: SFTPUpload
  arguments:
    host: ftp.example.com
    pairs:
    - fname: !PathStr [*P, *campaign, '{filename}']
      remote_dir: !PathStr [Collaboration/GNSS/, '{date.gps_week}']
  parameters:
    date: [*beg]
    filename:
    - /STA/*.CRD
    - /OUT/*.PRC.gz
    - /OUT/*.SUM
    - /SOL/*.SNX.gz
    - /ATM/*.TRO.gz

This example uses Sitelogs2STAFile which builds a .STA-file from existing sitelog-files.

The shortcut Sitelogs2STAFile points to a function that takes the same input as the needed for the stand-alone command (see the command reference).

Example of a task definition

tasks:

- identifier: SITELOGS2STA_DNK
  description: Derive STA file from sitelog files
  run: Sitelogs2STAFile
  arguments:
    sitelogs: !Path [*D, sitelogs, '*dnk*log']
    output_sta_file: !Path [*D, sitelogs_DNK.STA]

This example uses BuildVMF which builds day files with the troposphere-model data. Input files are assumed to exist.

The shortcut BuildVMF points to a function that takes a single DayFileBuilder instance as input and thus requires a dispatch function to convert the input arguments to something that the runner can run.

To learn more of how the input can be generated, see the command-line reference on the stand-alone tools for doing the exact same thing.

Example of a task definition

tasks:

- identifier: BUILD_VMF
  description: Build VMF GRD files from hour files
  run: BuildVMF
  dispatch_with: DispatchVMF
  arguments:
    ipath: *CORE_TROPO_OP_DIR_H
    opath: *CORE_TROPO_OP_DIR_GRD
    beg: *beg
    end: *end

This example uses CheckVMF which checks that day files were correctly concatenated from the input troposphere-model data.

The shortcut CheckVMF points to a function that takes a single DayFileBuilder instance as input and thus requires a dispatch function to convert the input arguments to something that the runner can run.

To learn more of how the input can be generated, see the command-line reference on the stand-alone tools for doing the exact same thing.

Example of a task definition

tasks:

- identifier: CHECK_VMF
  description: Check VMF GRD files from hour files
  run: CheckVMF
  dispatch_with: DispatchVMF
  arguments:
    ipath: *CORE_TROPO_OP_DIR_H
    opath: *CORE_TROPO_OP_DIR_GRD
    beg: *beg
    end: *end

The `sources` section¶

Campaign-specific sources of external data can be specified in a sources section in the campaign configuration.

As illustrated in the EXAMPLE-campaign configuration file, a single FTP source with two files needed for the built-in process-control file ITRF.PCF is included so that the data can be downloaded to the DATAPOOL area to be used in this (and in this case, other) campaigns.

sources:

- identifier: ITRF14
  description: IERS data needed for the EXAMPLE campaign
  url: https://datacenter.iers.org/products/reference-systems/terrestrial/itrf/itrf2014/
  filenames:
  - ITRF2014-IGS-TRF.SNX.gz  # 1.4 GB
  - ITRF2014-psd-gnss.dat  # 38 KB
  destination: !Path [*D, ITRF14]

The `clean` section¶

With the campaign command clean, it is possible to specify directories at the root of the campaign directory which will have their entire content deleted, if the user grants it when prompted.

To use the clean command, add a clean section to the campaign (template) configuration and provide a list of directories that exist at the root of the campaign.

Example of a clean section in a campaign-specific configuration file

clean: [SOL, OUT]

Campaign templates¶

A main feature of AutoBernese is the ability to store common campaign settings as templates and use them to create campaigns of the same type. This will further speed up common workflows, since newly-created Bernese campaigns are ready to be 'run' with AutoBernese. By adding different templates for different campaign types, users avoid having to copy over settings to set environment variables, download data, and run tasks.

This campaign type is defined by a template campaign-configuration file in the templates directory of the AutoBernese runtime directory.

Campaign-configuration template location in the AutoBernese runtime directory

/path/to/environment
├── autobernese
│   ├── (...)
│   └── templates        # Directory for user-created templates,
│       │                # one for each campaign type
│       │
│       └── default.yaml # This default file is used as
│                        # template if none specified.
├── BERN54
└── (...)

The default campaign template is empty and comes with the AutoBernese package, but is selected if none given by the user.

Template maintenance

Over time, templates will need to be changed. Changes to templates will not change the configuration in existing campaigns, and these would need to be edited, manually. However, needing to 're-run' these campaigns anyway, it might be safer to start over and create new campaigns to 're-create' the output.

Notes on Python-string templates in YAML documents¶

This section describes some caveats when using Python string template as values in a YAML document.

Python has multiple ways to work with strings, and many of the string-values given in the AutoBernese configuration files have content that requires soe knowldge about Python's Template strings. In addition, the format of these strings may clash with the YAML syntax, so a few potential problems and solutions are shown below.

Caveats on difference between Template strings and `f` string syntax¶

This example shows the difference in Python syntax between two Python-string types:

f stringString template

import datetime as dt

date = dt.date(2019, 2, 13)
date.day
# 13
f'{date.day}'
# '13'
f'{date.day:3d}'
# ' 13'
f'{date.day:03d}'
# '013'

import datetime as dt

date = dt.date(2019, 2, 13)
date.day
# 13
'{date.day}'.format(date=date)
# '13'
'{date.day:3d}'.format(date=date)
# ' 13'
'{date.day:03d}'.format(date=date)
# '013'

String templates are used in the YAML files that are used as configuration-file format.

Clash with the the YAML syntax¶

There is a subtle drawback in clarity, when the string template begins with a template part, since it clashes with the YAML syntax for a mapping which also begins and ends with { and }, respectively. In these cases, one must, explicitly, put quotes around the string:

Without explicit quotesWith explicit quotes

Example

import yaml
s = """\
some configuration key: a string with {date.day}
another configuration key: {template} is at the beginning of the string.
"""
yaml.safe_load(s)

Output

# Output:
# ParserError: while parsing a block mapping
#   in "<unicode string>", line 1, column 1:
#     some configuration key: a string ...
#     ^
# expected <block end>, but found '<scalar>'
#   in "<unicode string>", line 2, column 37:
#     another configuration key: {prefix} is at the beginning of the string.

Example

import yaml
s = """\
some configuration key: a string with {date.day}
another configuration key: '{template} is at the beginning of the string.'
"""
yaml.safe_load(s)

Output

# Output:
# {
#   'some configuration key': 'a string with {date.day}',
#  'another configuration key': '{template} is at the beginning of the string.'
# }

Configuration files

Core configuration¶

BSW environment variables¶

Files used by AutoBernese¶

Runtime settings¶

Settings overridable¶

Common configuration¶

Campaign-directory structure for new campaigns¶

Troposphere data directory structure¶

Data-source specification and local directory structure¶

Station site-log files to use for STA-file creation¶

Campaign configuration¶

Creating a new campaign configuration¶

The metadata section¶

The environment section¶

The tasks section¶

Execution order¶

The task definition¶

The sources section¶

The clean section¶

Campaign templates¶

Notes on Python-string templates in YAML documents¶

Caveats on difference between Template strings and f string syntax¶

Clash with the the YAML syntax¶

The `metadata` section¶

The `environment` section¶

The `tasks` section¶

The `sources` section¶

The `clean` section¶

Caveats on difference between Template strings and `f` string syntax¶