Indicator estimation procedures

Introduction

In this vignette, you will learn:

How to format your data for indicator estimation.
How to discover which indicator functions are available, their data requirements and additional parameters.
How to perform the estimation of multiple indicators at once.

Data frame inputs

Data inputs for the estimation of indicators are data frames. Not all indicators require the same inputs, and the data frames can be of four different kinds, depending on:

Whether they describe stand-level vs. plant-level data
Whether they contain static vs. dynamic data

The following subsections provide examples of the four different data frames. A single indicator may only need data of one kind (e.g. plant-level dynamic data), but when calculating several indicators at once, one may end-up supplying more than one data frame.

Stand-level static data

This data frame contains STAND-level information that DOES NOT change through time, such as topographic features, the area represented, etc.

The actual columns will depend on the indicator to be calculated, but at least a column id_stand (character) should be defined to identify stands. An example of stand-level static data frame is the following:

example_stand_static_input
#>   id_stand  area slope
#> 1   080001  14.3   5.1
#> 2   080002 362.6   3.1
#> 3   080003 319.6   8.9
#> 4   080004 635.2   4.4
#> 5   080005 774.8   8.9
#> 6   080006 166.6   5.3
#> 7   080007 368.0   4.7
#> 8   080008 945.9   4.3
#> 9   080009 548.9   1.4

Stand-level dynamic data

This data frame contains STAND-level information that DOES change through time, such as temperature, precipitation, the stand leaf area index, canopy cover, etc.

The actual columns will depend on the indicator to be calculated, but at least columns id_stand (character) and date (Date) should be defined to identify stands and time points, respectively. An example of stand-level dynamic data frame is the following:

example_stand_dynamic_input
#>    id_stand       date mean_precipitation canopy_cover
#> 1    080001 2025-01-01              353.2          8.4
#> 2    080002 2025-01-01              355.1         63.2
#> 3    080003 2025-01-01              377.0         68.9
#> 4    080004 2025-01-01              836.6         24.8
#> 5    080005 2025-01-01              742.4         45.4
#> 6    080006 2025-01-01              405.7         12.7
#> 7    080007 2025-01-01              103.3         96.3
#> 8    080008 2025-01-01              561.8          1.6
#> 9    080009 2025-01-01              522.1         10.6
#> 10   080001 2025-02-01              383.2         22.5
#> 11   080002 2025-02-01              511.2         85.9
#> 12   080003 2025-02-01              552.0         61.7
#> 13   080004 2025-02-01              813.3         32.9
#> 14   080005 2025-02-01              368.0         50.7
#> 15   080006 2025-02-01              436.1         30.2
#> 16   080007 2025-02-01              867.9         51.0
#> 17   080008 2025-02-01              304.6         85.8
#> 18   080009 2025-02-01              349.1         67.8
#> 19   080001 2025-03-01               46.3         44.1
#> 20   080002 2025-03-01              335.6         33.1
#> 21   080003 2025-03-01              509.6         96.3
#> 22   080004 2025-03-01              824.6         59.1
#> 23   080005 2025-03-01              607.5         76.7
#> 24   080006 2025-03-01              554.8         57.4
#> 25   080007 2025-03-01              710.4         43.8
#> 26   080008 2025-03-01              111.7         17.6
#> 27   080009 2025-03-01              623.2         12.1

Plant-level static data

This data frame contains PLANT-level information that DOES NOT change through time, such as taxonomy, traits, etc.

The actual columns will depend on the indicator to be calculated, but at least a columns id_stand (character) and plant_entity (character) should be defined to identify stands and plants, respectively. An example of plant-level static data frame is the following:

example_plant_static_input
#>    id_stand     plant_entity beautiness
#> 1    080001 Pinus halepensis       0.26
#> 2    080002 Pinus halepensis       0.03
#> 3    080003 Pinus sylvestris       0.28
#> 4    080004 Pinus sylvestris       0.02
#> 5    080005     Quercus ilex       0.52
#> 6    080006     Quercus ilex       0.36
#> 7    080007     Quercus ilex       0.51
#> 8    080008    Quercus suber       0.08
#> 9    080009    Quercus suber       0.98
#> 10   080001      Pinus nigra       0.62
#> 11   080002      Pinus nigra       0.42
#> 12   080005 Pinus halepensis       0.72
#> 13   080006 Pinus halepensis       0.43

Plant-level dynamic data

This data frame contains PLANT-level information that DOES change through time, such as dbh, height, cover, etc.

The actual columns will depend on the indicator to be calculated, but at least a columns id_stand (character), plant_entity (character) and date (Date) should be defined to identify stands and plants, respectively. An example of plant-level dynamic data frame is the following:

example_plant_dynamic_input
#>    id_stand     plant_entity       date state  dbh    h   n
#> 1    080001 Pinus halepensis 2025-01-01  live 18.3 22.3 352
#> 2    080001 Pinus halepensis 2025-01-01  live 36.5 18.8 891
#> 3    080001      Pinus nigra 2025-01-01  live 51.1  5.3 644
#> 4    080001      Pinus nigra 2025-01-01  live 36.9 18.4 543
#> 5    080001      Pinus nigra 2025-01-01  live 13.9 13.3 852
#> 6    080001 Pinus halepensis 2025-02-01  live 49.1 17.1 698
#> 7    080001 Pinus halepensis 2025-02-01  live 24.4  3.2 885
#> 8    080001      Pinus nigra 2025-02-01  live 12.0 10.1 454
#> 9    080001      Pinus nigra 2025-02-01  live 45.3 20.6 312
#> 10   080001      Pinus nigra 2025-02-01  live 35.1 22.5 651
#> 11   080001 Pinus halepensis 2025-03-01  live 49.8 13.6 424
#> 12   080001 Pinus halepensis 2025-03-01  live 22.7  4.6 506
#> 13   080001      Pinus nigra 2025-03-01  live 36.9  6.4  59
#> 14   080001      Pinus nigra 2025-03-01  live  9.1 15.5 445
#> 15   080001      Pinus nigra 2025-03-01  live 41.2  9.6 851
#> 16   080005     Quercus ilex 2025-01-01  live 22.4 18.1 491
#> 17   080005     Quercus ilex 2025-01-01  live  8.3  5.8 565
#> 18   080005 Pinus halepensis 2025-01-01  live 40.9 11.7 433
#> 19   080005     Quercus ilex 2025-02-01  live 44.8 22.7 576
#> 20   080005     Quercus ilex 2025-02-01  live 33.1 13.1 769
#> 21   080005 Pinus halepensis 2025-02-01  live 13.9 21.3 904
#> 22   080005     Quercus ilex 2025-03-01  live 27.4 22.5 118
#> 23   080005     Quercus ilex 2025-03-01  live 19.5 18.6 818
#> 24   080005 Pinus halepensis 2025-03-01  live 50.0 12.0 592
#> 25   080005 Pinus halepensis 2025-03-01  live 11.2 12.5 357
#> 26   080001 Pinus halepensis 2025-01-01   cut 43.9 18.3  21
#> 27   080001 Pinus halepensis 2025-01-01   cut 41.4  3.5 166
#> 28   080001      Pinus nigra 2025-01-01   cut 29.1 18.9   0
#> 29   080001 Pinus halepensis 2025-02-01   cut 49.4  5.8  73
#> 30   080001      Pinus nigra 2025-03-01   cut 13.0 20.0 194
#> 31   080005     Quercus ilex 2025-01-01   cut 17.2 20.6 147
#> 32   080005 Pinus halepensis 2025-01-01   cut 10.0  4.3 200
#> 33   080005     Quercus ilex 2025-02-01   cut 44.1  7.1  22
#> 34   080005 Pinus halepensis 2025-03-01   cut 45.7 11.8  17
#> 35   080001 Pinus halepensis 2025-03-01  dead 26.1 13.3  81
#> 36   080001      Pinus nigra 2025-03-01  dead 36.2 11.0  82
#> 37   080001      Pinus nigra 2025-03-01  dead 49.6  6.9  72
#> 38   080001      Pinus nigra 2025-03-01  dead 40.5 10.9  33
#> 39   080005     Quercus ilex 2025-01-01  dead 25.9  7.7   8
#> 40   080005     Quercus ilex 2025-01-01  dead 18.7 13.4  75
#> 41   080005 Pinus halepensis 2025-01-01  dead 16.7 18.0  97
#> 42   080005 Pinus halepensis 2025-02-01  dead 31.0  4.6  43
#> 43   080005     Quercus ilex 2025-03-01  dead 35.1  2.5  74
#> 44   080005     Quercus ilex 2025-03-01  dead  7.1  7.6  69

Indicators

Indicator definition table

All the information regarding which indicators are included in the package and their requirements can be found in the data table indicator_definition. For each indicator, it includes:

Indicator name
Method of estimation
Interpretation
Output units
Person responsible of its implementation in the package, who is to be contacted in case of issues/doubts.
Variable names required for its estimation (grouped by input data frame)

Variable definition table

The type and units of all input variables can be found in table variable_definition.

Indicator functions

Each indicator is implemented in the package as an internal function, maintained by the responsible person. This functions is not directly accessible to the user, but is called via estimate_indicators() (see below).

Additional parameters

The estimation of indicators may require other information in addition to the input data frame. Table additional_parameters contains the description of additional parameters for those functions that require them. The way these parameters are specified is described below.

Indicator estimation procedure

The general procedure for the estimation of forest indicators follows three main steps, which we illustrate in the following subsections:

Step 1: Choose target indicators and examine their requirements

The first step is to decide which indicators are to be estimated. The list of all indicators is given in table indicator_definition. Additionally, you can examine the names of the indicators in the package that could be estimated with your data by calling function available_indicators():

available_indicators(plant_dynamic_input = example_plant_dynamic_input)
#> [1] "basal_area"        "carbon_stock"      "density_dead_wood"
#> [4] "timber_harvest"

Say we decide to estimate the timber harvest volume (i.e. indicator "timber_harvest") and the density of dead wood (i.e. indicator "density_dead_wood").

We may need to check which data inputs are required, their units, etc. We can find this information using function show_information(), for example:

show_information("timber_harvest")
#> A) DEFINITION
#> 
#>    Name:  timber_harvest 
#>    Estimation:  NA 
#>    Interpretation:  Volume over bark of harvested wood 
#>    Output units:  NA 
#> 
#> 
#> B) DATA INPUTS
#> 
#> B.1) Stand-level static data: <NONE>
#> B.2) Stand-level dynamic data: <NONE>
#> B.3) Plant-level static data: <NONE>
#> B.4) Plant-level dynamic data:
#> 
#>  variable      type   units
#>       dbh   numeric      cm
#>         h   numeric       m
#>         n   numeric ind./ha
#>     state character       -
#>                                                    description
#>                                      diameter at breast height
#>                                                         height
#>                                                        density
#>  state of the cohort, tree, … can be 'live', 'cut', or 'death'
#> 
#> 
#> C) ADDITIONAL PARAMETERS
#> 
#>         parameter default_value
#>          province          <NA>
#>      min_tree_dbh          <NA>
#>      max_tree_dbh          <NA>
#>  targeted_species          <NA>
#>  excluded_species          <NA>
#>                                                                           description
#>                                                       Spanish province (numeric code)
#>             Minimum  tree diameter to be included in the estimation of harvest volume
#>  Maximum  tree diameter to be included in the estimation of density of harvest volume
#>                          Tree species included in the calculation of harvested volume
#>                      Tree species not included in the calculation of harvested volume

Step 2: Assemble inputs and define values for additional parameters

Once the requested format and content of inputs is know, it is the responsibility of the user to build the necessary data frames to be used as inputs. In our case, we will use example_plant_dynamic_input that already contains the necessary data for the estimation of the two indicators.

We will normally need to define additional parameters to fine-tune the estimation of indicators, For that, we define a named list where each element corresponds to a different indicator and, in turn, contains a named list of parameters. In our case:

params <- list(timber_harvest = list(province = 8),
               density_dead_wood = list(max_tree_dbh = 20))

Step 3: Call general estimation function

Once we have our inputs ready, the indicators are estimated by calling function estimate_indicators() as follows:

res <- estimate_indicators(indicators = c("timber_harvest", "density_dead_wood"),
                           plant_dynamic_input = example_plant_dynamic_input,
                           timber_volume_function = forestindicators:::.ifn_volume_forestindicators,
                           additional_params = params)
#> ℹ Checking overall inputs
#> ℹ Checking inputs for 'timber_harvest'.
#> ✔ Checking inputs for 'timber_harvest'. [4ms]
#> 
#> ℹ Checking overall inputs✔ Checking overall inputs [26ms]
#> 
#> ℹ Processing 'timber_harvest'.
#> ℹ Checking inputs for 'density_dead_wood'.
#> ✔ Checking inputs for 'density_dead_wood'. [9ms]
#> 
#> ℹ Processing 'timber_harvest'.✔ Processing 'timber_harvest'. [126ms]
#> 
#> ℹ Processing 'density_dead_wood'.
#> ✔ Processing 'density_dead_wood'. [20ms]

Note that "timber_harvest" requires specifying the function to be used to calculate timber volumes, i.e. timber_volume_function, for which in our case we use a predefined package function.

The result of the estimation is the following:

res
#> # A tibble: 6 × 4
#>   id_stand date       timber_harvest density_dead_wood
#>   <chr>    <date>              <dbl>             <dbl>
#> 1 080001   2025-01-01          51.9                187
#> 2 080001   2025-02-01          29.2                 73
#> 3 080001   2025-03-01          25.6                 NA
#> 4 080005   2025-01-01          34.4                 NA
#> 5 080005   2025-02-01           6.98                22
#> 6 080005   2025-03-01          11.3                 17

Miquel De Cáceres / Núria Aquilué

2025-10-16