Using processed/intermediate data from previous model runs for the SDR model

lukezw · October 1, 2024, 10:50am

Hello everyone

I am running the SDR model over a large area which is taking around 2 hours per model run.

A large part of the processing time is taken up by the warping and alignment of layers, calculation of flow direction and accumulation etc.

The only thing I am changing between model runs is the biophysical table to improve alignment with measured sediment loads. To save time, I wanted to know if the intermediate outputs from previous model runs (e.g. aligned raster layers, filled DEM, flow routings) can be used in re-runs, to avoid re-generating these every time the model is run?

jdouglass · October 1, 2024, 10:17pm

Hello @lukezw ,

Yes, the model should re-use any previously computed results it can. If you only modify the biophysical table, the model should not re-align or re-route the large spatial data; it should pick up with just those tasks that involve the biophysical table.

One thing to keep in mind, though: the safest way to save as much time as possible across many model runs is to:

1. Always use the same workspace,
2. Not modify the input spatial data in between runs, and
3. Not change the suffix across multiple runs.

If you need access to outputs across multiple successive runs, you’ll probably want to copy that output to somewhere outside of the workspace for later reference.
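
A minimal sketch of that copy step in Python, assuming the workspace is an ordinary folder on disk. The function name `archive_run_outputs`, its parameters, and any paths or folder names you pass to it are hypothetical and not part of InVEST; adjust them to match what your workspace actually contains.

```python
import shutil
from pathlib import Path


def archive_run_outputs(workspace_dir, archive_root, label,
                        biophysical_table=None, skip_names=()):
    """Copy a finished run's files to <archive_root>/<label>.

    The workspace itself is left untouched, so the next run can still
    reuse whatever it finds there.
    """
    destination = Path(archive_root) / label
    if destination.exists():
        # Refuse to overwrite an earlier archive that has the same label.
        raise FileExistsError(f"{destination} already exists")

    # skip_names (hypothetical parameter) lists file or folder name
    # patterns that should not be copied, e.g. large intermediate folders.
    shutil.copytree(workspace_dir, destination,
                    ignore=shutil.ignore_patterns(*skip_names))

    if biophysical_table is not None:
        # Keep the table that produced these outputs alongside them.
        shutil.copy2(biophysical_table, destination)
    return destination
```

Calling this after each run with a descriptive `label` (for example, one that names the parameter you changed) keeps a record of every calibration step while the workspace and suffix stay the same.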

Let us know if you have any questions!
James

swolny · October 1, 2024, 10:35pm

@jdouglass I’d like to flag (1) and (3) as being incompatible in practice. If we are using the same Workspace, then we must change the Suffix, else we overwrite our results, which you alluded to with the note to copy the output elsewhere between runs. I can understand why it needs the same Workspace to avoid re-processing overhead, but why would a change in Suffix cause InVEST to re-run everything? Can that be fixed?

~ Stacie

jdouglass · October 1, 2024, 11:37pm

I completely understand, and I’m sorry for the constraint!

As for whether it can be fixed: it is still on our list, and we would very much like to fix it.

As for why the suffix causes an issue, it’s surprisingly difficult to write software that treats two files with different filenames (different suffixes) as the same file, and to do so in such a way that we use file A in place of file B in a set of tasks that executes in a nondeterministic order. So yes, I’m sure this could be fixed, but doing so will require a major rework of the underlying task graph.
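
To illustrate the filename problem, here is a toy sketch in Python. It is not how InVEST or its task graph is implemented; it only assumes that a finished task is recognised by its inputs together with the path of the file it writes, and that the suffix becomes part of that file name. `ToyTaskCache`, `target_path`, and the file names are all hypothetical.

```python
from pathlib import Path


def target_path(workspace, stem, suffix):
    """Build an output path; the suffix (if any) is appended to the stem."""
    name = f"{stem}_{suffix}.tif" if suffix else f"{stem}.tif"
    return str(Path(workspace) / name)


class ToyTaskCache:
    """Remember finished tasks by (inputs, target file path)."""

    def __init__(self):
        self._completed = set()

    def run(self, inputs, target, func):
        key = (tuple(sorted(inputs)), target)
        if key in self._completed:
            return "reused"      # same inputs, same target: skip the work
        func()                   # otherwise do the (expensive) work
        self._completed.add(key)
        return "computed"


cache = ToyTaskCache()
inputs = ["dem.tif"]

def expensive_routing():
    pass  # stand-in for a long-running step

first = cache.run(inputs, target_path("ws", "flow_accum", "a"), expensive_routing)
second = cache.run(inputs, target_path("ws", "flow_accum", "a"), expensive_routing)
third = cache.run(inputs, target_path("ws", "flow_accum", "b"), expensive_routing)
print(first, second, third)  # computed reused computed
```

In this toy, the third call recomputes because the new suffix produces a different target path, even though the inputs are identical. Treating the two paths as the same result is the part that is hard to do safely.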

lukezw · October 7, 2024, 9:25am

Thanks both for the feedback on this.
I am indeed able to get the model to pick up the previous intermediate outputs if using the same workspace, spatial inputs and suffix as pointed out by James.

Obviously this approach requires some care to keep track of what is what when copying out files from previous runs, while keeping the suffix identical between runs, but at least it is doable. If it ever can be done, it would be great to have the option to change the suffix but still use previous intermediate data when changing biophysical parameters, as it would make it easier to keep track of the adjustments being made. But I understand the difficulty of adding this in currently!

system · January 5, 2025, 9:25am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.
