I have a Guild model that looks something like this:

```yaml
- include: source_code_config.yml

- model: _check
  extends:
    - source_code_config
  operations:
    _test_segmentation_training:
      steps:
        - run: segmentation:train dryrun=yes num_epochs=1 input_database="x.csv"
          expect:
            - file: experiments/best_model.pt
    _test_segmentation_testing:
      steps:
        - run: segmentation:test input_database="x.csv"
          expect:
            - output: "Testing done."
    _all:
      steps:
        - _test_segmentation_training
        - _test_segmentation_testing
```
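For context, each check operation can also be invoked on its own while debugging the pipeline (a minimal sketch based on the config above; `-y` skips Guild's confirmation prompt):

```shell
# Run a single check operation by itself, without the _all wrapper
guild run _check:_test_segmentation_training -y
```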
I use this for integration testing of my training and test scripts. The `segmentation` model looks something like this:
```yaml
- model: segmentation
  extends:
    - source_code_config
  operations:
    train:
      main: scripts/training/train_segmentation
      flags:
        $include:
          - segmentation_flags
          - train_flags
          - common_flags
      requires:
        - prepared_data
    test:
      main: scripts/training/test_segmentation
      flags:
        batch_size: 1
        $include:
          - test_flags
          - common_flags
      requires:
        - prepared_data
        - trained_model
  resources:
    trained_model:
      sources:
        - operation: train
          select:
            - experiments
            - .guild/attrs/flags
          target-type: copy
          rename:
            - flags training-flags.yml  # See https://github.com/guildai/guildai/blob/0.7.2/examples/upstream-flags/guild.yml
```
Now when I run `guild run segmentation:train` followed by `guild run segmentation:test`, Guild automatically resolves the `trained_model` dependency and finds the latest `segmentation:train` run.
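As an aside, a specific upstream run can also be selected explicitly by passing the resource name as a flag, rather than relying on latest-run resolution (a hedged sketch; `RUN_ID` is a placeholder for an actual `segmentation:train` run ID):

```shell
# Pin the trained_model dependency to a particular train run
guild run segmentation:test trained_model=RUN_ID
```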
If I instead run `guild run _check:_all`, the `_test_segmentation_training` step runs successfully, but the `_test_segmentation_testing` operation is not able to resolve the `trained_model` resource.
Ideally I would be able to run this pipeline as a step in my integration testing.
EDIT: I am using Guild version 0.7.3.