Open source and experiment tracking

garrett · July 20, 2020, 1:05pm

Guild goes out of its way to let you track experiments without changing your code. There are a couple of reasons for all the fuss:

It keeps your code independent of the experiment tracking scheme

Okay, there’s just one reason.

Why is this important?

When you embed a dependency in your code, your code carries that dependency wherever it goes. Everyone must satisfy it. If the requirements are minimal, no big deal. But what if they’re not?

Consider what many experiment tracking tools require:

Databases
Distributed file systems
Network connectivity to back-end systems
Authorization credentials

You want to “just run” your code? Not so fast.

This is not only questionable design (it violates the principle of separation of concerns), it also undermines the principles of open source software.

Richard Stallman:

Creativity can be a social contribution, but only in so far as society is free to use the results.

This is never more true than in machine learning. If you can’t run a piece of software because it’s tied to systems you can’t access, you can’t use it.

While your code may be open source, it’s stymied. People are free to study your code, but not to run it.

Where’s the experimentation in that?
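
To make the contrast concrete, here is a minimal sketch of a script that carries no tracking dependency at all. Everything in it is hypothetical (the `train` function, the variable names, the toy loss); the point is only that hyperparameters are plain variables and results are plain printed lines, so anyone with Python can run it. A tool that launches the script from the outside could record those values without the script knowing about it.

```python
import random

# Hyperparameters as plain module-level variables (hypothetical names).
learning_rate = 0.1
epochs = 5
seed = 0


def train(learning_rate, epochs, seed):
    """Toy stand-in for a training loop: returns a fake loss per epoch."""
    rng = random.Random(seed)
    loss = 1.0
    history = []
    for _ in range(epochs):
        # Shrink the loss by a factor tied to the learning rate, plus small noise.
        loss = loss * (1.0 - learning_rate) + rng.uniform(0.0, 0.01)
        history.append(loss)
    return history


if __name__ == "__main__":
    for epoch, loss in enumerate(train(learning_rate, epochs, seed), start=1):
        # Results are reported as plain "key: value" lines on stdout.
        print(f"epoch: {epoch}")
        print(f"loss: {loss:.4f}")
```

Saved as, say, `train.py` (an assumed filename), this runs with `python train.py` and needs no database, credentials, or network access.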
