Run log
Along with tracking the progress and status of the execution of the pipeline, run log also keeps a track of parameters, experiment tracking metrics, data flowing through the pipeline and any reproducibility metrics emitted by the tasks of the pipeline.
Please refer here for detailed information about run log.
buffered¶
Stores all the run log in-memory. The run log is not persisted and destroyed immediately after the execution is complete.
Parallel execution
buffered
run log stores suffers from race conditions when two tasks
need to update status concurrently.
Configuration¶
file-system¶
Stores the run log as a json
file in the file-system accessible by all the steps
of the pipeline.
Parallel execution
file-system
based run log stores suffers from race conditions when two tasks
need to update status concurrently. Use chunked
version to avoid this behavior
or disable parallelism.
Configuration¶
Example¶
Assumed to be present at examples/configs/fs-run_log.yaml
The configuration can be provided dynamically by setting the environment variable
runnable_CONFIGURATION_FILE
.
Executing the pipeline with:
runnable_CONFIGURATION_FILE=examples/configs/fs-run_log.yaml python examples/concepts/simple.py
The structure of the run log is detailed in concepts.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 |
|
chunked-fs¶
Chunked file system is similar to the file-system
but stores concents of the run log
that have concurrency blocks in separate files.
Configuration¶
Assumed to be present at examples/configs/chunked-fs-run_log.yaml
The configuration can be provided dynamically by setting the environment variable
runnable_CONFIGURATION_FILE
.
Executing the pipeline with:
runnable_CONFIGURATION_FILE=examples/configs/chunked-fs-run_log.yaml python examples/concepts/simple.py
The structure of the run log is detailed in concepts.
Stores only the metadata of the run log. The contents of this are safe for concurrent executions.
Contains only the information of the single step simple
.
The name of the file follows the pattern:
StepLog-<Step name>-<timestamp>.json
. The timestamp allows runnable to infer
the order of execution of the steps.
All the run logs are stored in .run_log_store with the directory name being the run_id
.
Instead of storing a single json
file, the contents are stored in the folder
by the name of the `run_id
.