hyrax.verbs.create_splits#
Attributes#
Classes#
Create and persist reproducible dataset splits. |
Module Contents#
- class CreateSplits(config)[source]#
Bases:
hyrax.verbs.verb_registry.VerbCreate and persist reproducible dataset splits.
Overall initialization for all verbs that saves the config
- run()[source]#
Compute dataset splits and write them to a results directory.
Reads the
[split]and[balance]config tables to determine how to partition each data group, then persists.npzindex files and asplit_config.tomlunder a timestamped*-splits-*results directory. Subsequent verbs (train,infer,test) can point at this directory to reuse the same split without recomputing it.- Returns:
The populated dataset providers, keyed by group name.
- Return type:
dict[str, DataProvider]