schrodinger.seam.options.pipeline_options module

class schrodinger.seam.options.pipeline_options.SeamOptions(flags: Optional[List[str]] = None, **kwargs)

Bases: apache_beam.options.pipeline_options.PipelineOptions

INFERRED = -1
__init__(flags: Optional[List[str]] = None, **kwargs)

Initialize an options class.

The initializer will traverse all subclasses, add all their argparse arguments and then parse the command line specified by flags or by default the one obtained from sys.argv.

The subclasses of PipelineOptions do not need to redefine __init__.

Args:
flags: An iterable of command line arguments to be used. If not specified

then sys.argv will be used as input for parsing arguments.

**kwargs: Add overrides for arguments passed in flags. For overrides

of arguments, please pass the option names instead of flag names. Option names: These are defined as dest in the parser.add_argument() for each flag. Passing flags like {no_use_public_ips: True}, for which the dest is defined to a different flag name in the parser, would be discarded. Instead, pass the dest of the flag (dest of no_use_public_ips is use_public_ips).

isDefault() bool
getMaxWorkers() int

Get the maximum number of workers to use for the pipeline. If –max-workers is unset, this value is inferred by the context of the job. If not in the context of a job, the runner only scales to 1 worker. If in the context of a job, the number of workers is set to the number of subjobs specified in the HOST or SUBHOST argument. If no subjobs are specified, the number of workers is set to _DEFAULT_MAX_WORKERS.

validate(validator: apache_beam.options.pipeline_options_validator.PipelineOptionsValidator) List[str]
getWorkerTimeoutInSeconds() int