i6_core.features.extraction

class i6_core.features.extraction.FeatureExtractionJob(*args, **kwargs)

Runs feature extraction of a given corpus into cache files

The cache files can be accessed as bundle Path (out_feature_bundle) or as MultiOutputPath (out_feature_path)

Parameters:
  • crp (rasr.crp.CommonRasrParameters) – common RASR parameters

  • feature_flow (rasr.flow.FlowNetwork) – feature flow for feature foraging

  • port_name_mapping (dict[str,str]) – mapping between output ports (key) and name of the features (value)

  • one_dimensional_outputs (set[str]|None) – set of output ports with one dimensional features

  • job_name (str) – name used in sisyphus visualization and job folder name

  • rtf (float) – real-time-factor of the feature-extraction

  • mem (int) – memory required for the job

  • parallel (int) – maximum number of parallely running tasks

  • indirect_write (bool) – if true will write to temporary directory first before copying to output folder

  • extra_config (rasr.config.RasrConfig|None) – additional RASR config merged into the final config

  • extra_post_config (rasr.config.RasrConfig|None) – additional RASR config that will not be part of the hash

cleanup_before_run(cmd, retry, task_id, *args)
classmethod create_config(crp, feature_flow, extra_config, extra_post_config, **kwargs)
Parameters:
Returns:

config, post_config

Return type:

(rasr.config.RasrConfig, rasr.config.RasrConfig)

create_files()
classmethod hash(kwargs)
Parameters:

parsed_args (dict[str]) –

Returns:

hash for job given the arguments

Return type:

str

run(task_id)
tasks()
Returns:

yields Task’s

Return type:

list[sisyphus.task.Task]