i6_core.sat.flow

i6_core.sat.flow.add_cmllr_transform(feature_net: FlowNetwork, map_file: Path, transform_dir: Path, matrix_name: str = '$input(corpus-key).matrix') → FlowNetwork
Parameters:
  • feature_net – flow network for feature extraction, e.g. one from i6_core.features

  • map_file – RASR corpus-key-map file, e.g. out_cluster_map_file from SegmentCorpusBySpeakerJob

  • transform_dir – Directory containing the transformation matrix files, e.g. EstimateCMLLRJob.out_transforms

  • matrix_name – Name pattern for the matrix files in transform_dir

Returns:

A new flow network with the CMLLR transformation added
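To make the role of matrix_name concrete, the following is a minimal sketch of how the default pattern could expand to one matrix file per corpus key. It assumes that the `$input(corpus-key)` placeholder is replaced by the key that the map file assigns to the current segment. The helper `resolve_matrix_path`, the directory name `transforms`, and the key `cluster.1` are hypothetical and not part of i6_core; the real substitution happens inside RASR when the flow network runs.

```python
import posixpath


def resolve_matrix_path(transform_dir: str, corpus_key: str,
                        matrix_name: str = "$input(corpus-key).matrix") -> str:
    """Hypothetical helper: mimic the pattern expansion for one corpus key."""
    # Assumption: the placeholder is substituted verbatim by the corpus key.
    file_name = matrix_name.replace("$input(corpus-key)", corpus_key)
    return posixpath.join(transform_dir, file_name)


# Illustrative values only; real keys come from the corpus-key-map file.
example_path = resolve_matrix_path("transforms", "cluster.1")
print(example_path)
```

Under this assumption, every key that appears in map_file needs a matching matrix file in transform_dir.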

i6_core.sat.flow.segment_clustering_flow(feature_flow=None, file='cluster.map.$(TASK)', minframes=0, mincluster=2, maxcluster=100000, threshold=0, _lambda=1, minalpha=0.4, maxalpha=0.6, alpha=-1, amalgamation=0, infile=None, **kwargs)
Parameters:
  • feature_flow – Flow network providing the features used for clustering

  • file – Name of the cluster output file

  • minframes – minimum number of frames in a segment to consider the segment for clustering

  • mincluster – minimum number of clusters

  • maxcluster – maximum number of clusters

  • threshold – BIC threshold, added to the model-complexity-based penalty

  • _lambda – Weight for the model-complexity-based penalty (only lambda=1 corresponds to the definition of BIC; decreasing lambda increases the number of segment clusters)

  • minalpha – Minimum alpha scaling value used within distance scaling optimization

  • maxalpha – Maximum alpha scaling value used within distance scaling optimization

  • alpha – Weighting factor for the correlation-based distance (the default, -1, selects automatic alpha estimation using the minalpha and maxalpha values)

  • amalgamation – Amalgamation rule: 1 = max linkage, 0 = concatenation

  • infile – Name of the cluster input file

Returns:

(FlowNetwork)
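The interplay of _lambda and threshold can be illustrated with the textbook BIC merge test for full-covariance Gaussians. This is a sketch under that assumption, not RASR's implementation: the functions `bic_penalty` and `should_merge` and the numbers used are hypothetical, and the actual clustering node may compute its criterion differently.

```python
import math


def bic_penalty(num_frames: int, dim: int,
                _lambda: float = 1.0, threshold: float = 0.0) -> float:
    """Model-complexity penalty for one full-covariance Gaussian, plus threshold."""
    # Free parameters: mean (dim) plus symmetric covariance (dim * (dim + 1) / 2).
    num_params = dim + dim * (dim + 1) / 2
    return 0.5 * _lambda * num_params * math.log(num_frames) + threshold


def should_merge(likelihood_loss: float, num_frames: int, dim: int,
                 _lambda: float = 1.0, threshold: float = 0.0) -> bool:
    """Merge two clusters if the log-likelihood lost by merging is below the penalty."""
    return likelihood_loss < bic_penalty(num_frames, dim, _lambda, threshold)


# Illustrative numbers only: the same pair of clusters under two lambda values.
merge_at_bic = should_merge(likelihood_loss=8.0, num_frames=100, dim=2, _lambda=1.0)
merge_at_half = should_merge(likelihood_loss=8.0, num_frames=100, dim=2, _lambda=0.5)
print(merge_at_bic, merge_at_half)
```

In this sketch a smaller _lambda lowers the penalty, so fewer cluster pairs pass the merge test and more clusters remain, which matches the parameter description above. A positive threshold shifts the decision in the opposite direction.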