i6_core.g2p.train

class i6_core.g2p.train.TrainG2PModelJob(*args, **kwargs)

Train a G2P model using Sequitur

see https://github.com/sequitur-g2p/sequitur-g2p

Parameters:
  • g2p_lexicon (Path) – G2P lexicon used for training; use BlissLexiconToG2PLexiconJob to generate one from a Bliss lexicon

  • num_ramp_ups (int) – number of global ramp-ups (n-gram-iness)

  • min_iter (int) – minimum iterations per ramp-up

  • max_iter (int) – maximum iterations per ramp-up

  • devel (str) – passed as -d argument, percentage of the training lexicon held out as a validation set

  • size_constrains (str) – passed as -s argument, multigrams must have l1 … l2 left-symbols and r1 … r2 right-symbols

  • extra_args (list[str]) – extra cmd arguments that are passed to the g2p process

  • g2p_path (Optional[Path]) – path to the g2p installation. If None, a global G2P_PATH is used if set; otherwise the default binary path is used.

  • g2p_python (Optional[Path]) – path to the g2p python binary. If None, a global G2P_PYTHON is used if set; otherwise the default python binary is used.
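
To make the role of num_ramp_ups, devel, size_constrains and extra_args more concrete, here is a minimal sketch of how such parameters might be turned into a sequence of Sequitur calls. It is not the job's actual implementation: the helper name sketch_sequitur_commands, the model-N file names, the default values, and the choice to run one initial training pass followed by num_ramp_ups ramp-up passes are all assumptions made for illustration. The --train, --write-model, --model and --ramp-up options follow the Sequitur README; -d and -s are the arguments named in the parameter list above. The iteration limits (min_iter, max_iter) are left out of the sketch.

```python
from typing import List, Optional


def sketch_sequitur_commands(
    g2p_lexicon: str,
    num_ramp_ups: int = 4,            # illustrative default, not taken from the job
    devel: str = "5%",                # illustrative default
    size_constrains: str = "0,1,0,1",  # illustrative default, read as l1,l2,r1,r2
    extra_args: Optional[List[str]] = None,
    g2p_binary: str = "g2p.py",       # hypothetical; the job resolves g2p_path itself
) -> List[List[str]]:
    """Hypothetical helper: build one argv list per training pass."""
    commands = []
    # Assumption: pass 0 trains the initial model, passes 1..num_ramp_ups
    # each ramp up the model written by the previous pass.
    for idx in range(num_ramp_ups + 1):
        argv = [
            g2p_binary,
            "--train", g2p_lexicon,
            "-d", devel,
            "-s", size_constrains,
            "--write-model", f"model-{idx}",
        ]
        if idx > 0:
            argv += ["--model", f"model-{idx - 1}", "--ramp-up"]
        # Extra arguments are appended unchanged at the end.
        argv += list(extra_args or [])
        commands.append(argv)
    return commands


if __name__ == "__main__":
    for argv in sketch_sequitur_commands("train.lex", num_ramp_ups=2):
        print(" ".join(argv))
```

Building the argument lists separately from executing them keeps the sketch easy to inspect; in a real setup the job itself takes care of invoking the g2p process.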

run()
tasks()
Returns:

yields Task objects

Return type:

list[sisyphus.task.Task]