i6_core.lib.corpus

Helper functions and classes for loading and writing Bliss XML corpora.

class i6_core.lib.corpus.Corpus

This class represents a corpus in the Bliss format. It is also used to represent subcorpora when the parent_corpus attribute is set. Corpora with include statements can be read but are written back as a single file.

add_recording(recording: Recording)
add_speaker(speaker: Speaker)
add_subcorpus(corpus: Corpus)
all_recordings() → Iterable[Recording]
all_speakers() → Iterable[Speaker]
dump(path: str)
Parameters:

path – target .xml or .xml.gz path

filter_segments(filter_function: Callable[[Corpus, Recording, Segment], bool])

Filter all segments (including those in subcorpora) using filter_function.

Parameters:

filter_function – takes the arguments corpus, recording and segment, and returns True if the segment should be kept
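
A filter function for this method might look like the following minimal sketch. It assumes that segments expose numeric `start` and `end` attributes, which is not stated above. It uses `SimpleNamespace` stand-ins rather than real `Corpus`, `Recording` and `Segment` objects so that it runs without i6_core installed. The name `keep_min_duration` and the 1.0 second threshold are hypothetical.

```python
from types import SimpleNamespace


def keep_min_duration(corpus, recording, segment, min_seconds=1.0):
    """Return True if the segment should be kept (same argument order as filter_function)."""
    # Assumption: segments carry `start` and `end` times in seconds.
    return (segment.end - segment.start) >= min_seconds


# Stand-in objects for illustration only; real code would pass the function
# to Corpus.filter_segments instead of calling it by hand.
corpus = SimpleNamespace(name="demo")
recording = SimpleNamespace(name="rec1")
segments = [
    SimpleNamespace(name="short", start=0.0, end=0.4),
    SimpleNamespace(name="long", start=0.4, end=2.5),
]

kept = [s.name for s in segments if keep_min_duration(corpus, recording, s)]
print(kept)
```

Because the function receives the corpus and recording as well, a filter could also decide based on the recording name or the subcorpus a segment belongs to.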

fullname() → str
load(path: str)
Parameters:

path – corpus .xml or .xml.gz
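
To illustrate what accepting both .xml and .xml.gz paths can involve, here is a standard-library sketch that picks the opener by file extension and lists segment names. It is not the implementation of `Corpus.load`. The helper names `open_maybe_gzip` and `read_segment_names` are hypothetical, and the element and attribute names (`corpus`, `recording`, `segment`, `name`) reflect an assumed Bliss layout.

```python
import gzip
import os
import tempfile
import xml.etree.ElementTree as ET


def open_maybe_gzip(path):
    """Open a plain or gzip-compressed file for binary reading, chosen by extension."""
    if path.endswith(".gz"):
        return gzip.open(path, "rb")
    return open(path, "rb")


def read_segment_names(path):
    """Return (recording name, segment name) pairs from a Bliss-style XML file."""
    with open_maybe_gzip(path) as f:
        root = ET.parse(f).getroot()
    return [
        (rec.get("name"), seg.get("name"))
        for rec in root.iter("recording")
        for seg in rec.iter("segment")
    ]


# Tiny demo corpus; the layout is an assumption for illustration.
DEMO_XML = b"""<?xml version="1.0" encoding="utf-8"?>
<corpus name="demo">
  <recording name="rec1" audio="rec1.wav">
    <segment name="seg1" start="0.0" end="1.5"><orth>hello world</orth></segment>
  </recording>
</corpus>
"""

with tempfile.TemporaryDirectory() as tmp:
    plain_path = os.path.join(tmp, "demo.xml")
    gz_path = os.path.join(tmp, "demo.xml.gz")
    with open(plain_path, "wb") as f:
        f.write(DEMO_XML)
    with gzip.open(gz_path, "wb") as f:
        f.write(DEMO_XML)
    names_plain = read_segment_names(plain_path)
    names_gz = read_segment_names(gz_path)

print(names_plain, names_gz)
```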

remove_recording(recording: Recording)
segments() → Iterable[Segment]
Returns:

an iterator over all segments within the corpus
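
One way such an iterator could cover a nested corpus is a recursive generator. The sketch below uses hypothetical stand-in classes (`MiniCorpus`, `MiniRecording`, `MiniSegment`) rather than the real ones. It also assumes that segments of subcorpora are included and that a corpus yields its own recordings before descending; neither is confirmed above.

```python
from dataclasses import dataclass, field
from typing import Iterator, List


@dataclass
class MiniSegment:
    name: str


@dataclass
class MiniRecording:
    name: str
    segments: List[MiniSegment] = field(default_factory=list)


@dataclass
class MiniCorpus:
    name: str
    recordings: List[MiniRecording] = field(default_factory=list)
    subcorpora: List["MiniCorpus"] = field(default_factory=list)

    def segments(self) -> Iterator[MiniSegment]:
        # Own recordings first, then recurse into subcorpora (assumed order).
        for recording in self.recordings:
            yield from recording.segments
        for sub in self.subcorpora:
            yield from sub.segments()


child = MiniCorpus("child", [MiniRecording("r2", [MiniSegment("c")])])
root = MiniCorpus(
    "root",
    [MiniRecording("r1", [MiniSegment("a"), MiniSegment("b")])],
    [child],
)
names = [seg.name for seg in root.segments()]
print(names)
```

A generator keeps memory use flat for large corpora, since segments are produced one at a time instead of being collected into a list first.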

speaker(speaker_name: Optional[str], default_speaker: Optional[Speaker]) → Speaker
top_level_recordings() → Iterable[Recording]
top_level_speakers() → Iterable[Speaker]
top_level_subcorpora() → Iterable[Corpus]
class i6_core.lib.corpus.CorpusSection
class i6_core.lib.corpus.NamedEntity
class i6_core.lib.corpus.Recording
add_segment(segment: Segment)
dump(out: TextIO, indentation: str = '')
fullname() → str
speaker(speaker_name: Optional[str] = None) → Speaker
class i6_core.lib.corpus.Segment
dump(out: TextIO, indentation: str = '')
fullname() → str
speaker() → Speaker
class i6_core.lib.corpus.Speaker
dump(out: TextIO, indentation: str = '')