i6_core.lib.corpus
¶
Helper functions and classes for Bliss xml corpus loading and writing
- class i6_core.lib.corpus.Corpus¶
This class represents a corpus in the Bliss format. It is also used to represent subcorpora when the parent_corpus attribute is set. Corpora with include statements can be read but are written back as a single file.
- dump(path: str)¶
- Parameters:
path – target .xml or .xml.gz path
- filter_segments(filter_function: Callable[[Corpus, Recording, Segment], bool])¶
filter all segments (including in subcorpora) using filter_function :param filter_function: takes arguments corpus, recording and segment, returns True if segment should be kept
- fullname() str ¶
- load(path: str)¶
- Parameters:
path – corpus .xml or .xml.gz
- class i6_core.lib.corpus.CorpusSection¶
- class i6_core.lib.corpus.NamedEntity¶