aac_datasets.datasets.base module
- class AACDataset(
    raw_data: dict[str, list[Any]] | None = None,
    transform: Callable[[ItemType], Any] | None = None,
    column_names: Iterable[str] | None = None,
    flat_captions: bool = False,
    sr: int | Iterable[int] | None = None,
    verbose: int = 0,
  )
  Bases: Generic[ItemType], Dataset[ItemType]
  Base class for AAC datasets.
- add_online_columns(
    post_columns_fns: dict[str, Callable[[Any, int], Any]],
    allow_replace: bool = False,
  )
  Add several new post-processed columns to this dataset.
- add_raw_column(
    column_name: str,
    column_data: list[Any],
    allow_replace: bool = False,
  )
  Add a new raw column to this dataset.
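The difference between a raw column and a post-processed (online) column can be illustrated with a small self-contained sketch. `ToyColumnStore` and its `at` method are hypothetical stand-ins, not part of aac_datasets. The sketch assumes that a raw column is a stored list with one value per item, and that each function in `post_columns_fns` is called with the dataset and an item index, which is what the `Callable[[Any, int], Any]` signature suggests.

```python
from __future__ import annotations

from typing import Any, Callable


class ToyColumnStore:
    """Hypothetical stand-in for a column-based dataset (not AACDataset itself)."""

    def __init__(self, raw_data: dict[str, list[Any]]) -> None:
        self.raw_data = dict(raw_data)
        self.online_fns: dict[str, Callable[[Any, int], Any]] = {}

    def add_raw_column(
        self, column_name: str, column_data: list[Any], allow_replace: bool = False
    ) -> None:
        # Raw columns are stored as-is: one precomputed value per item.
        if not allow_replace and column_name in self.raw_data:
            raise ValueError(f"Column {column_name!r} already exists.")
        self.raw_data[column_name] = column_data

    def add_online_columns(
        self,
        post_columns_fns: dict[str, Callable[[Any, int], Any]],
        allow_replace: bool = False,
    ) -> None:
        # Online columns store a function; values are computed on access.
        for name, fn in post_columns_fns.items():
            exists = name in self.raw_data or name in self.online_fns
            if exists and not allow_replace:
                raise ValueError(f"Column {name!r} already exists.")
            self.online_fns[name] = fn

    def at(self, index: int, column: str) -> Any:
        # Assumption: the online function receives (dataset, index).
        if column in self.online_fns:
            return self.online_fns[column](self, index)
        return self.raw_data[column][index]


store = ToyColumnStore({"fname": ["a.wav", "b.wav"]})
store.add_raw_column("captions", [["a dog barks"], ["rain falls", "water drips"]])
store.add_online_columns(
    {"num_captions": lambda ds, idx: len(ds.raw_data["captions"][idx])}
)

print(store.at(0, "fname"))         # a.wav
print(store.at(1, "num_captions"))  # 2
```

In this toy model, `allow_replace=False` guards against silently overwriting an existing column, which is the role the flag's name suggests in the methods above.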
- get_item(
    index: int | Iterable[int] | Iterable[bool] | Tensor | slice | None,
    column: str | Iterable[str] | None,
  )
  Get a specific data field.
  Overloaded return types: ItemType, list, dict[str, list].
  Parameters:
    index: int | Iterable[int] | Iterable[bool] | Tensor | slice | None
      The index or slice of the value in range [0, len(dataset)-1].
      Overload variants: int; Iterable[int] | Iterable[bool] | slice | None.
    column: str | Iterable[str] | None
      The name(s) of the column. Can be any value of columns().
      Overload variants: str; Iterable[str] | None = None.
  Returns:
    The field value. Its type depends on the column.
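The overloads above suggest that the shape of the result follows the types of `index` and `column`. The sketch below shows one plausible dispatch on a plain dictionary of columns. `toy_get_item` is a hypothetical helper, not the library's implementation, and it does not handle `Tensor` indices.

```python
from __future__ import annotations

from typing import Any, Iterable


def toy_get_item(
    data: dict[str, list[Any]],
    index: int | slice | Iterable[int] | Iterable[bool] | None = None,
    column: str | Iterable[str] | None = None,
) -> Any:
    """Hypothetical sketch of index/column dispatch over column-wise data."""
    num_rows = len(next(iter(data.values()), []))

    # Normalise the index to a single int or a list of row positions.
    if index is None:
        index = slice(None)
    if isinstance(index, slice):
        rows: Any = list(range(num_rows))[index]
    elif isinstance(index, int):
        rows = index
    else:
        index = list(index)
        if index and all(isinstance(i, bool) for i in index):
            # Boolean mask: keep the positions where the mask is True.
            if len(index) != num_rows:
                raise ValueError("Boolean mask must have one entry per item.")
            rows = [i for i, keep in enumerate(index) if keep]
        else:
            rows = index

    def pick(name: str) -> Any:
        values = data[name]
        if isinstance(rows, int):
            return values[rows]
        return [values[i] for i in rows]

    # A single column name yields a value or a list of values;
    # several names (or None, meaning all columns) yield a dict.
    if isinstance(column, str):
        return pick(column)
    names = list(data) if column is None else list(column)
    return {name: pick(name) for name in names}


data = {
    "fname": ["a.wav", "b.wav", "c.wav"],
    "captions": [["x"], ["y", "z"], ["w"]],
}
print(toy_get_item(data, 0, "fname"))                    # a.wav
print(toy_get_item(data, [True, False, True], "fname"))  # ['a.wav', 'c.wav']
print(toy_get_item(data, 1))  # {'fname': 'b.wav', 'captions': ['y', 'z']}
```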
- static new_empty() -> AACDataset
  Create a new empty dataset.
- preload_online_column(
    column: str,
    allow_replace: bool = False,
  )
  Load all values of a post-processed (online) column into the raw data.
- rename_column(
    old_column: str,
    new_column: str,
    allow_replace: bool = False,
  )
  Rename a column of this dataset.
- to_dict(
    load_online_values: bool = False,
  )
  Convert dataset to dictionary.
- to_hf_dataset(
    load_online_values: bool = False,
  )
- to_list(
    load_online_values: bool = False,
  )
  Convert dataset to list.
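The entries above do not spell out the return types of `to_dict` and `to_list`. Assuming the usual layouts (a dictionary mapping each column name to a list of values, and a list with one dictionary per item), the relation between the two can be sketched as follows. `columns_to_rows` and `rows_to_columns` are hypothetical helpers, and the effect of `load_online_values` is not modelled here.

```python
from __future__ import annotations

from typing import Any


def columns_to_rows(columns: dict[str, list[Any]]) -> list[dict[str, Any]]:
    """Turn a column-wise dict into a list of per-item dicts."""
    names = list(columns)
    return [dict(zip(names, values)) for values in zip(*columns.values())]


def rows_to_columns(rows: list[dict[str, Any]]) -> dict[str, list[Any]]:
    """Inverse conversion, assuming every row has the same keys."""
    names = list(rows[0]) if rows else []
    return {name: [row[name] for row in rows] for name in names}


as_dict = {
    "fname": ["a.wav", "b.wav"],
    "captions": [["a dog barks"], ["rain falls"]],
}
as_list = columns_to_rows(as_dict)
print(as_list[0])  # {'fname': 'a.wav', 'captions': ['a dog barks']}
```

The column-wise layout is convenient for adding or renaming whole columns, while the row-wise layout is convenient for iterating over items.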