library.phases.phases_implementation.dataset.dataset module

class library.phases.phases_implementation.dataset.dataset.Dataset(dataset_path: str, model_task: str, random_state: int = 99)[source]

Bases: object

Creates the dataframe, provides information about it, and splits and encodes the data.

eliminate_variables(variables_to_eliminate: list[str], after_split: bool = False)[source]

library.phases.phases_implementation.dataset.dataset.create_split_strategy(dataset, is_time_series: bool = False)[source]

Factory function that creates the appropriate split strategy for the dataset, depending on whether it is a time series.

Parameters:
  • dataset (Dataset) – The dataset to split

  • is_time_series (bool) – Whether the dataset is a time series

Returns:

The appropriate split strategy

Return type:

Split
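
The factory behaviour described above can be illustrated with a minimal, self-contained sketch. It does not reproduce the library's implementation: the Split base class shown here, the ShuffledSplit and ChronologicalSplit subclasses, and the split() method with its test_fraction parameter are all hypothetical names chosen for illustration. Only the create_split_strategy signature and the Split return type come from the documentation above.

```python
import random
from abc import ABC, abstractmethod


class Split(ABC):
    """Hypothetical base class; the real Split interface is not documented here."""

    def __init__(self, dataset):
        self.dataset = dataset

    @abstractmethod
    def split(self, rows: list, test_fraction: float = 0.2) -> tuple[list, list]:
        """Return (train_rows, test_rows)."""


class ShuffledSplit(Split):
    """Hypothetical strategy: shuffle rows reproducibly, then cut."""

    def split(self, rows, test_fraction=0.2):
        # Assumes the dataset may expose random_state; falls back to 99.
        rng = random.Random(getattr(self.dataset, "random_state", 99))
        shuffled = list(rows)
        rng.shuffle(shuffled)
        n_test = int(len(shuffled) * test_fraction)
        cut = len(shuffled) - n_test
        return shuffled[:cut], shuffled[cut:]


class ChronologicalSplit(Split):
    """Hypothetical strategy: keep row order so the test set is the most recent data."""

    def split(self, rows, test_fraction=0.2):
        n_test = int(len(rows) * test_fraction)
        cut = len(rows) - n_test
        return list(rows[:cut]), list(rows[cut:])


def create_split_strategy(dataset, is_time_series: bool = False) -> Split:
    # Time series must not be shuffled, otherwise future rows leak into training.
    if is_time_series:
        return ChronologicalSplit(dataset)
    return ShuffledSplit(dataset)
```

Under these assumptions, a caller would select the strategy once and then use it through the common Split interface, for example create_split_strategy(dataset, is_time_series=True).split(rows). Keeping the choice inside a factory means calling code does not need to know which concrete strategy it received.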