Loading Dataset

SDGnE provides a demo dataset that helps you get started. You can load the demo dataset as below.

from sdgne.demodata.demodataset import download_demodata

dataset = download_demodata()

The demo dataset contains 25 columns among which we include a class column. The class column helps us identify the minority class, for which we would like to generate synthetic data.

Below, we show a few columns from the dataset.

y_am_pef
tempin
humidin
.
.
class

0.264

0.671

0.423

.

.

1

0.475

0.767

0.557

.

.

1

0.39

0.847

0.56

.

.

0

SDGnE requires all values to be scaled using min-max scaling to generate synthetic data.

Last updated