AI Data Augmenter
I am evaluating a new technique to clone or augment existing small datasets. The technique uses bootstrapping and adaptive Gaussian noise to create data that is statistically very similar but has no duplicates. I have named this technique BootsGN. Gemini successfully created an app that utilized this technique, which included a necessary upload window, an option to select the size of the augmented dataset, and an automatic download feature for the dataset. I then thought, why not add the statistical tests at the same time to show the degree of similarity? Now, when the data is generated, so is a report that provides descriptive statistics, Kolmogorov-Smirnov tests, Mann-Whitney tests, t-tests, and chi-square tests, where appropriate. You can access the report in the app or the downloaded text report. It also includes histograms.
It is likely that you will have to make some modifications in the process, so this is why there is an "Ask Gemini" option. For example, if it incorrectly identifies a variable as numerical but it is instead categorical, you would point that out to Gemini and repeat the augmentation.



