Machine Learning with Imbalanced Data

In many practical applications, datasets used for training neural networks are highly unbalanced or scarce in volume. Examples can be found in medical imaging, where data from common diseases are significanlty more than that of rare diseases, or in financial transactions with much more healthy transactions than fradulent ones, or in the analysis of network traces with hackers traffic constituting only a small amount of communications. Training deep neural networks with unbalanced data tends to bias the network towards the classes that contain more data.
In our research, we address these shortcomings by data augmentation:

Data augmentation via radial transformation

Radially Transformed Images



Data Augmentation via Generative Adversarial Networks

Synthesized Chest Xray

X-Ray Generation