Skip to main content

Machine Learning Challenges with Imbalanced Data

Abstract:

Application of Machine learning algorithms to some of the real-world problems pertaining to areas, like fraud/intrusion detection, medical diagnosis/monitoring, bio-informatics, text categorization and et al. where data set are not approximately equally distributed suffer from the perspective of reduced performance. The imbalances in class distribution often causes machine learning algorithms to perform poorly on the minority class. The cost minority class mis-classification is often unknown at learning time and can be far too high. A number of technique in data sampling, predominantly over-sampling and under-sampling, are proposed to address issues related to imbalanced data without discussing exactly how or why such methods work or what underlying issues they address. This paper tries to highlight some of the key challenges related to classification of imbalanced data while applying standard classification technique. This discusses some of the prevalent methods related to balancing the imbalanced data sets and their short comes in a hunt for better methods to handle the imbalanced data.  

Awaiting session recording. Will post it soon.

Comments

Popular posts from this blog

Just Buzz... Where is AI?

Speaking to Recode’s Kara Swisher and MSNBC’s Ari Melber, Pichai said AI is “one of the most important things that humanity is working on. It’s more profound than, I don’t know, electricity or fire,” adding that people learned to harness fire for the benefits of humanity, but also needed to overcome its downsides, too. Pichai also said that AI could be used to help solve climate change issues, or to cure cancer. We are seeing some exciting things in the industry, Samsung’s massive 8K TVs apparently use AI to upscale lower resolution images for the big screen. Sony has created a new version of the Aibo robot dog, which this time promises more artificial intelligence. Travelmate’s robot suitcase will use AI to drive around and follow its owner wherever they go.  Kohler has invented Numi, a toilet that has Amazon’s Alexa voice assistant built in etc., But despite all this, it does leave me wondering: is artificial intelligence really what we should be calling this revolution?...

Evolving App Paradigm

Over last two decades we have seen enterprise application landscape changing very rapidly with rapidly evolving technology stacks and changing industry dynamics. It is time for another paradigm shift in terms of how we conceptualize, design, develop, test and maintain our applications to meet volatile business requirements. Some of the prime features of this evolving paradigm Flexible business user centric infrastructure (than IT centric) IT as a service model with emphasis on user self service. Flexible development and deployment infrastructure which is readily available over cloud (no setup time, no high budgets and initial spending) Develop application using modern development languages which help improve the developer Usage of more dynamic meta programming languages Reusable assets, code generators, ... Build once and run anywhere across platforms and devices End-to-End application lifecycle integration through ALM and DevOps Improve communication and collaborat...

Augmented Reality SDK - Comparison

Uh very hectic day... do not wanted to drop on "Learn something interesting today". Wanted to find out options for Augmented Reality development for new projects planning to kick off tomorrow. Here is what I found... what I think a very rare and interesting comparison on Augmented Reality SDKs. A feast for AR aspirant. Enjoy!!! http://socialcompare.com/en/comparison/augmented-reality-sdks