Outlier Detection in Big Data
Research output: Chapter in Book/Report/Conference proceeding › Chapter (peer-reviewed) › peer-review
Title of host publication | Encyclopedia of Business Analytics and Optimization |
---|
Date | Published - 1 Apr 2014 |
---|
Pages | 1762-1771 |
---|
Number of pages | 10 |
---|
Publisher | Hershey, PA: IGI Global |
---|
Editors | J. Wang |
---|
Original language | English |
---|
Name | Encyclopedia of Business Analytics and Optimization |
---|
Outlier detection (or anomaly detection) is a fundamental task in data mining. Outliers are data that deviate from the norm and outlier detection is often compared to “finding a needle in a haystack”. However, the outliers may generate high value if they are found, value in terms of cost savings, improved efficiency, compute time savings, fraud reduction and failure prevention. Detection can identify faults before they escalate with potentially catastrophic consequences. Big Data refers to large, dynamic collections of data. These vast and complex data appear problematic for traditional outlier detection methods to process but, Big Data provides considerable opportunity to uncover new outliers and data relationships. This chapter highlights some of the research issues for outlier detection in Big Data and covers the solutions used and research directions taken along with an analysis of some current outlier detection approaches for Big Data applications.
I have been given permission to publish this version of the chapter on Uni of York research database. I have a signed authorisation form from IGI in PDF format giving authorisation.
Discover related content
Find related publications, people, projects, datasets and more using interactive charts.
View graph of relations