TY - CHAP
T1 - Outlier Detection in Big Data
AU - Hodge, Victoria J.
A2 - Wang, J.
N1 - I have been given permission to publish this version of the chapter on Uni of York research database. I have a signed authorisation form from IGI in PDF format giving authorisation.
PY - 2014/4/1
Y1 - 2014/4/1
N2 - Outlier detection (or anomaly detection) is a fundamental task in data mining. Outliers are data that deviate from the norm, and outlier detection is often compared to “finding a needle in a haystack”. However, outliers may generate high value if they are found: value in terms of cost savings, improved efficiency, compute time savings, fraud reduction and failure prevention. Detection can identify faults before they escalate with potentially catastrophic consequences. Big Data refers to large, dynamic collections of data. These vast and complex data appear problematic for traditional outlier detection methods to process, but Big Data provides considerable opportunity to uncover new outliers and data relationships. This chapter highlights some of the research issues for outlier detection in Big Data and covers the solutions used and research directions taken, along with an analysis of some current outlier detection approaches for Big Data applications.
U2 - 10.4018/978-1-4666-5202-6
DO - 10.4018/978-1-4666-5202-6
M3 - Chapter (peer-reviewed)
T3 - Encyclopedia of Business Analytics and Optimization
SP - 1762
EP - 1771
BT - Encyclopedia of Business Analytics and Optimization
PB - Hershey, PA: IGI Global
ER -