AB - We consider the problem of anomaly detection with a small set of partially labeled anomaly examples and a large-scale unlabeled dataset. This is a common scenario in many important applications. Existing related methods either exclusively fit the limited anomaly examples that typically do not span the entire set of anomalies, or proceed with unsupervised learning from the unlabeled data. We propose here instead a deep reinforcement learning-based approach that enables an end-to-end optimization of the detection of both labeled and unlabeled anomalies. This approach learns the known abnormality by automatically interacting with an anomaly-biased simulation environment, while continuously extending the learned abnormality to novel classes of anomaly (i.e., unknown anomalies) by actively exploring possible anomalies in the unlabeled data. This is achieved by jointly optimizing the exploitation of the small labeled anomaly data and the exploration of the rare unlabeled anomalies. Extensive experiments on 48 real-world datasets show that our model significantly outperforms five state-of-the-art competing methods. AU - Pang, G AU - van den Hengel, A AU - Shen, C AU - Cao, L DA - 2021/08/14 DO - 10.1145/3447548.3467417 EP - 1308 JO - KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining PB - ACM PY - 2021/08/14 SP - 1298 TI - Toward Deep Supervised Anomaly Detection Y1 - 2021/08/14 Y2 - 2024/03/29 ER -