Email: erich.schubert cs.tu-dortmund.de
Phone: 0231/755-7876
Fax: 0231/755-5105
Room-No.: OH14 R334

Consultation hours:
Consultation hours begin in December.
During the semester break: by arrangement only.

About

Professor of Data Mining

No international internships: I do not take international students for internships in the academic years 2019/2020/2021. Emails asking for an internship are likely to remain unanswered.

TU Dortmund students are, of course, welcome to talk to me about theses. The paragraph above applies to internship applications from outside Europe and has become necessary because of the flood of such requests.

Publications

Schubert/Rousseeuw/2018a Erich Schubert and Peter J. Rousseeuw. Faster k-Medoids Clustering: Improving the PAM, CLARA, and CLARANS Algorithms. In arXiv preprint, Vol. 1810.05691, 2019.
Houle/etal/2018a Michael E. Houle and Erich Schubert and Arthur Zimek. On the Correlation Between Local Intrinsic Dimensionality and Outlierness. In Proceedings of the 11th International Conference on Similarity Search and Applications (SISAP), Lima, Peru, pages 177--191, 2018.
Schubert/etal/2018a Erich Schubert and Andreas Spitz and Michael Gertz. Exploring Significant Interactions in Live News. In Proceedings of the 2nd International Workshop on Recent Trends in News Information Retrieval (NewsIR'18) co-located with 40th European Conference on Information Retrieval (ECIR 2018), Grenoble, France, pages 39--44, 2018.
Schubert/etal/2018b Erich Schubert and Sibylle Hess and Katharina Morik. The Relationship of DBSCAN to Matrix Factorization and Spectral Clustering. In Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA), Mannheim, Germany, pages 330--334, 2018.
Schubert/Gertz/2018a Erich Schubert and Michael Gertz. Numerically Stable Parallel Computation of (Co-)Variance. In Proceedings of the 30th International Conference on Scientific and Statistical Database Management (SSDBM), Bolzano-Bozen, Italy, pages 10:1--10:12, 2018.
Schubert/Gertz/2018b Erich Schubert and Michael Gertz. Improving the Cluster Structure Extracted from OPTICS Plots. In Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA), Mannheim, Germany, pages 318--329, 2018.
Casanova/etal/2017a Guillaume Casanova and Elias Englmeier and Michael E. Houle and Peer Kröger and Michael Nett and Erich Schubert and Arthur Zimek. Dimensional Testing for Reverse k-Nearest Neighbor Search. In Proceedings of the VLDB Endowment, Vol. 10, No. 7, pages 769--780, 2017.
Kirner/etal/2017a Evelyn Kirner and Erich Schubert and Arthur Zimek. Good and Bad Neighborhood Approximations for Outlier Detection Ensembles. In Proceedings of the 10th International Conference on Similarity Search and Applications (SISAP), Munich, Germany, pages 173--187, 2017.
Kriegel/etal/2017a Hans-Peter Kriegel and Erich Schubert and Arthur Zimek. The (black) art of runtime evaluation: Are we comparing algorithms or implementations? In Knowledge and Information Systems (KAIS), Vol. 52, No. 2, pages 341--378, 2017.
Schubert/etal/2017b Erich Schubert and Andreas Spitz and Michael Weiler and Johanna Geiß and Michael Gertz. Semantic Word Clouds with Background Corpus Normalization and t-distributed Stochastic Neighbor Embedding. In CoRR, Vol. abs/1708.03569, 2017.
Schubert/etal/2017c Erich Schubert and Jörg Sander and Martin Ester and Hans-Peter Kriegel and Xiaowei Xu. DBSCAN Revisited, Revisited: Why and How You Should (Still) Use DBSCAN. In ACM Transactions on Database Systems (TODS), Vol. 42, No. 3, pages 19:1--19:21, 2017.
Schubert/Gertz/2017a Erich Schubert and Michael Gertz. Intrinsic t-Stochastic Neighbor Embedding for Visualization and Outlier Detection - A Remedy Against the Curse of Dimensionality? In Proceedings of the 10th International Conference on Similarity Search and Applications (SISAP), Munich, Germany, pages 188--203, 2017.
Zimek/Schubert/2017a Arthur Zimek and Erich Schubert. Outlier Detection. In Ling Liu and M. Tamer Özsu (editors), Encyclopedia of Database Systems, pages 5, Springer, 2017.
Amsaleg/etal/2016a Laurent Amsaleg and Michael E. Houle and Erich Schubert (editors). Similarity Search and Applications - 9th International Conference, SISAP 2016, Tokyo, Japan, October 24-26, 2016. Proceedings. Vol. 9939, 2016.
Campos/etal/2016a Guilherme O. Campos and Arthur Zimek and Jörg Sander and Ricardo J. G. B. Campello and Barbora Micenková and Erich Schubert and Ira Assent and Michael E. Houle. On the Evaluation of Outlier Detection: Measures, Datasets, and an Empirical Study Continued. In Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA), Potsdam, Germany, 2016.
Campos/etal/2016b Guilherme O. Campos and Arthur Zimek and Jörg Sander and Ricardo J. G. B. Campello and Barbora Micenková and Erich Schubert and Ira Assent and Michael E. Houle. On the Evaluation of Unsupervised Outlier Detection: Measures, Datasets, and an Empirical Study. In Data Mining and Knowledge Discovery, Vol. 30, No. 4, pages 891--927, 2016.
Schubert/etal/2016a Erich Schubert and Michael Weiler and Hans-Peter Kriegel. Scalable Detection of Emerging Topics and Geo-spatial Events in Large Textual Streams. In Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA), Potsdam, Germany, 2016.
Schubert/etal/2016b Erich Schubert and Michael Weiler and Hans-Peter Kriegel. SPOTHOT: Scalable Detection of Geo-spatial Events in Large Textual Streams. In Proceedings of the 28th International Conference on Scientific and Statistical Database Management (SSDBM), Budapest, Hungary, pages 8:1--8:12, 2016.
Schubert/2015a Erich Schubert and OpenStreetMap Contributors. Fast Reverse Geocoder using OpenStreetMap data. 2015.
Schubert/etal/2015a Erich Schubert and Alexander Koos and Tobias Emrich and Andreas Züfle and Klaus Arthur Schmid and Arthur Zimek. A Framework for Clustering Uncertain Data. In Proceedings of the VLDB Endowment, Vol. 8, No. 12, pages 1976--1979, 2015.
Schubert/etal/2015b Erich Schubert and Michael Weiler and Arthur Zimek. Outlier Detection and Trend Detection: Two Sides of the Same Coin. In 1st International Workshop on Event Analytics using Social Media Data at the 15th IEEE International Conference on Data Mining (ICDM), Atlantic City, NJ, pages 40--46, 2015.
Schubert/etal/2015c Erich Schubert and Arthur Zimek and Hans-Peter Kriegel. Fast and Scalable Outlier Detection with Approximate Nearest Neighbor Ensembles. In Proceedings of the 20th International Conference on Database Systems for Advanced Applications (DASFAA), Hanoi, Vietnam, pages 19--36, 2015.
Dang/etal/2014a Xuan Hong Dang and Ira Assent and Raymond T. Ng and Arthur Zimek and Erich Schubert. Discriminative Features for Identifying and Interpreting Outliers. In Proceedings of the 30th International Conference on Data Engineering (ICDE), Chicago, IL, pages 88--99, 2014.
Schubert/etal/2014a Erich Schubert and Michael Weiler and Hans-Peter Kriegel. SigniTrend: Scalable Detection of Emerging Topics in Textual Streams by Hashed Significance Thresholds. In Proceedings of the 20th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), New York, NY, pages 871--880, 2014.
Schubert/etal/2014b Erich Schubert and Arthur Zimek and Hans-Peter Kriegel. Local Outlier Detection Reconsidered: a Generalized View on Locality with Applications to Spatial, Video, and Network Outlier Detection. In Data Mining and Knowledge Discovery, Vol. 28, No. 1, pages 190--237, 2014.
Schubert/etal/2014c Erich Schubert and Arthur Zimek and Hans-Peter Kriegel. Generalized Outlier Detection with Flexible Kernel Density Estimates. In Proceedings of the 14th SIAM International Conference on Data Mining (SDM), Philadelphia, PA, pages 542--550, 2014.
Achtert/etal/2013a Elke Achtert and Hans-Peter Kriegel and Erich Schubert and Arthur Zimek. Interactive Data Mining with 3D-Parallel-Coordinate-Trees. In Proceedings of the ACM International Conference on Management of Data (SIGMOD), New York City, NY, pages 1009--1012, 2013.
Schubert/2013a Erich Schubert. Generalized and Efficient Outlier Detection for Spatial, Temporal, and High-Dimensional Data Mining. Ludwig-Maximilians-Universität München, Munich, Germany, 2013.
Schubert/etal/2013a Erich Schubert and Arthur Zimek and Hans-Peter Kriegel. Geodetic Distance Queries on R-Trees for Indexing Geographic Data. In Proceedings of the 13th International Symposium on Spatial and Temporal Databases (SSTD), Munich, Germany, pages 146--164, 2013.
Zimek/etal/2013a Arthur Zimek and Erich Schubert and Hans-Peter Kriegel. Outlier Detection in High-Dimensional Data. 2013.
Achtert/etal/2012a Elke Achtert and Sascha Goldhofer and Hans-Peter Kriegel and Erich Schubert and Arthur Zimek. Evaluation of Clusterings -- Metrics and Visual Support. In Proceedings of the 28th International Conference on Data Engineering (ICDE), Washington, DC, pages 1285--1288, 2012.
Kriegel/etal/2012a Hans-Peter Kriegel and Peer Kröger and Erich Schubert and Arthur Zimek. Outlier Detection in Arbitrarily Oriented Subspaces. In Proceedings of the 12th IEEE International Conference on Data Mining (ICDM), Brussels, Belgium, pages 379--388, 2012.
Schubert/etal/2012a Erich Schubert and Remigius Wojdanowski and Arthur Zimek and Hans-Peter Kriegel. On Evaluation of Outlier Rankings and Outlier Scores. In Proceedings of the 12th SIAM International Conference on Data Mining (SDM), Anaheim, CA, pages 1047--1058, 2012.
Zimek/etal/2012a Arthur Zimek and Erich Schubert and Hans-Peter Kriegel. Outlier Detection in High-Dimensional Data. pages xxx--xxxii, 2012.
Zimek/etal/2012b Arthur Zimek and Erich Schubert and Hans-Peter Kriegel. A Survey on Unsupervised Outlier Detection in High-Dimensional Numerical Data. In Statistical Analysis and Data Mining, Vol. 5, No. 5, pages 363--387, 2012.
Achtert/etal/2011a Elke Achtert and Ahmed Hettab and Hans-Peter Kriegel and Erich Schubert and Arthur Zimek. Spatial Outlier Detection: Data, Algorithms, Visualizations. In Proceedings of the 12th International Symposium on Spatial and Temporal Databases (SSTD), Minneapolis, MN, pages 512--516, 2011.
Bernecker/etal/2011a Thomas Bernecker and Michael E. Houle and Hans-Peter Kriegel and Peer Kröger and Matthias Renz and Erich Schubert and Arthur Zimek. Quality of Similarity Rankings in Time Series. In Proceedings of the 12th International Symposium on Spatial and Temporal Databases (SSTD), Minneapolis, MN, pages 422--440, 2011.
Kriegel/etal/2011a Hans-Peter Kriegel and Erich Schubert and Arthur Zimek. Evaluation of Multiple Clustering Solutions. In 2nd MultiClust Workshop: Discovering, Summarizing and Using Multiple Clusterings Held in Conjunction with ECML PKDD 2011, Athens, Greece, pages 55--66, 2011.
Kriegel/etal/2011b Hans-Peter Kriegel and Peer Kröger and Erich Schubert and Arthur Zimek. Interpreting and Unifying Outlier Scores. In Proceedings of the 11th SIAM International Conference on Data Mining (SDM), Mesa, AZ, pages 13--24, 2011.
Achtert/etal/2010a Elke Achtert and Hans-Peter Kriegel and Lisa Reichert and Erich Schubert and Remigius Wojdanowski and Arthur Zimek. Visual Evaluation of Outlier Detection Models. In Proceedings of the 15th International Conference on Database Systems for Advanced Applications (DASFAA), Tsukuba, Japan, pages 396--399, 2010.
Bernecker/etal/2010a Thomas Bernecker and Tobias Emrich and Franz Graf and Hans-Peter Kriegel and Peer Kröger and Matthias Renz and Erich Schubert and Arthur Zimek. Subspace Similarity Search Using the Ideas of Ranking and Top-k Retrieval. In Proceedings of the 26th International Conference on Data Engineering (ICDE) Workshop on Ranking in Databases (DBRank), Long Beach, CA, pages 4--9, 2010.
Bernecker/etal/2010b Thomas Bernecker and Tobias Emrich and Franz Graf and Hans-Peter Kriegel and Peer Kröger and Matthias Renz and Erich Schubert and Arthur Zimek. Subspace Similarity Search: Efficient k-NN Queries in Arbitrary Subspaces. In Proceedings of the 22nd International Conference on Scientific and Statistical Database Management (SSDBM), Heidelberg, Germany, pages 555--564, 2010.
Faerber/etal/2010a Ines Färber and Stephan Günnemann and Hans-Peter Kriegel and Peer Kröger and Emmanuel Müller and Erich Schubert and Thomas Seidl and Arthur Zimek. On Using Class-Labels in Evaluation of Clusterings. In MultiClust: 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings Held in Conjunction with KDD 2010, Washington, DC, 2010.
Houle/etal/2010a Michael E. Houle and Hans-Peter Kriegel and Peer Kröger and Erich Schubert and Arthur Zimek. Can Shared-Neighbor Distances Defeat the Curse of Dimensionality? In Proceedings of the 22nd International Conference on Scientific and Statistical Database Management (SSDBM), Heidelberg, Germany, pages 482--500, 2010.
Achtert/etal/2009a Elke Achtert and Thomas Bernecker and Hans-Peter Kriegel and Erich Schubert and Arthur Zimek. ELKI in Time: ELKI 0.2 for the Performance Evaluation of Distance Measures for Time Series. In Proceedings of the 11th International Symposium on Spatial and Temporal Databases (SSTD), Aalborg, Denmark, pages 436--440, 2009.
Kriegel/etal/2009b Hans-Peter Kriegel and Peer Kröger and Erich Schubert and Arthur Zimek. Outlier Detection in Axis-Parallel Subspaces of High Dimensional Data. In Proceedings of the 13th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), Bangkok, Thailand, pages 831--838, 2009.
Kriegel/etal/2009d Hans-Peter Kriegel and Peer Kröger and Erich Schubert and Arthur Zimek. LoOP: Local Outlier Probabilities. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM), Hong Kong, China, pages 1649--1652, 2009.
Kriegel/etal/2008b Hans-Peter Kriegel and Peer Kröger and Erich Schubert and Arthur Zimek. A General Framework for Increasing the Robustness of PCA-Based Correlation Clustering Algorithms. In Proceedings of the 20th International Conference on Scientific and Statistical Database Management (SSDBM), Hong Kong, China, pages 418--435, 2008.
Riley/Schubert/2005a Patrick F. Riley and Erich Schubert. mReplay: Mobile Sports Replay and Fan Democracy. In Axmedis 2005: Proceedings of the 1st International conference on Automated production of Cross Media content for Multi-channel distribution, 2005.
Schubert/2005a Erich Schubert. Structure Preserving Difference Search in Semistructured Data. Ludwig-Maximilians-Universität München, Munich, Germany, 2005.
Schubert/etal/2005a Erich Schubert and Sebastian Schaffert and François Bry. Structure-Preserving Difference Search for XML Documents. In Proceedings of the Extreme Markup Languages 2005 Conference, Montreal, Quebec, Canada, 2005.