Textbook
Monographs
Refereed Journal Articles
Refereed Papers in Conference Proceedings
Workshop papers
Tutorials
Textbook
J. Han, M. Kamber, and J. Pei. Data Mining: Concepts and Techniques, third edition, Morgan Kaufmann, 2011, ISBN: 978-0-1238-1479-1.
Monographs
- M. Hua[student] and J. Pei, Ranking Queries on Uncertain Data, Springer, 2011, ISBN: 978-1-4419-9379-3.
- G. Dong and J. Pei, Sequence Data Mining, Springer, 2007, ISBN: 978-0-3876-9936-3.
Refereed Journal Articles
- H. Shi, M. Tayebi, J. Pei, and J. Cao. “Cost-Sensitive Learning for Medical Insurance Fraud Detection with Temporal Information”. To appear in IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society.
- R. Huang, J. Wang, S. Song, X. Lin, X. Zhu, and J. Pei. “Efficiently Cleaning Structured Event Logs: A Graph Repair Approach”. To appear in ACM Transactions on Database Systems, ACM Press.
- L. Wu, Y. Chen, K. Shen, X. Guo, H. Gao, S. Li, J. Pei, and B. Long. “Graph Neural Networks for Natural Language Processing: A Survey”. To appear in Foundations and Trends in Machine Learning, NOW Publishers.
- L. Xia, C. Huang, Y. Xu, and J. Pei. “Multi-Behavior Sequential Recommendation with Temporal Graph Transformer”. To appear in IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society.
- H. Jiang, J. Pei, D. Yu, J. Yu, B. Gong, and X. Cheng. “Applications of Differential Privacy in Social Network Analysis: A Survey”. To appear in IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society.
- Z. Zhang, P. Cui, J. Pei, X. Wang, and W. Zhu. “Eigen-GNN: a Graph Structure Preserving Plug-in for GNNs”. To appear in IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society.
- Z. Zhang, C. Niu, P. Cui, J. Pei, B. Zhang, and W. Zhu. “Permutation-equivariant and Proximity-aware Graph Neural Networks with Stochastic Message Passing”. To appear in IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society.
- C. Liu, Z. Zhou, J. Pei, Y. Zhang, and Y. Shi. “Decentralized Composite Optimization in Stochastic Networks: A Dual Averaging Approach with Linear Convergence”. To appear in IEEE Transactions on Automatic Control, IEEE Control Systems Society.
- H. Shi, Y. Yang, L. Wang, D. Ma, M.F. Beg, J. Pei, and J. Cao. “Two-Dimensional Functional Principal Component Analysis for Image Feature Extraction”. Volume 31, Issue 4, pages 1127-1140, 2022, Journal of Computational and Graphical Statistics, Taylor & Francis Group.
- J. Pei. “A Survey on Data Pricing: from Economics to Data Science”. IEEE Transactions on Knowledge and Data Engineering, Volume 34, Issue 10, pages 4586-4608, October 2022, IEEE Computer Society.
- Z. Cong, X. Luo, J. Pei, F. Zhu, and Y. Zhang. “Data Pricing in Machine Learning Pipelines”. Knowledge and Information Systems, Volume 65, Issue 6, June 2022, Springer-Verlag.
- F. Huang, S. Gao, J. Pei, and H. Huang. “Accelerated Zeroth-Order and First-Order Momentum Methods from Mini to Minimax Optimization”. Journal of Machine Learning Research, Volume 23, Number 36, pages 1-70, January 2022, JMLR, Inc.
- X. Hu[student], L. Chu, J. Pei, W. Liu, and J. Bian. “Model Complexity of Deep Learning: a Survey”. Knowledge and Information Systems, Volume 63, Issue 10, pages 2585-2619, August 2021, Springer-Verlag.
- J. Liu, L. Xiong, J. Pei, J. Luo, H. Zhang, and W. Yu. “Group-based Skyline for Pareto Optimal Groups”. IEEE Transactions on Knowledge and Data Engineering, Volume 33, Issue 7, pages 2914-2929, July 2021, IEEE Computer Society.
- H. Shi, D. Ma, Y. Nie, M. Beg, J. Pei, and J. Cao. “Early Diagnosis of Alzheimer’s Disease on ADNI Data Using Novel Longitudinal Score Based on Functional Principal Component Analysis”. Journal of Medical Imaging, Volume 8, Issue 2, April 2021, SPIE.
- Y. Yang[student] and J. Pei. “Influence Analysis in Evolving Networks: A Survey”. IEEE Transactions on Knowledge and Data Engineering, Volume 33, Issue 3, pages 1045-1063, March 2021, IEEE Computer Society.
- W. Yu, X. He, J. Pei, X. Chen, L. Xiong, J. Liu, and Z. Qin. “Visually Aware Recommendation with Aesthetic Features”. The VLDB Journal, Volume 30, Issue 4, pages 495-513, July 2021, Springer Berlin / Heidelberg.
- J. Liu, J. Yang, L. Xiong, J. Pei, J. Luo, Y. Guo, S. Ma, and C. Fan. “Skyline Diagram: Efficient Space Partitioning for Skyline Queries”. IEEE Transactions on Knowledge and Data Engineering, Volume 33, Issue 1, pages 271-286, January 2021, IEEE Computer Society.
- D-W. Choi[student], J. Pei, and X. Lin. “On Spatial Keyword Covering”. Knowledge and Information Systems: An International Journal, Volume 62, Issue 7, pages 2577-2612, July 2020, Springer-Verlag.
- W. Yu, J. Liu, J. Pei, L. Xiong, X. Chen, and Z. Qin. “Efficient Contour Computation of Group-based Skyline”. IEEE Transactions on Knowledge and Data Engineering, Volume 32, Issue 7, pages 1317-1332, July 2020, IEEE Computer Society.
- Y. Yang[student], X. Mao[student], J. Pei, and X. He. “Continuous Influence Maximization”. ACM Transactions on Knowledge Discovery from Data, Volume 14, No. 3, Article 29, 38 pages, March 2020, ACM Press.
- M. Lei[visitor], L. Chu[student], Z. Wang[visitor], J. Pei, C. He, and X. Zhang. “Mining Top-k Sequential Patterns in Transaction Database Graphs: A New Challenging Problem and a Sampling-based Approach”. World Wide Web Journal, Volume 23, Issue 1, pages 103-130, January 2020, Springer-Verlag.
- Z. Zhao[student], L. Chu[student], D. Tao, and J. Pei. “Classification with Label Noise: A Markov Chain Sampling Framework”. Data Mining and Knowledge Discovery, Volume 33, Issue 5, pages 1468–1504, September 2019, Springer-Verlag.
- J. Liu, J. Yang, L. Xiong, and J. Pei. “Secure and Efficient Skyline Queries on Encrypted Data”. IEEE Transactions on Knowledge and Data Engineering, Volume 31, Issue 7, pages 1397-1411, July 2019, IEEE Computer Society.
- W. Yu, X. Lin, W. Zhang, J. Pei, and J. McCann. “SimRank*: Effective and Scalable Pairwise Similarity Search Based on Graph Topology”. The VLDB Journal, Volume 28, Issue 3, pages 401-426, June 2019, Springer Berlin / Heidelberg.
- P. Cui, X. Wang, J. Pei, and W. Zhu. “A Survey on Network Embeddings”. IEEE Transactions on Knowledge and Data Engineering, Volume 31, Issue 5, pages 833-852, May 2019, IEEE Computer Society.
- D. Zhu, P. Cui, Z. Zhang, J. Pei, and W. Zhu. “High-order Proximity Preserved Embeddings for Dynamic Networks”. IEEE Transactions on Knowledge and Data Engineering, Volume 30, Issue 11, pages 2134-2144, November 2018, IEEE Computer Society.
- J. Hu[student] and J. Pei. “Subspace Multi-clustering: A Review”. Knowledge and Information Systems: An International Journal, Volume 56, Issue 2, pages 257-284, August 2018, Springer-Verlag.
- Y. Yang[student], Z. Wang[visitor], J. Pei, and E. Chen. “Tracking Influential Individuals in Dynamic Networks”. IEEE Transactions on Knowledge and Data Engineering, Volume 29, Issue 11, pages 2615-2628, November 2017, IEEE Computer Society.
- Z. Wang[visitor], Y. Yang[student], J. Pei, L. Chu[student], and E. Chen. “Activity Maximization by Effective Information Diffusion in Social Networks”. IEEE Transactions on Knowledge and Data Engineering, Volume 29, Issue 11, pages 2374-2387, November 2017, IEEE Computer Society.
- Y. Yang[student], J. Pei, and A. Al-Barakati. “Measuring In-Network Node Similarity Based on Neighborhoods: A Unified Parametric Approach”. Knowledge and Information Systems: An International Journal, Volume 53, Issue 1, pages 43-70, October 2017, Springer-Verlag.
- J. Hu[student], Q. Qian, J. Pei, R. Jin, and S. Zhu. “Finding Multiple Stable Clusterings”. Knowledge and Information Systems: An International Journal, Volume 51, Issue 3, pages 991-1021, June 2017, Springer-Verlag.
- A. Campbell[student], X. Mao [student], J. Pei, and A. Al-Barakati. “Multidimensional Business Benchmarking Analysis on Data Warehouses”. Intelligent Data Analysis – An International Journal, Volume 13, Issue 1, pages 51-75, January 2017, IOS Press.
- K. Yu[student], X. Wu, W. Ding, and J. Pei. “Scalable and Accurate Online Feature Selection for Big Data”. ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 11 Issue 2, Article No. 6, December 2016, ACM Press.
- S. Liu, J. Yin, X. Wang, W. Cui, K. Cao, and J. Pei. “Online Visual Analytics of Text Streams”. IEEE Transactions on Visualization and Computer Graphics, Volume 22, Issue 11, pages 2451-2466, November 2016, IEEE Computer Society.
- N. X. Vinh, J. Chan, S. Romano, J. Bailey, C. Leckie, K. Ramamohanarao, and J. Pei. “Discovering Outlying Aspects in Large Datasets”. Data Mining and Knowledge Discovery, Volume 30, Issue 6, pages 1520-1555, November 2016, Springer-Verlag.
- X. Xu[student], C. Gao[student], J. Pei, K. Wang, and A. Al-Barakati. “Continuous Similarity Search for Evolving Queries”. Knowledge and Information Systems: An International Journal, Volume 48, Issue 3, pages 649-678, September 2016, Springer-Verlag.
- L. Duan[visitor], G. Tang[student], J. Pei, J. Bailey, G. Dong, V. Nguyen, A. Campbell, and C. Tang. “Efficient Discovery of Contrast Subspaces for Object Explanation and Characterization”. Knowledge and Information Systems: An International Journal, Volume 47, Issue 1, pages 99-129, April 2016, Springer-Verlag.
- L. Duan[visitor], G. Tang[student], J. Pei, J. Bailey, A. Campbell[student], and C. Tang. “Mining Outlying Aspects on Numeric Data”. Data Mining and Knowledge Discovery, Volume 29, Issue 5, pages 1116-1151, September 2015, Springer-Verlag.
- G. Tang[student], J. Pei, J. Bailey, and G. Dong. “Mining Multidimensional Contextual Outliers from Categorical Relational Data”. Intelligent Data Analysis – An International Journal, Volume 19, No. 5, pages 1171-1192, September 2015, IOS Press.
- X. Zhang, W. Dou, J. Pei, S. Nepal, C. Yang, C. Liu, and J. Chen. “Proximity-aware Local-recoding Anonymization with MapReduce for Scalable Big Data Privacy Preservation in Cloud”. IEEE Transactions on Computers, Volume 64, Issue 8, pages 2293-2307, August 2015, IEEE Computer Society.
- K. Yu[student], W. Ding, D.A. Simovici, H. Wang, J. Pei, and X. Wu. “Classification with Streaming Features: An Emerging Pattern Mining Approach”. ACM Transactions on Knowledge Discovery from Data, Volume 9, Issue 4, Article No. 30, June 2015, ACM Press.
- Y. Yang, Q. S. Lu, G. Tang[student], and J. Pei. “The Impact of Market Competition on Search Advertising”. Journal of Interactive Marketing, Volume 30, Issue C, pages 46-55, May 2015, Elsevier.
- C. Yang, X. Zhang, C. Zhong, C. Liu, J. Pei, K. Ramamohanarao, and J. Chen. “A spatiotemporal compression based approach for efficient big data processing on Cloud”. Journal of Computer and System Sciences, Volume 80, Issue 8, pages 1563-1583, December 2014, Elsevier.
- D. Huang[student], K. Xu, and J. Pei. “Malicious URL Detection by Dynamically Mining Patterns without Pre-defined Elements”. World Wide Web Journal, Volume 17, Issue 6, pages 1375-1394, November 2014, Springer-Verlag.
- G. Tang[student], J. Pei, and W-S. Luk. “Email Mining: Tasks, Common Techniques, and Tools”. Knowledge and Information Systems: An International Journal, Volume 41, Issue 1, pages 1-31, October 2014, Springer-Verlag.
- Y. Yang, J. X. Yu, H. Gao, J. Pei, and J. Li. “Mining Most Frequently Changing Component in Evolving Graphs”. World Wide Web Journal, Volume 17, Issue 3, pages 351-376, May 2014, Springer-Verlag.
- Y. Zhang, W. Zhang, J. Pei, X. Lin, Q. Lin, and A. Li. “Consensus-based Ranking of Multi-valued Objects: A Generalized Borda Count Approach”. IEEE Transactions on Knowledge and Data Engineering, Volume 26, Issue 1, pages 83-96, January 2014, IEEE Computer Society.
- Y-C. Lo, J-Y. Li, M-Y. Yeh, S-D. Lin, and J. Pei. “What Distinguish One from Its Peers in Social Networks?”. Data Mining and Knowledge Discovery, Volume 27, Issue 3, pages 396-420, November 2013, Springer-Verlag.
- Z. Liao, D. Jiang, J. Pei, Y. Huang, E. Chen, H. Cao, and H. Li. “A vlHMM Approach to Context-Aware Search”. ACM Transactions on the Web, Volume 7, Issue 4, Article No. 2, 38 pages, October 2013, ACM Press.
- D. Jiang, J. Pei, and H. Li. “Mining Search and Browse Logs for Web Search: A Survey”. ACM Transactions on Intelligent Systems and Technology, Volume 4, Issue 4, Article Number 57, 37 pages, September 2013, ACM Press.
- B. Jiang[student], J. Pei, Y. Tao, and X. Lin. “Clustering Uncertain Data Based on Probability Distribution Similarity”. IEEE Transactions on Knowledge and Data Engineering, Volume 25, No. 4, pages 751-763, April 2013, IEEE Computer Society.
- Y. Cui[student], J. Pei, G. Tang[student], W-S. Luk, D. Jiang, and M. Hua. “Finding Email Correspondents in Online Social Networks”. World Wide Web Journal, Volume 16, Issue 2, pages 195-218, March 2013, Springer-Verlag.
- J. Chen[visitor], J. Huang[visitor], B. Jiang[student], J. Pei, and J. Yin. “Recommendations for Two-Way Selections Using Skyline View Queries”. Knowledge and Information Systems: An International Journal, Volume 34, Issue 2, pages 397-424, February 2013, Springer-Verlag.
- J. Huang[visitor], B. Jiang[student], J. Pei, J. Chen[visitor], and Y. Tang. “Skyline Distance: A Measure of Multidimensional Competence”. Knowledge and Information Systems: An International Journal, Volume 34, Issue 2, pages 373-396, February 2013, Springer-Verlag.
- Y. Liu, Y. Zhao, L. Chen, J. Pei, and J. Han. “Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays”. IEEE Transactions on Parallel and Distributed Systems, Volume 23, Issue 11, pages 2138-2149, November 2012, IEEE Computer Society.
- Q. Jiang[student], A. Campbell, G. Tang[student], and J. Pei. “Multi-level Relationship Outlier Detection”.the International Journal of Business Intelligence and Data Mining (IJBIDM), Volume 7, Number 4, pages 253-273, October 2012, InderScience Publishers.
- L. Li[student], S. Petschulat, G. Tang[student], J. Pei, W-S. Luk. “Efficient and Effective Aggregate Keyword Search on Relational Databases”. the International Journal of Data Warehousing and Mining (IJDWM), Volume 7, Issue 4, pages 41-81, October 2012, InderScience Publishers.
- M. Hua[student] and J. Pei. “Clustering in Applications with Multiple Data Sources — A Mutual Subspace Clustering Approach”. Neurocomputing, Volume 92, pages 133-144, September 2012, Elsevier.
- Z. Xing[student], J. Pei, and P.S. Yu. “Early Classification on Time Series”. Knowledge and Information Systems: An International Journal, Volume 31, Issue 1, pages 105-127, April 2012, Springer-Verlag.
- B. Zhou[student] and J. Pei. “Aggregate Keyword Search on Large Relational Databases”. Knowledge and Information Systems: An International Journal, Volume 30, Number 2, pages 283-318, February 2012, Springer-Verlag.
- B. Jiang[student], J. Pei, X. Lin, and Y. Yuan. “Probabilistic Skylines on Uncertain Data: Model and Bounding-Pruning-Refining Methods”. Journal of Intelligent Information Systems, Volume 38, Number 1, pages 1-39, February 2012, Springer-Verlag.
- X. Sun, H. Wang, J. Li, and J. Pei. “Publishing Anonymous Survey Rating Data”. Data Mining and Knowledge Discovery, Volume 23, pages 379-406, November 2011, Springer-Verlag.
- Z. Liao, D. Jiang[student], E. Chen, J. Pei, H. Cao, and H. Li. “Mining Concept Sequences from Large-Scale Search Logs for Context-Aware Query Suggestion”. ACM Transactions on Intelligent Systems and Technology, Volume 3, Issue 1, pages 17:1-40, October 2011, ACM Press.
- R. C.W. Wong, A. W.C. Fu, K. Wang, P. S. Yu, and J. Pei. “Can the Utility of Anonymized Data Be Used for Privacy Breaches?”. ACM Transactions on Knowledge Discovery in Data, Volume 5, Issue 3, pages 16:1-24, August 2011, ACM Press.
- B. Zhou[student] and J. Pei. “K-Anonymity and L-Diversity Approaches for Privacy Preservation in Social Networks against Neighborhood Attacks”. Knowledge and Information Systems: An International Journal, Volume 28, Number 1, pages 47-77, July 2011, Springer-Verlag.
- Y. Zhang, W. Zhang, X. Lin, B. Jiang[student], and J. Pei. “Ranking Uncertain Sky: the Probabilistic Top-k Skyline Operator”. Information System Journal, Volume 36, Issue 5, pages 898-915, July 2011, Elsevier Ltd.
- M. Hua[student], J. Pei, and X. Lin. “Ranking Queries on Uncertain Data”. The VLDB Journal, Volume 20, Number 1, pages 129-153, February 2011, Springer Berlin / Heidelberg.
- Z. Xing[student], J. Pei, and E. Keogh. “A Brief Survey on Sequence Classification”. ACM SIGKDD Explorations, Volume 12, Issue 1, pages 40-48, June 2010, ACM Press.
- Z. Lin[student], B. Jiang[student], and J. Pei. “Mining Discriminative Items in Multiple Data Streams”.World Wide Web Journal, Volume 13, Issue 4, pages 497-522, December 2010, Springer-Verlag.
- Z. Xing[student] and J. Pei. “Exploring Disease Association from the NHANES Data: Data Mining, Pattern Summarization, and Visual Analytics”. the International Journal of Data Warehousing and Mining (IJDWM), Volume 6, Issue 3, pages 11-27, July-September 2010, InderScience Publishers.
- E. Loekito, J. Bailey, and J. Pei. “Binary Decision Diagram Based Approach for Mining Frequent Subsequences”. Knowledge and Information Systems: An International Journal, Volume 24, Number 2, pages 235-268, August 2010, Springer-Verlag.
- S. Yuen, Y. Tao, X. Xiao, J. Pei, D. Zhang. “Superseding Nearest Neighbor Search on Uncertain Spatial Databases”. IEEE Transactions on Knowledge and Data Engineering, Volume 22, Number 7, pages 1041-1055, July 2010, IEEE Computer Society.
- X. Cheng, J. Xu[student], J. Pei, and J. Liu. “Hierarchical Distributed Data Classification in Wireless Sensor Networks”. Computer Communication, Volume 33, Issue 12, pages 1404-1413, July 15, 2010, Elsevier.
- W. Zhang, X. Lin, Y. Zhang, J. Pei, and W. Wang. “Threshold-based Probabilistic Top-k Dominating Queries”, The VLDB Journal, Volume 19, Number 2, pages 283-305, April 2010, Springer Berlin / Heidelberg.
- M. A. Cheema, X. Lin, W. Wang, W. Zhang, and J. Pei. “Probabilistic Reverse Nearest Neighbor Queries on Uncertain Data”. IEEE Transactions on Knowledge and Data Engineering, Volume 22, Number 4, pages 550-564, April 2010, IEEE Computer Society.
- B. Aljaber, N. Stokes, J. Bailey, and J. Pei. “Document Clustering of Scientific Texts Using Citations Contexts”. Information Retrieval, Volume 13, Number 2, pages 101-131, April 2010, Springer-Verlag.
- M. Hua[student], M. K. Lau[student], J. Pei, and K. Wu. “Continuous K-Means Monitoring with Low Reporting Cost in Sensor Networks”. IEEE Transactions on Knowledge and Data Engineering, Volume 21, Number 12, pages 1679-1691, December 2009, IEEE Computer Society.
- X. Zeng[student], J. Pei, K. Wang, and J. Li. “PADS: A Simple Yet Effective Pattern-Aware Dynamic Search Method for Fast Maximal Frequent Pattern”. Knowledge and Information Systems: An International Journal, Volume 20, Number 3, pages 375-391, September 2009, Springer-Verlag.
- M. Hua[student] and J. Pei. “Continuously Monitoring Top-K Uncertain Data Streams: A Probabilistic Threshold Method”. Distributed and Parallel Databases: An International Journal, Volume 26, Number 1, (special issue on ranking in databases), pages 29-65, August, 2009, Springer-Verlag.
- B. Zhou[student] and J. Pei. “Link Spam Target Detection Using Page Farms”. ACM Transactions on Knowledge Discovery in Data, Volume 3, Number 3, pages 13:1-38, July 2009, ACM Press.
- R. She, J. S.-C. Chu, K. Wang, J. Pei, and N. Chen. “genBlastA: Enabling BLAST to identify homologous gene sequences”. Genome Research, Number 19, pages 143-149, 2009, Cold Spring Harbor Laboratory Press.
- M. P. Ng, I. A. Vergara, C. Frech, Q. Chen, X. Zeng[student], J. Pei, and N. Chen. “OrthoClusterDB: a Web Server for Synteny Blocks”. BMC Bioinformatics, Volume 10, Article 192, 2009.
- M. Hua[student], J. Pei, A. W.-C. Fu, X. Lin, and H-F Leung. “Top-k Typicality Queries and Efficient Query Answering Methods on Large Databases”. The VLDB Journal, Volume 18, Number 3, pages 809-835, June 2009, Springer Berlin / Heidelberg.
- R. C.-W. Wong[visitor], A. W.-C. Fu, K. Wang, and J. Pei. “Anonymization based Attack in Privacy Preserving Data Publishing”. ACM Transactions on Database Systems, Volume 34, Issue 2, pages 8:1-46, June 2009, ACM Press.
- D. Jiang[student] and J. Pei. “Mining Frequent Cross-Graph Quasi-Cliques”. ACM Transactions on Knowledge Discovery in Data, Volume 2, Number 4, pages 16:1-42, January 2009, ACM Press.
- R. C.-W. Wong[visitor], J. Pei, A. W.-C. Fu, and K. Wang. “Online Skyline Analysis with Dynamic Preferences on Nominal Attributes”. IEEE Transactions on Knowledge and Data Engineering, Volume 21, Number 1, pages 35-49, January 2009, IEEE Computer Society.
- B. Zhou[student], J. Pei, and W.-S. Luk. “A Brief Survey on Anonymization Techniques for Privacy Preserving Publishing of Social Network Data”. ACM SIGKDD Explorations, Volume 10, Issue 2, pages 12-22, December 2008, ACM Press.
- J. Li, R. C.-W. Wong, A. W.-C. Fu, and J. Pei. “Anonymisation by Local Recoding in Data with Attribute Hierarchical Taxonomies”. IEEE Transactions on Knowledge and Data Engineering, Volume 20, Number 9, pages 1181-1194, September 2008, IEEE Computer Society.
- H. Wang and J. Pei. “Clustering by Pattern Similarity”. Journal of Computer Science and Technology, Volume 23, Number 4, pages 481-496, July, 2008, Springer.
- D. Jiang[student], J. Pei, M. Ramanathan, C. Lin, C. Tang, and A. Zhang. “Mining Gene-Sample-Time Microarray Data: A Coherent Gene Cluster Discovery Approach”. Knowledge and Information Systems: An International Journal, Volume 13, Number 3, pages 305-335, November 2007, Springer-Verlag.
- Y. Tao, X. Xiao, and J. Pei. “Efficient Skyline and Top-k Retrieval in Subspaces”. IEEE Transactions on Knowledge and Data Engineering, Volume 19, Number 8, pages 1072-1088, August 2007, IEEE Computer Society.
- M. Cho[student], J. Pei, and K. Wang. “Answering Ad Hoc Aggregate Queries from Data Streams Using Prefix Aggregate Trees”. Knowledge and Information Systems: An International Journal, Volume 12, Number 3, pages 301-329, August 2007, Springer-Verlag.
- C. Liu, K. Wu, and J. Pei. “An Energy Efficient Data Collection Framework for Wireless Sensor Networks by Exploiting Spatiotemporal Correlation”. IEEE Transactions on Parallel and Distributed Systems, Volume 18, Number 7, pages 1010-1023, July 2007, IEEE Computer Society.
- J. Pei, J. Han, H. Lu, S. Nishio, S. Tang, and D. Yang. “H-Mine: Fast and space-preserving frequent pattern mining in large databases”. IIE Transactions, Volume 39, Issue 6, pages 593-605, June, 2007, Taylor & Francis.
- J. Pei, J. Han, and W. Wang. “Constraint-Based Sequential Pattern Mining: The Pattern-Growth Methods”. Journal of Intelligent Information Systems, Volume 28, Number 2, pages 133-160, April, 2007, Springer-Verlag.
- J. Pei, Y. Yuan, X. Lin, W. Jin, M. Ester, Q. Liu, W. Wang, Y. Tao, J.X. Yu, and Q. Zhang. “Towards Multidimensional Subspace Skyline Analysis”. ACM Transactions on Database Systems, Volume 31, Number 4, pages 1335-1381, December 2006, ACM Press.
- Y. Chen, G. Dong, J. Han, J. Pei, B. W. Wah, and J. Wang. “Regression Cubes with Lossless Compression and Aggregation”. IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 12, pages 1585-1599, December 2006, IEEE Computer Society.
- J. Xu, W. Wang, J. Pei, X. Wang, B. Shi, and A. W. Fu. “Utility-Based Anonymization for Privacy Preservation with Less Information Loss”. ACM SIGKDD Explorations, Volume 8, Issue 2, pages 21-30, December 2006.
- J. Pei, H. Wang, J. Liu[student], K. Wang, J. Wang, and P. S. Yu. “Discovering Frequent Closed Partial Orders from Strings”. IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 11, pages 1467-1481, November 2006, IEEE Computer Society.
- I. Pekerskaya[student], J. Pei, and K. Wang. “Mining Changing Regions from Access-Constrained Snapshots: A Cluster-Embedded Decision Tree Approach”. Journal of Intelligent Information Systems (Special Issue on Mining Spatio-Temporal Data), Volume 27, Number 3, pages 215-242, November 2006, Springer-Verlag.
- Y. Huang, J. Pei, and H. Xiong. “Co-location Mining with Rare Spatial Features”. GeoInformatica, Volume 10, Number 3, pages 239-260, September 2006, Springer Netherlands.
- J. Wang, J. Han, and J. Pei. “Closed Constrained-Gradient Mining in Retail Databases”. IEEE Transactions on Knowledge and Data Engineering, Volume 18, Number 6, pages 764-769, June 2006, IEEE Computer Society.
- J. Han, Y. Chen, G. Dong, J. Pei, B. W. Wah, J. Wang, and Y. D. Cai. “Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams”. Distributed and Parallel Databases, Volume 18, Number 2, pages 173-197, September 2005, Springer Science + Business Media.
- D. Jiang[student], J. Pei, and A. Zhang. “An Interactive Approach to Mining Gene Expression Data”. IEEE Transactions on Knowledge and Data Engineering, Volume 17, Number 10, pages 1363-1378, October 2005, IEEE Computer Society.
- M. Cho[student], J. Pei, H. Wang, and W. Wang. “Preference-based Frequent Pattern Mining”. International Journal of Data Warehousing and Mining, Volume 1, Number 4, pages 56-77, October-December, 2005, InderScience Publishers.
- J. Pei, J. Han, B. Mortazavi-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal, and M.C. Hsu. “Mining Sequential Patterns by Pattern-growth: The PrefixSpan Approach”. IEEE Transactions on Knowledge and Data Engineering, Volume 16, Number 11, pages 1424-1440, November 2004, IEEE Computer Society.
- J. Pei, G. Dong, W. Zou, and J. Han. “Mining Condensed Frequent Pattern Bases”. Knowledge and Information Systems: An International Journal, Volume 6, Number 5, pages 570-594, September 2004, Springer-Verlag.
- G. Dong, J. Han, J. Lam, J. Pei, K. Wang, and W. Zou. “Mining Constrained Gradients in Large Databases”. IEEE Transactions on Knowledge and Data Engineering, Volume 16, Number 8, pages 922-938, August 2004, IEEE Computer Society.
- J. Pei, J. Han, and L. V. S. Lakshmanan. “Pushing Convertible Constraints in Frequent Itemset Mining”.Data Mining and Knowledge Discovery: An International Journal, Volume 8, Issue 3, pages 227-252, May 2004, Kluwer Academic Publishers.
- J. Han, J. Pei, and X. Yan. “From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach”. (Invited paper) Journal of Computer Science and Technology, Volume 19, No. 3, pages 257-279, May 2004, Allerton Press, Inc.
- J. Han, J. Pei, Y. Yin, and R. Mao. “Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach”. Data Mining and Knowledge Discovery: An International Journal, Volume 8, Issue 1, pages 53-87, January 2004, Kluwer Academic Publishers.
- D. Jiang[student], J. Pei, and A. Zhang. “Towards Interactive Exploration of Gene Expression Patterns”.ACM SIGKDD Explorations (Special Issue on Microarray Data Analysis), Volume 5, Issue 2, pages 79-90, December 2003.
- Z. Chen, C. Li, J. Pei, Y. Tao, H. Wang, W. Wang, J. Yang, J. Yang, and D. Zhang. “Recent Progress on Selected Topics in Database Research: A Report from Nine Young Chinese Researchers Working in the United States”. (Invited paper) Journal of Computer Science and Technology, Volume 18, No. 5, September, 2003, pages 538-552, Allerton Press, Inc.
- J. Pei and J. Han. “Constrained Frequent Pattern Mining: A Pattern-Growth View”. ACM SIGKDD Explorations (Special Issue on Constraints in Data Mining), Volume 4, Issue 1, pages 31-39, June 2002.
- J. Han and J. Pei. “Mining Frequent Patterns by Pattern-Growth: Methodology and Implications”. ACM SIGKDD Explorations (Special Issue on Scalable Data Mining Algorithms), Volume 2, Issue 2, pages 14-20, December, 2000.
Refereed Papers in Conference Proceedings
- X. Wu, Z. Hu, J. Pei, and H. Huang. “Serverless Biased Stochastic Methods for Multi-Party Collaborative Imbalanced Data Mining”. In Proceedings of the Twenty-ninth ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’23), Long Beach, CA, USA, August 6-10, 2023.
- R. Xue, H. Han, M. Torkamani, J. Pei, and X. Liu. “LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation”. In Proceedings of the Fortieth International Conference on Machine Learning (ICML’23), Honolulu, HI, USA, July 23-29, 2023.
- N. Chen, L. Shou, J. Pei, M. Gong, B. Cao, J. Chang, J. Li, and D. Jiang. “Alleviating Over-smoothing for Unsupervised Sentence Representation”. In Proceedings of the Sixty-first Annual Meeting of the Association for Computational Linguistics (ACL’23), Toronto, ON, Canada, July 9-14, 2023.
- N. Chen, L. Shou, T. Song, M. Gong, J. Pei, J. Chang, D. Jiang, and J. Li. “Structural Contrastive Pretraining for Cross-Lingual Comprehension”. In Findings of the Sixty-first Annual Meeting of the Association for Computational Linguistics (ACL’23), Toronto, ON, Canada, July 9-14, 2023.
- J. Zhang, Q. Sun, J. Liu, L. Xiong, J. Pei, and K. Ren. “Efficient Sampling Approaches to Shapley Value Approximation”. In Proceedings of the 2023 ACM SIGMOD International Conference on Management of Data (SIGMOD’23), Seattle, WA, USA, June 18-23, 2023.
- J. Peng, H. Zou, J. Liu, S. Li, Y. Jiang, J. Pei, and P. Cui. “Offline Policy Evaluation in Large Action Spaces via Outcome-Oriented Action Grouping”. In Proceedings of the 2023 ACM Web Conference (WWW’23), Austin, TX, USA, April 30 – May 4, 2023.
- H. Zou, H. Wang, R. Xu, B. Li, J. Pei, Y.J. Jian, and P. Cui. “Factual Observation Based Heterogeneity Learning for Counterfactual Prediction”. In Proceedings of the Second Conference on Causal Learning and Reasoning (CLeaR’23), Tübingen, German, April 11-14, 2023.
- J. Zhang, H. Xia, Q. Sun, J. Liu, L. Xiong, J. Pei, and K. Ren. “Dynamic Shapley Value Computation”. In Proceedings of the Thirty-Ninth IEEE International Conference on Data Engineering (ICDE’23), Anaheim, CA, USA, April 3-7, 2023.
- L. Xia, Y. Shao, C. Huang, Y. Xu, H. Xu, and J. Pei. “Disentangled Graph Social Recommendation”. In Proceedings of the Thirty-ninth IEEE International Conference on Data Engineering (ICDE’23), Anaheim, CA, USA, April 3-7, 2023.
- Z. Xu, L. Shou, J. Pei, M. Gong, Q. Su, X. Quan, and J. Jiang. “A Graph Fusion Approach for Cross- Lingual Machine Reading Comprehension”. In Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI’23), Washington, DC, USA, February 7-14, 2023.
- S. Wu, R. Zhao, Y. Zheng, J. Pei, and B. Liu. “Identify Event Causality with Knowledge and Analogy”. In Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI’23), Washington, DC, USA, February 7-14, 2023.
- S. Liang, L. Shou, J. Pei, M. Gong, W. Zuo, X. Zuo, and D. Jiang. “Label-aware Multi-level Contrastive Learning for Cross-lingual Spoken Language Understanding”. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP’22), Abu Dhabi, December 7-11, 2022.
- H. Ren, L. Shou, J. Pei, N. Wu, M. Gong, and D. Jiang. “Lexicon-Enhanced Self-Supervised Training for Multilingual Dense Retrieval”. In Findings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP’22), Abu Dhabi, December 7-11, 2022.
- N. Liu, X. Wang, D. Bo, C. Shi, and J. Pei. “Revisiting Graph Contrastive Learning from the Perspective of Graph Spectrum”. In Proceedings of the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS’22), New Orleans, LA, November 28 – December 3, 2022.
- Y. Zhang, R. Bao, J. Pei, and H. Huang. “Toward Unified Data and Algorithm Fairness via Adversarial Data Augmentation and Adaptive Model Fine-tuning”. In Proceedings of the Twenty-second IEEE International Conference on Data Mining (ICDM’22), Orlando, FL, USA, November 28 – December 1, 2022.
- X. Luo, J. Pei, Z. Cong, and C. Xu. “On Shapley Value in Data Assemblage Under Independent Utility”. In Proceedings of the Forty-eighth International Conference on Very Large Databases (VLDB’22), September 5-9, 2022, Sydney, Australia.
- J. Li, J. Pei, and H. Huang. “Communication-Efficient Robust Federated Learning with Noisy Labels”. In Proceedings of the Twenty-eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’22), Washington DC, USA, August 14-18, 2022.
- Y. Zhang, S. Gao, J. Pei, and H. Huang. “Improving Social Network Embedding via New Second-Order Continuous Graph Neural Networks”. In Proceedings of the Twenty-eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’22), Washington DC, USA, August 14-18, 2022.
- N. Chen, L. Shou, M. Gong, J. Pei, and D. Jiang. “Bridging the Gap between Language Models and Cross-Lingual Sequence Labeling”. In proceedings of the 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL’22), Seattle, Washington, WA, July 10-15, 2022.
- Z. Fan, H. Fang, Z. Zhou, J. Pei, M. Friedlander, C. Liu, and Y. Zhang. “Improving Fairness for Data Valuation in Horizontal Federated Learning”. In Proceedings of the Thirty-eighth IEEE International Conference on Data Engineering (ICDE’22), Kuala Lumpur, Malaysia, May 9-12, 2022.
- H. Wang, C. Xu, C. Zhang, J. Xu, P. Zhe, and J. Pei. “vChain+: Optimizing Verifiable Blockchain Boolean Range Queries”. In Proceedings of the Thirty-eighth IEEE International Conference on Data Engineering (ICDE’22), Kuala Lumpur, Malaysia, May 9-12, 2022.
- Z. Li, C. Huang, L. Xia, Y. Xu, and J. Pei. “Spatial-Temporal Hypergraph Self-Supervised Learning for Urban Crime Prediction”. In Proceedings of the Thirty-eighth IEEE International Conference on Data Engineering (ICDE’22), Kuala Lumpur, Malaysia, May 9-12, 2022.
- Y. Zhang, H. Gao, J. Pei, and H. Huang. “Robust Self-Supervised Structural Graph Neural Network for Social Network Prediction”. In Proceedings of the Thirty-first Web Conference (WWW’22), Lyon, France, April 25-29, 2022.
- Y. Zhang, L. Wu, Q. Shen, Y. Pang, Z. Wei, F. Xu, B. Long, and J. Pei. “Multiple Choice Questions based Multi-Interest Policy Learning for Conversational Recommendation”. In Proceedings of the Thirty-first Web Conference (WWW’22), Lyon, France, April 25-29, 2022.
- G. Singh, L. Chu, L. Wang, J. Pei, and Y. Zhang. “Mining Minority-class Examples with Uncertainty Estimates”. In Proceedings of the Twenty-eighth International Conference on Multimedia Modeling (MMM’22), Qui Nhon, Vietnam, April 5-8, 2022.
- L. Charette, L. Chu, Y. Chen, L. Wang, J. Pei, and Y. Zhang. “Cosine Model Watermarking Against Ensemble Distillation”. In Proceedings of the Thirty-sixth AAAI Conference on Artificial Intelligence (AAAI’22), Vancouver, BC, Canada, February 22 – March 1, 2022.
- N. Che, L. Shou, M. Gong, and J. Pei. “From Good to Best: Two-Stage Training for Cross-lingual Machine Reading Comprehension”. In Proceedings of the Thirty-sixth AAAI Conference on Artificial Intelligence (AAAI’22), Vancouver, BC, Canada, February 22 – March 1, 2022.
- Y. Pang, L. Wu, Q. Shen, Y. Zhang, Z. Wei, F. Xu, E. Chang, B. Long, and J. Pei. “Heterogeneous Global Graph Neural Networks for Personalized Session-based Recommendation”. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining (WSDM’22), Phoenix, AZ, USA, February 21-25, 2022.
- M. Bajaj, L. Chu, Z. Y. Xue, J. Pei, L. Wang, P. C.-H. Lam, and Y. Zhang. “Robust Counterfactual Explanations on Graph Neural Networks”. In Proceedings of the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS’21), December 6-14, 2021.
- Y. Guo, L. Shou, J. Pei, M. Gong, M. Xu, Z. Wu, and D. Jiang. “Learning from Multiple Noisy Augmented Data Sets for Better Cross-Lingual Spoken Language Understanding”. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP’21), Punta Cana, Dominican Republic, November 7-11, 2021.
- C. H. Lam, L. Chu, M. Torgonskiy, J. Pei, Y. Zhang, and L. Wang. “Finding Representative Interpretations on Convolutional Neural Networks”. In Proceedings of the 2021 IEEE International Conference on Computer Vision (ICCV’21), Virtual, October 11-17, 2021.
- Z. Cong, L. Chu, Y. Yang, and J. Pei. “Comprehensible Counterfactual Explanation on Kolmogorov-Smirnov Test”. In Proceedings of the Forty-seventh International Conference on Very Large Data Bases (VLDB’21), Copenhagen, Denmark, August 16-20, 2021.
- X. Cheng, C. Zhang, J. Xu, and J. Pei. “SlimChain: Scaling Blockchain Transactions through Off-Chain Storage and Parallel Processing”. In Proceedings of the Forty-seventh International Conference on Very Large Data Bases (VLDB’21), Copenhagen, Denmark, August 16-20, 2021.
- J. Liu, J. Lou, J. Liu, L. Xiong, J. Pei, and J. Sun. “Dealer: An End-to-End Model Marketplace with Differential Privacy”. In Proceedings of the Forty-seventh International Conference on Very Large Data Bases (VLDB’21), Copenhagen, Denmark, August 16-20, 2021.
- S. Liang, M. Gong, J. Pei, L. Shou, W. Zuo, X. Zuo, and D. Jiang. “Reinforced Iterative Knowledge Distillation for Cross-Lingual Named Entity Recognition”. In Proceedings of the Twenty-seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’21), Singapore, August 14-18, 2021.
- A. Banitalebi-Dehkordi, N. Vedula, J. Pei, F. Xia, L. Wang, and Y. Zhang. “Auto-Split: A General Framework of Collaborative Edge-Cloud AI”. In Proceedings of the Twenty-seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’21), Singapore, August 14-18, 2021.
- Q. Zhang, B. Gu, C. Deng, J. Pei, and H. Huang. “AsySQN: Faster Vertical Federated Learning Algorithms with Better Computation Resource Utilization”. In Proceedings of the Twenty-seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’21), Singapore, August 14-18, 2021.
- H. Huang, X. Geng, J. Pei, G. Long, and D. Jiang. “Reasoning over Entity-Action-Location Graph for Procedural Text Understanding”. In Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), Bangkok, Thailand, August 1-6, 2021.
- Y. Zhou, X. Geng, T. Shen, J. Pei, W. Zhang, and D. Jiang. “Modeling Event-Pair Relations in External Knowledge Graphs for Script Reasoning”. In Findings of ACL in the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), Bangkok, Thailand, August 1-6, 2021.
- P. Wang, J. Wang, J. Pei, and W. Zheng. “Automating Entity Matching Model Development”. In Proceedings of the Thirty-seventh IEEE International Conference on Data Engineering (ICDE’21), Chania, Crete, Greece, April 19-22, 2021.
- J. Liu, L. Xiong, Q. Zhang, J. Pei, and J. Luo. “Eclipse: Generalizing kNN and Skyline”. In Proceedings of the Thirty-seventh IEEE International Conference on Data Engineering (ICDE’21), Chania, Crete, Greece, April 19-22, 2021.
- S. Liang, L. Shou, J. Pei, M. Gong, W. Zuo, and D. Jiang. “CalibreNet: Calibration Networks for Multilingual Sequence Labeling”. In Proceedings of the Fourteenth ACM International Conference on Web Search and Data Mining (WSDM’21), Jerusalem, Israel, March 8-12, 2021.
- F. Yuan, L. Shou, J. Pei, W. Lin, M. Gong, Y. Fu, and D. Jiang. “Reinforced Multi-Teacher Selection for Knowledge Distillation”. In Proceedings of the Thirty-fifth AAAI Conference on Artificial Intelligence (AAAI’21), Online, February 2-9, 2021. (Acceptance rate: 1692/9034)
- Y. Huang, L. Chu, Z. Zhou, L. Wang, J. Liu, J. Pei, and Y. Zhang. “Personalized Cross-Silo Federated Learning on Non-IID Data”. In Proceedings of the Thirty-fifth AAAI Conference on Artificial Intelligence (AAAI’21), Online, February 2-9, 2021. (Acceptance rate: 1692/9034)
- L. Xia, C. Huang, Y. Xu, P. Dai, X. Zhang, H. Yang, J. Pei, and L. Bo. “Knowledge-Enhanced Hierarchical Graph Transformer Network for Multi-Behavior Recommendation”. In Proceedings of the Thirty-fifth AAAI Conference on Artificial Intelligence (AAAI’21), Online, February 2-9, 2021. (Acceptance rate: 1692/9034)
- J. Liu, L. Shou, J. Pei, M. Gong, M. Yang, and D. Jiang. “Cross-lingual Machine Reading Comprehension with Language Branch Knowledge Distillation”. In Proceedings of the Twenty-eighth International Conference on Computational Linguistics (COLING’20), Online, December 8-13, 2020.
- X. Zhang, L. Shou, J. Pei, M. Gong, L. Wen, and D. Jiang. “A Graph Representation of Semi-structured Data for Web Question Answering”. In Proceedings of the Twenty-eighth International Conference on Computational Linguistics (COLING’20), Online, December 8-13, 2020.
- L. Chu[student], Y. Zhang[student], Y. Yang[student], L. Wang, and J. Pei. “Online Density Bursting Subgraph Detection from Temporal Graphs”. In Proceedings of the Forty-sixth International Conference on Very Large Data Bases (VLDB’20), Tokyo, Japan, August 31 – September 4, 2020.
- X. Hu[student], W. Liu, J. Bian, and J. Pei. “Measuring Model Complexity of Neural Networks with Curve Activation Functions”. In Proceedings of the Twenty-sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’20), San Diego, CA, USA, August 23-27, 2020. (Acceptance rate: 216/1279)
- L. Shou, S. Bo, F. Cheng, M. Gong, J. Pei, and D. Jiang.. “Mining Implicit Relevance Feedback from User Behavior for Web Question Answering” (Applied Data Science Track). In Proceedings of the Twenty-sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’20), San Diego, CA, USA, August 23-27, 2020. (Acceptance rate: 121/756)
- X. Wang, M. Zhu, D. Bo, P. Cui, C. Shi, and J. Pei. “AM-GCN: Adaptive Multi-channel Graph Convolutional Networks”. In Proceedings of the Twenty-sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’20), San Diego, CA, USA, August 23-27, 2020. (Acceptance rate: 216/1279)
- F. Huang, S. Gao, J. Pei, and H. Huang. “Momentum-Based Policy Gradient Methods”. In Proceedings of the Thirty-seventh International Conference on Machine Learning (ICML’20), Vienna, Austria, July 13-18, 2020.
- L. Luo, J. Pei, and H. Huang. “Sinkhorn Regression”. In Proceedings of the Twenty-ninth International Joint Conference on Artificial Intelligence and the Seventeenth Pacific Rim International Conference on Ar- tificial Intelligence (IJCAI-PRICAI’20), Yokohama, Japan, July 11-17, 2020.
- S. Gao, F. Huang, J. Pei, and H. Huang. “Discrete Model Compression with Resource Constraint”. In Proceedings of the 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’20), Seattle, WA, USA, June 14-18, 2020.
- Z. Cong[student], L. Chu[student], L. Wang, X. Hu[student], and J. Pei. “Exact and Consistent Interpretation of Piecewise Linear Models Hidden behind APIs: A Closed Form Solution”. In Proceedings of the Thirty-sixth IEEE International Conference on Data Engineering (ICDE’20), Dallas, TX, USA, April 20-24, 2020.
- Y. Yang[student], Z. Wang, T. Jin, J. Pei, and E. Chen. “Tracking Top-k Influential Users with Relative Errors”. In Proceedings of the Twenty-eighth ACM International Conference on Information and Knowledge Management (CIKM 2019) (long paper with oral presentation), Beijing, China, November 3-7, 2019.
- Z. Zhao[student], L. Chu[student], D. Tao, and J. Pei. “Classification with Label Noise: A Markov Chain Sampling Framework”. In Proceedings of the 2019 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD’19), Wu ̈rzburg, Germany, September 16-20, 2019.
- L. Chu[student], Z. Wang[visitor], J. Pei, Y. Zhang[student], Y. Yang[student], and E. Chen. “Finding Theme Communities from Database Networks”. In Proceedings of the Forty-fifth International Conference on Very Large Data Bases (VLDB’19), Los Angles, CA, USA, August 26-30, 2019.
- M. Dolatshah, M. Teoh[student], J. Wang, and J. Pei. “Cleaning Crowdsourced Labels Using Oracles for Statistical Classification”. In Proceedings of the Forty-fifth International Conference on Very Large Data Bases (VLDB’19), Los Angles, CA, USA, August 26-30, 2019.
- H. Gao, J. Pei and H. Huang. “ProGAN: Network Embeddings via Proximity Generative Adversarial Network” (oral presentation). In Proceedings of the Twenty-fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’19), Anchorage, AL, USA. August 4-8, 2019. (Acceptance rate: 110/1200)
- H. Gao, J. Pei and H. Huang. “Conditional Random Field Enhanced Graph Convolutional Neural Networks” (oral presentation). In Proceedings of the Twenty-fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’19), Anchorage, AL, USA. August 4-8, 2019. (Acceptance rate: 110/1200)
- K. Tu, J. Ma, P. Cui, J. Pei, and W. Zhu. “AutoNE: Hyperparameter Optimization for Massive Network Embeddings”. In Proceedings of the Twenty-fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’19), Anchorage, AL, USA. August 4-8, 2019. (Acceptance rate: 110/1200)
- C. Fan, Y. Zhang, Y. Pan, X. Li, C. Zhang, R. Yuan, D. Wu, W. Wang, J. Pei, and H. Huang. “MultiHorizon Time Series Forecasting with Temporal Attention Learning”. In Proceedings of the Twenty-fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’19), Anchorage, AL, USA. August 4-8, 2019. (Acceptance rate: 110/1200)
- S. Yu, B. Gu, K. Ning, H. Chen, J. Pei and H. Huang. “Tackle Balancing Constraint for Incremental Semi-Supervised Support Vector Learning”. In Proceedings of the Twenty-fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’19), Anchorage, AL, USA. August 4-8, 2019. (Acceptance rate: 170/1200)
- H. Gao, J. Pei, and H. Huang. “Demystifying Dropout”. In Proceedings of the Thirty-sixth International Conference on Machine Learning (ICML’19), Long Beach, CA, USA, June 9-15, 2019.
- W. Yang, L. Tan, C. Lu, A. Cui, H. Li, X. Chen, K. Xiong, M. Wang, M. Li, J. Pei, and J. Lin. “Detecting Customer Complaint Escalation with Recurrent Neural Networks and Manually-Engineered Features”. In Proceedings of the Seventeenth Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT’19), Minneapolis, MN,USA, June 2-7, 2019.
- D-W. Choi[student], J. Pei, and T. Heinis. “Efficient Mining of Regional Movement Patterns in Semantic Trajectories”. In Proceedings of the Forty-fourth International Conference on Very Large Data Bases (VLDB’18), Rio de Janeiro, Brazil, August 27-31, 2018.
- L. Chu[student], X. Hu[student], J. Hu[student], L Wang, and J. Pei. “Exact and Consistent Interpretation for Piecewise Linear Neural Networks: A Closed Form Solution”. In Proceedings of the Twenty-fourth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’18), London, United Kingdom, August 19-23, 2018. (Acceptance rate: 107/983)
- Z. Zhang, P. Cui, X. Wang, J. Pei, X. Yao, and W. Zhu. “Arbitrary-Order Proximity Preserved Network Embeddings”. In Proceedings of the Twenty-fourth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’18), London, United Kingdom, August 19-23, 2018. (Acceptance rate: 107/983)
- L. Luo, W. Zhang, Z. Zhang, W. Zhu, T. Zhang, and J. Pei. “Sketched Follow-The-Regularized-Leader for Online Factorization Machine”. In Proceedings of the Twenty-fourth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’18), London, United Kingdom, August 19-23, 2018. (Acceptance rate: 181/983)
- J. Peng, D. Zhang[student], J. Wang, and J. Pei. “AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics”. In Proceedings of the 2018 ACM SIGMOD International Conference on Management of Data (SIGMOD’18), Huston, TX, USA, June 10-15, 2018.
- X. Lin, W. Zhang, M. Zhang, P. Zhao, W. Zhu, J. Pei and J. Huang. “Online Compact Convexified Factorization Machine”. In Proceedings of the Twenty-seventh International World Wide Web Conference (WWW’18), Lyon, France, April 23-27, 2018.
- Y. Yang[student], L. Chu[student], Y. Zhang[student], Z. Wang[student], J. Pei, and E. Chen. “Mining Density Contrast Subgraphs”. In Proceedings of the Thirty-fourth IEEE International Conference on Data Engineering (ICDE’18), Paris, France, April 16-19, 2018.
- J. Liu, J. Yang, L. Xiong, J. Pei, and J. Luo. “Skyline Diagram: Finding the Voronoi Counterpart for Skyline Queries”. In Proceedings of the Thirty-fourth IEEE International Conference on Data Engineering (ICDE’18), Paris, France, April 16-19, 2018.
- Z. Zhang, P. Cui, J. Pei, X. Wang, and W. Zhu. “TIMERS: Error-Bounded SVD Restart on Dynamic Networks”. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), New Orleans, LA, February 2-7, 2018. (Acceptance rate: 933/3800+)
- C. Gao[student], J. Wang, J. Pei, R. Li, and Y. Chang. “Preference-driven Similarity Join”. In Proceedings of the 2017 IEEE/WIC/ACM International Conference on Web Intelligence (WI’17) (long paper and Best Student Paper Award), Leipzig, Germany, August 23-26, 2017.
- C. Gao[student], J. Pei, J. Wang, and Y. Chang. “Schemaless Join for Result Set Preferences”. In Proceedings of the Eighteenth IEEE International Conference on Information Reuse and Integration (IRI 2017) (full regular paper for oral presentation), San Diego, CA, USA, August 4-6, 2017.
- C-Y. Kuo, M-Y. Yeh, and J. Pei. “Principal Pattern Mining on Graphs”. In Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’17), Sydney, Australia, July 31-August 3, 2017.
- J. Liu, J. Yang, L. Xiong, and J. Pei. “Secure Skyline Queries on Cloud Platform”. In Proceedings of the Thirty-third IEEE International Conference on Data Engineering (ICDE’17), San Diego, CA, USA, April 19-22, 2017.
- X. Wang, P. Cui, J. Wang, J. Pei, W. Zhu, and S. Yang. “Community Preserving Network Embeddings”. InProceedings of the Thirty-first AAAI Conference on Artificial Intelligence (AAAI’17) (oral presentation), San Francisco, CA, USA, February 4-9, 2017. (Acceptance rate: 638/2590)
- Z. Zheng, D. Wang, J. Pei, Y. Yuan, C. Fan, and L. Xiao. “Urban Traffic Prediction through the Second Use of Inexpensive Big Data from Buildings”. In Proceedings of the Twenty-fifth ACM International Conference on Information and Knowledge Management (CIKM 2016) (Industry track), Indianapolis, IN, USA, October 24-28, 2016. (Acceptance rate: 22/111)
- J. Liu, L. Xiong, J. Pei, J. Luo, and H. Zhang. “Finding Pareto Optimal Groups: Group-based Skyline”. In Proceedings of the Forty-second International Conference on Very Large Data Bases (VLDB’16), New Delhi, India, September 5-9, 2016.
- Z. Wang[visitor], L. Chu[student], J. Pei, E. Chen, and A. Al-Barakati. “Tradeoffs between Density and Size in Extracting Dense Subgraphs: A Unified Framework” (full paper). In Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’16), San Francisco, CA, USA, August 18-21, 2016. (Acceptance rate: 43/316)
- L. Chu[student], Z. Wang[visitor], J. Pei, J. Wang, Z. Zhao, and E. Chen. “Finding Gangs in War from Signed Networks”(full paper with poster presentation). In Proceedings of the Twenty-second ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’16), San Francisco, CA, USA, August 13-17, 2016. (Acceptance rate: 142/784)
- M. Ou, P. Cui, J. Pei, and W. Zhu. “Asymmetric Transitivity Preserving Graph Embeddings” (full paper with oral presentation). In Proceedings of the Twenty-second ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’16), San Francisco, CA, USA, August 13-17, 2016. (Acceptance rate: 70/784)
- H-J. Hung, H-H. Shuai, D-N. Yang, L-H. Huang, W-C. Lee, J. Pei, and M-S. Chen. “When Social Influence Meets Item Inference” (full paper with oral presentation). In Proceedings of the Twenty-second ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’16), San Francisco, CA, USA, August 13-17, 2016. (Acceptance rate: 70/784)
- Y. Yang[student], X. Mao[student], J. Pei, and X. He. “Continuous Influence Maximization: What Discounts Should We Offer to Social Network Users?”. In Proceedings of the 2016 ACM SIGMOD International Conference on Management of Data (SIGMOD’16), San Francisco, CA, USA, June 26-July 1, 2016.
- D-W. Choi[student], J. Pei, and X. Lin. “Finding the Minimum Spatial Keyword Cover”. In Proceedings of the Thirty-second IEEE International Conference on Data Engineering (ICDE’16), Helsinki, Finland, May 16-20, 2016.
- J. Hu[student], Q. Qian, J. Pei, R. Jin, and S. Zhu. “Multi-clustering via Clustering Stability”. In Proceedings of the Fifteenth IEEE International Conference on Data Mining series (ICDM’15), Atlantic City, NJ, USA, November 14-17, 2015. (Acceptance rate: 8.4%)
- L. Duan[visitor], G. Tang[student], J. Pei, J. Bailey, A. Campbell[student], and C. Tang. “Mining Outlying Aspects on Numeric Data”. In Proceedings of the 2015 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD’15), Porto, Portugal, September 7-11, 2015. (Journal track)
- L. Chu, S. Wang, S. Liu, Q. Huang, and J. Pei. “ALID: Scalable Dominant Cluster Detection”. InProceedings of the Forty-first International Conference on Very Large Data Bases (VLDB’15), Kohala Coast, HI, USA, August 31-September 4, 2015.
- Y. Zhang, J. Tang, Z. Yang, J. Pei, and P.S. Yu. “COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency”. In Proceedings of the Twenty-first ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’15), Sydney, NW, Australia, August 10-13, 2015. (Acceptance rate: 159/819)
- K. Yu[student], D. Wang, W. Ding, D. Small, J. Pei, X. Wu, and S. Islam. “Tornado Forecasting with Multiple Markov Boundaries”. In Proceedings of the Twenty-first ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’15), Sydney, NW, Australia, August 10-13, 2015.
- N. X. Vinh, J. Chan, J. Bailey, C. Leckie, K. Ramamohanarao, and J. Pei. “Scalable Outlying-Inlying Aspects Discovery via Feature Ranking”. In Proceedings of the Nineteenth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’15), Ho Chi Minh City, Viet Nam, May 19-22, 2015. (Acceptance rate: 90/405)
- Y.-F. Lin, H.-H. Chen, V. S. Tseng, and J. Pei. “Reliable Early Classification on Multivariate Time Series with Numerical and Categorical Attributes”. In Proceedings of the Nineteenth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’15), Ho Chi Minh City, Viet Nam, May 19-22, 2015. (Acceptance rate: 90/405)
- J. Wang, S. Song, X. Lin, X. Zhu, and J. Pei. “Cleaning Structured Event Logs: A Graph Repair Approach”. In Proceedings of the Thirty-first IEEE International Conference on Data Engineering (ICDE’15), Seoul, Korea, April 13-17, 2015. (Acceptance rate: 103/409)
- Z. Yu, X. Yu, Y. Liu, W. Li, and J. Pei. “Mining Frequent Co-occurrence Patterns across Multiple Data Streams”. In Proceedings of the Eighteenth International Conference on Extending Database Technology (EDBT’15), Brussels, Belgium, March 23-27, 2015. (Acceptance rate: 47/184)
- L. Chang, X. Lin, L. Qin, J.X. Yu, and J. Pei, “Efficiently Computing Top-K Shortest Path Join”. InProceedings of the Eighteenth International Conference on Extending Database Technology (EDBT’15), Brussels, Belgium, March 23-27, 2015. (Acceptance rate: 47/184)
- K. Yu[student], W. Ding, X. Wu, and J. Pei. “Towards Scalable and Accurate Online Feature Selection for Big Data”. In Proceedings of the Fourteenth IEEE International Conference on Data Mining (ICDM’14), Shenzhen, China, December 14-17, 2014. (Full paper acceptance rate: 71/727)
- T. Guo, X. Zhu, J. Pei, and C. Zhang. “SNOC: Streaming Network Node Classification”. In Proceedings of the Fourteenth IEEE International Conference on Data Mining (ICDM’14), Shenzhen, China, December 14-17, 2014. (Full paper acceptance rate: 71/727)
- J. Han, J. Wen, and J. Pei. “Within-Network Classification Using Radius-Constrained Neighborhood Patterns”. In Proceedings of the Twenty-third ACM International Conference on Information and Knowledge Management (CIKM’14), Shanghai, China, November 3-7, 2014. (Knowledge Management track acceptance rate: 95/457)
- G. Tang[student], K. Wu, J. Pei, J. Tang, and J. Lei. “An Appliance-driven Approach to Detection of Corrupted Load Curve Data”. In Proceedings of the Twenty-third ACM International Conference on Information and Knowledge Management (CIKM’14), Shanghai, China, November 3-7, 2014. (Database track acceptance rate: 25/123)
- X. Hu[visitor], J. Pei, and Y. Tao[visitor]. “Shortest Unique Queries on Strings”. In Proceedings of the Twenty-first International Symposium on String Processing and Information Retrieval (SPIRE’14), Ouro Preto, Brazil, October 20-23, 2014. (Full paper acceptance rate: 20/45)
- W. Yu, X. Lin, W. Zhang, L. Chang, and J. Pei. “More is Simpler: Effectively and Efficiently Assessing Node-Pair Similarities Based on Hyperlinks”. In Proceedings of the Fortieth International Conference on Very Large Data Bases (VLDB’14), Hangzhou, China, September 1-5, 2014.
- Q. Qian, J. Hu[student], R. Jin, and J. Pei. “Distance Metric Learning Using Dropout: A Structured Regularization Approach”. In Proceedings of the Twentieth ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’14), New York, NY, USA, August 24-27, 2014. (Acceptance rate: 151/1036)
- L. Zhang[visitor], J. Pei, Y. Jia, B. Zhou, and X. Wang. “Do Neighbor Buddies Make a Difference in Reblog Likelihood? An Analysis on SINA Weibo Data” (full paper with long presentation). In Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Network Analysis and Mining (ASONAM’14), Beijing, China, August 17-20, 2014. (Full paper acceptance rate: 18%)
- L. Duan[visitor], G. Tang[student], J. Pei, J. Bailey, G. Dong, A. Campbell[student], and C. Tang. “Mining Contrast Subspaces”. (Regular paper with long presentation) In Proceedings of the Eighteenth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’14), Tainan, Taiwan, May 13-16, 2014. (Acceptance rate: 40/371)
- J. Chan, X.V. Nguyen, W. Liu, J. Beiley, C. Leckie, K. Ramamohanarao, and J. Pei. “Structure-aware Distance Measures for Comparing Clusterings in Graphs”. (Regular paper with short presentation) In Proceedings of the Eighteenth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’14), Tainan, Taiwan, May 13-16, 2014. (Acceptance rate: 101/371)
- Y. Wang, J. Pei, X. Lin, and Q. Zhang. “An Iterative Fusion Approach to Graph-based Semi-supervised Learning from Multiple Views”. (Regular paper with short presentation) In Proceedings of the Eighteenth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’14), Tainan, Taiwan, May 13-16, 2014. (Acceptance rate: 101/371)
- J. Hu[student], J. Pei, and J. Tang. “How Can I Index My Thousands of Photos Effectively and Automatically? An Unsupervised Feature Selection Approach”. (Full paper) In Proceedings of the Fourteenth SIAM International Conference on Data Mining (SDM’14), Philadelphia, PA, USA, April 24-26, 2014.
- Y. Li, J. Bailey, L. Kulik, and J. Pei. “Efficient Matching of Substrings in Uncertain Sequences”. (Poster) In Proceedings of the Fourteenth SIAM International Conference on Data Mining (SDM’14), Philadelphia, PA, USA, April 24-26, 2014.
- G. Tang[student], Y. Yang, and J. Pei. “Price Information Patterns in Web Search Advertising: An Empirical Case Study on Accommodation Industry”. In Proceedings of the Thirteenth IEEE International Conference on Data Mining (ICDM’13), Dallas, TX, USA, December 7-10, 2013. (Acceptance rate: 94/809)
- C.L. Kam, C. Ra ̈ıssi, M. Kaytoue, and J. Pei. “Mining Statistically Significant Sequential Patterns”. InProceedings of the Thirteenth IEEE International Conference on Data Mining (ICDM’13), Dallas, TX, USA, December 7-10, 2013. (Acceptance rate: 94/809)
- Y. Li, J. Bailey, L. Kulik, and J. Pei. “Mining Probabilistic Frequent Spatio-Temporal Sequential Patterns with Gap Constraints from Uncertain Databases”. In Proceedings of the Thirteenth IEEE International Conference on Data Mining (ICDM’13), Dallas, TX, USA, December 7-10, 2013. (Acceptance rate: 94/809)
- X. Mao[student], B. Lin, X. He, D. Cai, and J. Pei. “Parallel Field Alignment for Cross Media Retrieval”. In Proceedings of the Twenty-first ACM International Conference on Multimedia (MM’13), Barcelona, Catalunya, Spain, October 21-25, 2013.
- Y-C. Lo, J-Y. Li, M-Y. Yeh, S-D. Lin, and J. Pei. “What Distinguish One from Its Peers in Social Networks?”. In Proceedings of the 2013 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD’13), Prague, Czech Republic, September 23-27, 2013. (Journal track, acceptance rate: 14/182)
- Y. Wang, P. Wang, J. Pei, and W. Wang. “A Data-adaptive and Dynamic Segmentation Index for Whole Matching on Time Series”. In Proceedings of the Thirty-Ninth International Conference on Very Large Data Bases (VLDB’13), Riva del Garda, Trento, Italy, August 26-30, 2013.
- G. Tang[student], J. Pei, J. Bailey, and G. Dong. “Mining Multidimensional Contextual Outliers from Categorical Relational Data”. In Proceedings of the Twenty-fifth International Conference on Scientific and Statistical Database Management (SSDBM’13), Baltimore, Maryland, USA, July 29-31, 2013.
- Y. Xiong, Y. Zhu, J. Pei, and P.S. Yu. “Towards Cohesive Anomalies Mining”. In Proceedings of the Twenty-seventh AAAI Conference on Artificial Intelligence (AAAI’13), Bellevue, WA, USA, July 14-18, 2013.
- J. Pei, W. C-H. Wu, and M-Y. Yeh[visitor]. “On Shortest Unique Substring Queries”. In Proceedings of the Twenty-ninth IEEE International Conference on Data Engineering (ICDE’13), Brisbane, Queensland, Australia, April 8-12, 2013. (Acceptance rate: 20%)
- H. Maserrat[student] and J. Pei. “Community Preserving Lossy Compression of Social Networks”. InProceedings of the Twelfth IEEE International Conference on Data Mining (ICDM’12), Brussels, Belgium, December 10-13, 2012. (Acceptance rate: 81/756)
- T. Dwyer, A. Fedorova, S. Blagodurov, M. Roth, F. Gaud, and J. Pei. “A Practical Method for Estimating Performance Degradation on Multicore Processors and its Application to HPC Workloads”. In Proceedings of the Twenty-fifth International Conference for High Performance Computing, Networking, Storage and Analysis (SC’12), Salt Lake City, UT, USA. November 10-16, 2012. (Acceptance rate: 100/472)
- W. Liu, A. Kan, J. Chan, J. Bailey, C. Leckie, R. Kotagiri, and J. Pei. “On Compressing Weighted Time-evolving Graphs”. In Proceedings of the Twenty-first ACM International Conference on Information and Knowledge Management (CIKM’12), Maui, HI, USA, October 29-November 2, 2012.
- Y. Qian, H. Li, D. Jiang, Y. Hu, J. Pei, and Q. Zheng. “Mining Query Subtopics from Search Log Data”. In Proceedings of the Thirty-fifth Annual ACM SIGIR Conference (SIGIR’12), Portland, OR, USA, August 12-16, 2012. (Acceptance rate: 98/483)
- W.C-H. Wu, M-Y. Yeh, and J. Pei, “Random Error Reduction in Similarity Search on Time Series: A Statistical Approach”. In Proceedings of the Twenty-eighth IEEE International Conference on Data Engineering (ICDE’12), Washington, DC, USA, April 1-5, 2012. (Acceptance rate: 17.7%)
- M. Hua[student] and J. Pei, “Aggregate Queries on Probabilistic Record Linkages”. In Proceedings of the Fifteenth International Conference on Extending Database Technology (EDBT’12), Berlin, Germany, March 26-30, 2012. (Acceptance rate: 43/193)
- C. Wang, L.Y. Yuan, J-H. You, O.R. Zaiane, and J. Pei. “On Pruning for Top-k Ranking in Uncertain Databases”. In Proceedings of the Thirty-seventh International Conference on Very Large Data Bases (VLDB’11), Seattle, WA, USA, August 29-September 3, 2011.
- C. Ra ̈ıssi and J. Pei. “Towards Bounding Sequential Patterns”. In Proceedings of the Seventeenth ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’11), San Diego, CA, USA, August 21-24, 2011. (Acceptance rate: 125/714)
- Y. Tao, C. Sheng[visitor], and J. Pei. “On k-skip Shortest Paths”. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD’11), Athens, Greece, June 12-16, 2011.
- Z. Xing[student], J. Pei, P. S. Yu, and K. Wang. “Extracting Interpretable Features for Early Classification on Time Series”. In Proceedings of the Eleventh SIAM International Conference on Data Mining (SDM’11), Phoenix, AZ, USA, April 28-30, 2011. (Acceptance rate: 86/343)
- B. Jiang[student] and J. Pei. “Outlier Detection on Uncertain Data: Objects, Instances, and Inferences”. In Proceedings of the Twenty-seventh IEEE International Conference on Data Engineering (ICDE’11), Hannover, Germany, April 11-16, 2011. (Acceptance rate: 98/494)
- D. Kang, D. Jiang[student], J. Pei, Z. Liao, X. Sun, and H-J. Choi. “Multidimensional Mining of Large-Scale Search Logs: A Topic-Concept Cube Approach. In Proceedings of the Fourth ACM International Conference on Web Search and Data Mining (WSDM’11), Hong Kong, China, February 9-12, 2011. (Acceptance rate: 83/372)
- Q. He, D. Kifer, J. Pei, P. Mitra, and L. Giles. “Citation Recommendation without Author Supervision”. In Proceedings of the Fourth ACM International Conference on Web Search and Data Mining (WSDM’11), Hong Kong, China, February 9-12, 2011. (Acceptance rate: 83/372)
- R. C.-W. Wong, A. W.-C. Fu, K. Wang, Y. Xu, J. Pei, and P.S. Yu. “Probabilistic Inference Protection on Uncertain Data”. In Proceedings of the Tenth IEEE International Conference on Data Mining (ICDM’10), Sydney, Australia, December 14-17, 2010. (Acceptance rate: 155/797)
- C. Ra ̈ıssi, J. Pei, and T. Kister. “Computing Closed Skycubes”. In Proceedings of the Thirty-sixth International Conference on Very Large Data Bases (VLDB’10), Singapore, September 13-17, 2010. (Acceptance rate: 16.1%)
- H. Maserrat[student] and J. Pei. “Neighbor Query Friendly Compression of Social Networks”. In Proceedings of the Sixteenth ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’10), Washington, DC, USA, July 25-28, 2010. (Acceptance rate: 77/578)
- B. Xiang, D. Jiang[student], J. Pei, X. Sun, E. Chen, and H. Li. “Context-Aware Ranking in Web Search”. In Proceedings of the Thirty-third Annual ACM SIGIR Conference (SIGIR’10), Geneva, Switzerland, July 19-23, 2010. (Acceptance rate: 87/520)
- Y. Tao, K. Yi, C. Sheng, J. Pei, and F. Li. “Logging Every Footstep: Quantile Summaries for the Entire History”. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data (SIGMOD’10), Indianapolis, Indiana, USA, June 6-11, 2010. (Acceptance rate: 20%)
- Q. He, J. Pei, D. Kifer, P. Mitra, and C.L. Giles. “Context-aware Citation Recommendation”. In Proceedings of the Nineteenth International World Wide Web Conference (WWW’10), Raleigh, NC, USA, April 26-30, 2010. (Acceptance rate: 104/743)
- M. Hua[student]and J. Pei. “Probabilistic Path Queries in Road Networks: Traffic Uncertainty Aware Path Selection”. In Proceedings of the Thirteenth International Conference on Extending Database Technology (EDBT’10), Lausanne, Switzerland, March 22-26, 2010. (Acceptance rate: 54/307)
- Y. Tao, J. Pei, J. Li, X. Xiao, K. Yi, and Z. Xing[student]. “Hiding Correlation by Independence Masking”. In Proceedings of the Twenty-Sixth International Conference on Data Engineering (ICDE’10), Long Beach, California, USA, March 1-6, 2010. (Acceptance rate: 20%)
- Q. He, B. Chen, J. Pei, B. Qiu, P. Mitra, and C. L. Giles. “Detecting Topic Evolution in Scientific Literature: How Can Citations Help?”. In Proceedings of the Eighteenth ACM Conference on Information and Knowledge Management (CIKM’09), Hong Kong, November 2-6, 2009. (Acceptance rate: 123/847)
- X. Cheng, J. Xu[student], J. Pei, and J. Liu. “Hierarchical Distributed Data Classification in Wireless Sensor Networks”. In Proceedings of the Sixth IEEE International Conference on Mobile Ad Hoc and Sensor Systems (MASS’09), Macau, China, October 12-15, 2009. (Acceptance rate: 62/245, selected for the special issue in Computer Communication.)
- Y. Zhao, H. Zhang, L. Cao, J. Pei, and C. Zhang. “Debt Detection in Social Security by Sequence Classification Using Both Positive and Negative Patterns”. In Proceedings of the 2009 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD’09), Bled, Slovenia, September 7-11, 2009.
- Z. Xing[student], J. Pei, and P. S. Yu. “Early Classification on Time Series: A Nearest Neighbor Approach”. In Proceedings of the Twenty-first International Joint Conference on Artificial Intelligence (IJCAI’09), Pasadena, CA, USA, July 14-17, 2009. (Acceptance rate: 331/1, 290)
- H. Zhong, T. Xie, L. Zhang, J. Pei, and H. Mei. “MAPO: Mining and Recommending API Usage Patterns”. In Proceedings of the Twenty-third European Conference on Object-Oriented Programming (ECOOP 2009), Genova, Italy, July 6-10, 2009. (Acceptance rate: 25/117)
- B. Zhou[student], D. Jiang[student], J. Pei, and H. Li. “OLAP on Search Logs: An Infrastructure Supporting Data-Driven Applications in Search Engines”. In Proceedings of the Fifteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’09), Paris, France, June 28 – July 1, 2009. (Acceptance rate: 12/122)
- J. Wang, X. He, C. Wang, J. Pei, J. Bu, C. Chen, Z. Guan, W. V. Zhang. “Can We Learn a Template-Independent Wrapper for News Article Extraction from a Single Training Site?”. In Proceedings of the Fifteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’09), Paris, France, June 28 – July 1, 2009.
- Y. Han[visitor], B. Zhou[student], J. Pei and Y. Jia. “Understanding Importance of Collaborations in Co-authorship Networks: A Supportiveness Analysis Approach”. In Proceedings of 2009 SIAM International Conference on Data Mining (SDM’09), April 30 – May 2, 2009, Sparks, Nevada. (Acceptance rate: 54/351)
- H. Cao, D. Jiang[student], J. Pei, E. Chen, and H. Li. “Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs”. In Proceedings of the Eighteenth International World Wide Web Conference (WWW’09), April 20-24, 2009, Madrid, Spain. (Acceptance rate: 104/888)
- J. Wang, X. He, C. Wang, J. Pei, J. Bu, C. Chen, and Z. Guan. “News Article Extraction with Template-Independent Wrapper”. In Proceedings of the Eighteenth International World Wide Web Conference (WWW’09) (poster), April 20-24, 2009, Madrid, Spain. (Acceptance rate: 93/225)
- B. Jiang[student] and J. Pei. “Online Interval Skyline Queries on Time Series”. In Proceedings of the Twenty-fifth IEEE International Conference on Data Engineering (ICDE’09), March 29 – April 4, 2009, Shanghai, China. (Acceptance rate: 93/554)
- Y. Tao, L. Ding, X. Lin, and J. Pei. “Distance-based Representative Skyline”. In Proceedings of the Twenty-fifth IEEE International Conference on Data Engineering (ICDE’09), March 29 – April 4, 2009, Shanghai, China. (Acceptance rate: 93/554)
- J. Pei, Y. Tao, J. Li, X. Xiao. “Privacy Preserving Publishing on Multiple Quasi-Identifiers”. In Proceedings of the Twenty-fifth IEEE International Conference on Data Engineering (ICDE’09), March 29 – April 4, 2009, Shanghai, China. (Acceptance rate: 150/554)
- B. Zhou[student] and J. Pei. “Answering Aggregate Keyword Queries on Relational Databases Using Minimal Group-bys”. In Proceedings of the Twelfth International Conference on Extending Database Technology (EDBT’09), March 23-26, 2009, Saint-Petersburg, Russia. (Acceptance rate: 92/283)
- Y. Xiao, W. Wu, J. Pei, W. Wang, and Z. He. “Efficiently Indexing Shortest Paths by Exploiting Symmetry in Graphs”. In Proceedings of the Twelfth International Conference on Extending Database Technology (EDBT’09), March 23-26, 2009, Saint-Petersburg, Russia. (Acceptance rate: 92/283)
- B. Zhou[student], Y. Han[visitor], J. Pei, B. Jiang[student], Y. Tao, and Y. Jia. “Continuous Privacy Preserving Publishing of Data Streams”. In Proceedings of the Twelfth International Conference on Extending Database Technology (EDBT’09), March 23-26, 2009, Saint-Petersburg, Russia. (Acceptance rate: 92/283)
- K. Tsoukalas[student], B. Zhou[student], J. Pei, and D. Cubranic. “PLEDS: A Personalized Entity Detection System Based on Web Log Mining Techniques” (industrial track). In Proceedings of the Twelfth International Conference on Extending Database Technology (EDBT’09), March 23-26, 2009, Saint-Petersburg, Russia.
- Y. Xu, B. Fang, K. Wang, A. W.-C. Fu, and J. Pei. “Publishing Sensitive Transactions for Itemset Utility”. In Proceedings of the Eighth IEEE International Conference on Data Mining (ICDM’08), December 15-19, 2008, Pisa, Italy. (Acceptance rate: 144/724)
- B. Jiang[student], J. Pei, X. Lin, D. W.-L. Cheung, and J. Han. “Mining Preferences from Superior and Inferior Examples”. In Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’08), August 24-27, 2008, Las Vegas, NV, USA. (Acceptance rate: 95/510)
- H. Cao, D. Jiang[student], J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. “Context-Aware Query Suggestion by Mining Click-Through and Session Data” (Best Application Paper Award). In Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’08), August 24-27, 2008, Las Vegas, NV, USA. (Acceptance rate: 12/83)
- R. C.-W. Wong[visitor], A. W.-C. Fu, J. Pei, Y. S. Ho, T. Wong, and Y. Liu. “Efficient Skyline Querying with Variable User Preferences on Nominal Attributes”. In Proceedings of the Thirty-fourth International Conference on Very Large Databases (VLDB’08), August 24-30, 2008, Auckland, New Zealand. (Acceptance rate: 46/273)
- K. Tsoukalas[student], B. Zhou[student], J. Pei, and D. Cubranic. “PLEDS: A Personalized Entity Detection System Based on Web Log Mining Techniques” (invited paper). In Proceedings of the Ninth International Conference on Web-Age Information Management (WAIM’08), July 20-22, 2008, Zhangjiajie, China.
- W. Zhang, X. Lin, J. Pei, and Y. Zhang. “Managing Uncertain Data: A Probabilistic Approach” (invited paper). In Proceedings of the Ninth International Conference on Web-Age Information Management (WAIM’08), July 20-22, 2008, Zhangjiajie, China.
- M. Hua[student], J. Pei, W. Zhang, and X. Lin. “Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach”. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD’08), Vancouver, BC, Canada, June 9-12, 2008. (Acceptance rate: 78/435)
- E. Soroush, K. Wu, and J. Pei. “Fast and Quality-Guaranteed Data Streaming in Resource-Constrained Sensor Networks”. In Proceedings of the Ninth ACM International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc’08), Hong Kong, China, May 26-30, 2008. (Acceptance rate: 44/300)
- B. Zhou[student], J. Pei, and Z. Tang. “A Spamicity Approach to Unsupervised Web Spam Detection” (full paper). In Proceedings of the 2008 SIAM International Conference on Data Mining (SDM’08), Atlanta, GA, USA, April 24-26, 2008. (Acceptance rate: 40/282)
- Z. Xing[student], J. Pei, G. Dong, and P. S. Yu. “Mining Sequence Classifiers for Early Prediction” (poster paper). In Proceedings of the 2008 SIAM International Conference on Data Mining (SDM’08), Atlanta, GA, USA, April 24-26, 2008. (Acceptance rate: 77/282)
- B. Zhou[student] and J. Pei. “Preserving Privacy in Social Networks against Neighborhood Attacks” (full presentation paper). In Proceedings of the Twenty-fourth International Conference on Data Engineering (ICDE’08), Cancun, Mexico, April 7-12, 2008. (Acceptance rate: 12.1%)
- M. Hua[student], J. Pei, W. Zhang, and X. Lin. “Efficiently Answering Probabilistic Threshold Top-k Queries on Uncertain Data” (poster paper). In Proceedings of the Twenty-fourth International Conference on Data Engineering (ICDE’08), Cancun, Mexico, April 7-12, 2008. (Acceptance rate: 31%)
- X. Zeng[student], J. Pei, I. Vergara, M. Nesbitt, K. Wang, and N. Chen. “OrthoCluster: A New Tool for Mining Syntenic Blocks and Applications in Comparative Genomics”. In Proceedings of the Eleventh International Conferences on Extending Database Technology (EDBT’08), Nantes, France, March 25-30, 2008.
- B. C. M. Fung, K. Wang, A. W.-C. Fu, and J. Pei. “Anonymity for Continuous Data Publishing”. In Proceedings of the Eleventh International Conferences on Extending Database Technology (EDBT’08), Nantes, France, March 25-30, 2008.
- F. M. Jiang, J. Pei, and A. W.-C. Fu. “IX-Cubes: Iceberg Cubes for Data Warehousing and OLAP on XML Data”. In Proceedings of the ACM Sixteenth Conference on Information and Knowledge Management (CIKM’07), Lisbon, Portugal, November 6-9, 2007.
- J. Pei, B. Jiang[student], X. Lin, and Y. Yuan. “Probabilistic Skylines on Uncertain Data”. In Proceedings of the Thirty-third International Conference on Very Large Data Bases (VLDB’07), Vienna, Austria, September 23-28, 2007. (Acceptance rate: 46/263)
- M. Hua[student], J. Pei, A. W.-C. Fu, X. Lin, and H-F Leung. “Efficiently Answering Top-k Typicality Queries on Large Databases”. In Proceedings of the Thirty-third International Conference on Very Large Data Bases (VLDB’07), Vienna, Austria, September 23-28, 2007. (Acceptance rate: 46/263)
- R. C.-W. Wong, A. W.-C. Fu, K. Wang, and J. Pei. “Minimality Attack in Privacy Preserving Data Publishing”. In Proceedings of the Thirty-third International Conference on Very Large Data Bases (VLDB’07), Vienna, Austria, September 23-28, 2007. (Acceptance rate: 46/263)
- M. Acharya, T. Xie, J. Pei, and J. Xu. “Mining API Patterns as Partial Orders from Source Code: From Usage Scenarios to Specifications”. In Proceedings of the Sixth Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE’07), Dubrovnik, Croatia, September 3-7, 2007. (Acceptance rate: 43/251)
- M. Hua[student] and J. Pei. “Cleaning Disguised Missing Data: A Heuristic Approach”. In Proceedings of the Thirteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’07), San Jose, California, USA, August 12-15, 2007.
- R. C.-W. Wong[visitor], J. Pei, A. W.-C. Fu, and K. Wang. “Mining Favorable Facets”. In Proceedings of the Thirteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’07), San Jose, California, USA, August 12-15, 2007.
- J. Pei, J. Xu, Z. Wang, W. Wang, and K. Wang. “Maintaining K-Anonymity against Incremental Updates”. In Proceedings of the Nineteenth International Conference on Scientific and Statistical Database Management (SSDBM’07), Banff, Canada, July 9-11, 2007.
- R. C.-W. Wong, Y. Liu, J. Yin, Z. Huang, A. W.-C. Fu, and J. Pei. “(α,k)-anonymity Based Privacy Preservation by Lossy Join”. In Proceedings of the Ninth Asia-Pacific Web Conference and the Eighth International Conference on Web-Age Information Management (APWEB/WAIM’07), Huangshan, China, June 16-18, 2007. (Acceptance rate: 49/554)
- J. Pei, M. K. Lau[student], and P. S. Yu. “TS-Trees: A Non-Alterable Search Tree Index for Trustworthy Databases on Write-Once-Read-Many (WORM) Storage” (IEEE Outstanding Paper Award). In Proceedings of the IEEE Twenty-first International Conference on Advanced Information Networking and Applications (AINA’07), Niagara Falls, ON, Canada, May 21-23, 2007. (Acceptance rate: 134/444)
- B. Zhou[student] and J. Pei. “Sketching Landscapes of Page Farms”. In Proceedings of the 2007 SIAM International Conference on Data Mining (SDM’07), Minneapolis, MN, USA, April 26-28, 2007. (Acceptance rate: 76/296)
- Y. Bu, T.-W. Leung, A. W.-C. Fu, E. Keogh, J. Pei, and S. Meshkin. “WAT: Finding Top-K Discords in Time Series Database”. In Proceedings of the 2007 SIAM International Conference on Data Mining (SDM’07), Minneapolis, MN, USA, April 26-28, 2007. (Acceptance rate: 76/296)
- J. Pei, A. W. C. Fu, X. Lin, and H. Wang. “Computing Compressed Skyline Cubes Efficiently”. InProceedings of the twenty-third IEEE International Conference on Data Engineering (ICDE’07), Istanbul, Turkey, April 16-20, 2007. (Acceptance rate: 122/659)
- Y. Liu, L. Chen, J. Pei, Q. Chen, and Y. Zhao. “Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays” (one of the three papers in the best paper session). In Proceedings of the Fifth Annual IEEE International Conference on Pervasive Computing and Communications (PerCom’07), White Plains, NY, USA, March 19-23, 2007. (Acceptance rate: 20/208)
- B.-W. On, E. Elmacioglu, D. Lee, J. Kang, and J. Pei. “Improving Grouped-Entity Resolution using Quasi-Cliques”. In Proceedings of the 2006 IEEE International Conference on Data Mining (ICDM’06), Hong Kong, December 18-22, 2006. (Acceptance rate: less than 20%)
- Y. Xu, K. Wang, A. W. C. Fu, R. She, and J. Pei. “Classification Spanning Correlated Data Streams”. In Proceedings of the ACM Fifteenth Conference on Information and Knowledge Management (CIKM’06), Arlington, VA, USA, November 6-11, 2006. (Acceptance rate: 15%)
- W. Zhu, J. Pei, J. Yin, Y. Xie. “Granularity Adaptive Density Estimation and on-Demand Clustering of Concept-Drifting Data Streams”. In Proceedings of the Eighth International Conference on Data Warehousing and Knowledge Discovery (DaWaK’06), Krakow, Poland, September 4-8, 2006. (Acceptance rate: 52/199)
- J. Li, R. C.-W. Wong, A. W.-C. Fu, and J. Pei. “Achieving k-Anonymity by Clustering in Attribute Hierarchical Structures” (one of the six best papers selected for the journal special issue). In Proceedings of the Eighth International Conference on Data Warehousing and Knowledge Discovery (DaWaK’06), Krakow, Poland, September 4-8, 2006. (Acceptance rate: 52/199)
- C. Aggarwal, J. Pei, and B. Zhang[student]. “On Privacy Preservation against Adversarial Data Mining”. In Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’06), Philadelphia, PA, USA, August 20-23, 2006. (Acceptance rate: 22%)
- J. Xu, W. Wang, J. Pei, X. Wang, B. Shi, and A. W.-C. Fu. “Utility-Based Anonymization Using Local Recoding”. In Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’06), Philadelphia, PA, USA, August 20-23, 2006. (Acceptance rate: 22%)
- H. Wang, J. Yin, J. Pei, P. S. Yu, and J. X. Yu. “Suppressing Model Overfitting in Mining Concept-Drifting Data Streams”. In Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’06), Philadelphia, PA, USA, August 20-23, 2006. (Acceptance rate: 22%)
- J. Li, H. Li, L. Wong, J. Pei, and G. Dong. “Minimum Description Length (MDL) Principle: Generators Are Preferable to Closed Patterns”. In Proceedings of the Twenty-first National Conference on Artificial Intelligence (AAAI’06), Boston, MA, USA, July 16-20, 2006. (Acceptance rate: 171/774)
- B.-W. On, D. Lee, E. Elmacioglu, J. Kang, and J. Pei. “An Effective Approach to Entity Resolution Problem Using Quasi-Clique and its Application to Digital Libraries” (short paper). In Proceedings of the ACM/IEEE 2006 Joint Conference on Digital Libraries (JCDL’06), Chapel Hill, NC, USA, June 11-15, 2006.
- Y. Tao, X. Xiao, and J. Pei. “SUBSKY: Efficient Computation of Skylines in Subspaces”. In Proceedings of the Twenty-second International Conference on Data Engineering (ICDE’06), Atlanta, GA, USA, April 3-7, 2006. (Acceptance rate: 89/456)
- J. Pei, J. Liu[student], H. Wang, K. Wang, P. S. Yu, and J. Wang. “Efficiently Mining Frequent Closed Partial Orders”, In Proceedings of the Fifth IEEE International Conference on Data Mining (ICDM’05), New Orleans, Louisiana, USA, November 27-30 2005. (Acceptance rate: 141/630)
- H. Wang and J. Pei. “A Random Method for Quantifying Changing Distributions in Data Streams”, In Proceedings of the Ninth European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD’05), Porto, Portugal, October 3-7, 2005. (Acceptance rate: 70/250)
- J. Ye, X. Zhou, J. Pei, L. Chen, and L. Zhang. “A Stratification-Based Approach to Accurate and Fast Image Annotation”. In Proceedings of the Sixth International Conference on Web-Age Information Management (WAIM’05), Hangzhou, China, October 11-13, 2005. (Acceptance rate: 48/488)
- C. Liu, K. Wu, and J. Pei. “A Dynamic Clustering and Scheduling Approach to Energy Saving in Data Collection from Wireless Sensor Networks”. In Proceedings of the Second Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks (SECON’05), Santa Clara, California, USA, September 26-29, 2005. (Acceptance rate: 55/202)
- J. Pei, W. Jin, M. Ester, and Y. Tao. “Catching the Best Views in Skyline: A Semantic Approach Based on Decisive Subspaces”. In Proceedings of the Thirty-first International Conference on Very Large Data Bases (VLDB’05), Trondheim, Norway, August 30-September 2, 2005. (Acceptance rate: 16.5%)
- J. Pei, D. Jiang[student], and A. Zhang. “On Mining Cross-Graph Quasi-Cliques”. In Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’05), Chicago, IL, USA, August 21-24, 2005. (Acceptance rate: 40/343)
- H. Wang, J. Pei, and P. S. Yu. “Pattern Based Similarity Search for Microarray Data” (Industrial and Government Track poster paper). In Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’05), Chicago, IL, USA, August 21-24, 2005. (Acceptance rate: 25/75)
- H. Yu, J. Pei, S. Tang, and D. Yang. “Mining Most General Multidimensional Summarization of Probable Groups in Data Warehouses”. In Proceedings of the Seventeenth International Scientific and Statistical Database Management Conference (SSDBM’05), Santa Barbara, California, USA, June 27-29, 2005.
- M. Cho[student], J. Pei, and D. W.-L. Cheung. “Cross Table Cubing: Mining Iceberg Cubes from Data Warehouses” (poster paper). In Proceedings of The Fifth SIAM International Conference on Data Mining (SDM’05), Newport Beach, CA, USA, April 21-23, 2005.
- D. Jiang[student], J. Pei, and A. Zhang. “A General Approach to Mining Quality Pattern-based Clusters from Gene Expression Data”. In Proceedings of the Tenth International Conference on Database Systems for Advanced Applications (DASFAA’05), Beijing, China, April 18-20, 2005. (Acceptance rate: 67/302)
- G. Dong, C. Jiang, J. Pei, J. Li, and L. Wong. “Mining Succinct Systems of Minimal Generators of Formal Concepts”. In Proceedings of the Tenth International Conference on Database Systems for Advanced Applications (DASFAA’05), Beijing, China, April 18-20, 2005. (Acceptance rate: 67/302)
- J. Pei, D. Jiang[student], and A. Zhang. “Multi-Graph Mining: A Cross-Graph Quasi-Clique Approach” (research poster paper). In Proceedings of the Twenty-first International Conference on Data Engineering (ICDE’05), Tokyo, Japan, April 5-8, 2005. (Acceptance rate: 100/521)
- C. Wang, W. Wang, J. Pei, Y. Zhu and B. Shi. “Scalable Mining Large Disk-based Graph Databases” (research full paper). In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’04), Seattle, WA, USA, August 22-25, 2004. (Acceptance rate: 40/337)
- D. Jiang[student], J. Pei, M. Ramanathan, C. Tang, and A. Zhang. “Mining Coherent Gene Clusters from Three-Dimensional Microarray Data” (industrial full paper). In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’04), Seattle, WA, USA, August 22-25, 2004. (Acceptance rate: 13/47)
- L. Deng, J. Pei, J. Ma, and D.L. Lee. “A Rank Sum Test Method for Informative Gene Discovery” (industrial full paper). In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’04), Seattle, WA, USA, August 22-25, 2004. (Acceptance rate: 13/47)
- H. Wang, F. Chu, W. Fan, P. S. Yu, and J. Pei. “A Fast Algorithm for Subspace Clustering by Pattern Similarity” (full paper). In Proceedings of the Sixteenth International Conference on Scientific and Statistical Database Management (SSDBM’04), Santorini Island, Greece, 21-23 June 2004.
- W. Wang, B. Shi, C. Wang, H. Zhou, J. Pei, and M. Hong. “Efficient Pattern-Growth Methods for Frequent Tree Pattern Mining” (full paper). In Proceedings of the Eighth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’04), Sydney, Australia, May 26-28, 2004. (Acceptance rate: 50/235).
- J. Pei, X. Zhang[student], M. Cho[student], H. Wang, and P. S. Yu. “MaPle: A Fast Algorithm for Maximal Pattern-based Clustering” (Regular paper). In Proceedings of the Third IEEE International Conference on Data Mining (ICDM’03), Melbourne, Florida, USA, November 19-22, 2003. (Acceptance rate: 58/501)
- D. Jiang[student], J. Pei, and A. Zhang. “Interactive Exploration of Coherent Patterns in Time-Series Gene Expression Data”. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’03), Washington, DC, USA, August 24-27, 2003. (Acceptance rate: 70/258)
- C. Tang, A. Zhang, and J. Pei. “Mining Phenotypes and Informative Genes from Gene Expression Data”. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’03), Washington, DC, USA, August 24-27, 2003. (Acceptance rate: 70/258)
- J. Wang, J. Han, and J. Pei. “CLOSET+: Searching for the Best Strategies for Mining Frequent Closed Itemsets”. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’03), Washington, DC, USA, August 24-27, 2003. (Acceptance rate: 70/258)
- L. V. S. Lakshmanan, J. Pei, and Y. Zhao. “QC-Trees: An Efficient Summary Structure for Semantic OLAP”. In Proceedings of the 2003 ACM-SIGMOD International Conference on Management of Data (SIGMOD’03), San Diego, CA, June 9-12, 2003. (Acceptance rate: 53/342)
- J. Pei. “A General Model for Online Analytical Processing of Complex Data”. In Proceedings of the Twenty-second International Conference on Conceptual Modeling (ER’03), Chicago, IL, October 13-16, 2003. (Acceptance rate: 38/153)
- H. C. Kum, J. Pei, and W. Wang. “ApproxMAP: Approximate Mining of Consensus Sequential Patterns”. In Proceedings of the 2003 SIAM International Conference on Data Mining (SIAM DM ’03), San Francisco, CA, May 1-3, 2003.
- D. Jiang[student], J. Pei, and A. Zhang. “DHC: A Density-based Hierarchical Clustering Method for Time Series Gene Expression Data” (Regular paper). In Proceedings of the Third IEEE Symposium on Bioinformatics and Bio-engineering (BIBE’03), Washington D.C., March 10-12, 2003. (Acceptance rate: 45/129)
- Y. Huang, H. Xiong, S. Shekhar, and J. Pei. “Mining Confident Co-location Rules without A Support Threshold”. In Proceedings of the Eighteenth Annual ACM Symposium on Applied Computing (SAC’03), Melbourne, Florida, March 9-12, 2003.
- J. Pei, G. Dong, W. Zou, and J. Han. “On Computing Condensed Frequent Pattern Bases” (Regular paper). In Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM’02), Maebashi TERRSA, Maebashi City, Japan, December 9-12, 2002.
- J. Pei, J. Han, and W. Wang. “Mining Sequential Patterns with Constraints in Large Databases” (Regular paper). In Proceedings of Eleventh International Conference on Information and Knowledge Management (CIKM’02), McLean, VA, November 4-9, 2002.
- L. V. S. Lakshmanan, J. Pei, J. Han. “Quotient Cube: How to Summarize The Semantics of A Data Cube”. In Proceedings of Twenty-eighth International Conference on Very Large Databases (VLDB’02), Hong Kong, China, August 20-23, 2002.
- J. Pei, J. Han, H. Lu, S. Nishio, S. Tang, and D. Yang. “H-Mine: Hyper-structure Mining of Frequent Patterns in Large Databases”. In Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM’01), San Jose, California, November 29-December 2, 2001.
- W. Li, J. Han, and J. Pei. “CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules”. In Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM’01), San Jose, California, November 29-December 2, 2001.
- H. Pinto, J. Han, J. Pei, K. Wang, Q. Chen, and U. Dayal, “Multi-Dimensional Sequential Pattern Mining”. In Proceedings of the Tenth ACM International Conference on Information and Knowledge Management (CIKM’01), Atlanta, Georgia, November 2001.
- G. Dong, J. Han, J. Lam, J. Pei, and K. Wang, “Mining Multi-Dimensional Constrained Gradients in Data Cubes”. In Proceedings of the Twenty-seventh International Conference on Very Large Data Base (VLDB’01), Rome, Italy, September 2001.
- J. Han, J. Pei, G. Dong, and K. Wang, “Efficient Computation of Iceberg Cubes with Complex Measures”. In Proceedings of the 2001 ACM-SIGMOD International Conference on Management of Data (SIGMOD’01), Santa Barbara, CA, May 2001.
- J. Pei, J. Han, and L. V. S. Lakshmanan. “Mining Frequent Itemsets With Convertible Constraints”. In Proceedings of the 2001 IEEE International Conference on Data Engineering (ICDE’01), Heidelberg, Germany, April 2001.
- J. Pei, J. Han, B. Mortazavi-Asl, H. Pinto, Q. Chen, U. Dayal, and M. Hsu. “PrefixSpan: Mining Sequential Patterns by Prefix-Projected Pattern Growth”. In Proceedings of the 2001 IEEE International Conference on Data Engineering (ICDE’01), Heidelberg, Germany, April 2001.
- J. Pei and J. Han. “Can We Push More Constraints into Frequent Pattern Mining?”. In Proceedings of the 2000 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’2000), Boston, MA, August 2000.
- J. Han, J. Pei, B. Mortazavi-Asl, Q. Chen, U. Dayal, and M. Hsu. “FreeSpan: Frequent pattern-projected sequential pattern mining”. In Proceedings of the 2000 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’2000), Boston, MA, August 2000.
- J. Han, J. Pei, and Y. Yin. “Mining Frequent Patterns without Candidate Generation”. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD’00), Dallas, TX, May 2000.
- J. Pei, J. Han, B. Mortazavi-Asl, and H. Zhu. “Mining Access Patterns efficiently from Web logs”. InProceedings of the 2000 Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’00), Kyoto, Japan, April 2000.
Workshop papers
- J. Xu, W. Wang, J. Pei, X. Wang, B. Shi, and A. W.-C. Fu. “Utility-Based Anonymization for Privacy Preservation with Less Information Loss” (invited paper, one of the two best papers in the workshop selected for the special issue in ACM SIGKDD Explorations). In Proceedings of the Second ACM SIGKDD Workshop on Utility-Based Data Mining (UBDM’06), in conjunction with the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’06), Philadelphia, PA, August 20, 2006.
- T. Xie and J. Pei. “MAPO: Mining API Usages from Open Source Repositories” (short paper). InProceedings of the Third International Workshop on Mining Software Repositories (MSR 2006), Shanghai, China, May 22-23, 2006.
- G. Dong, J. Han, L. V. S. Lakshmanan, J. Pei, H. Wang, and P. S. Yu. “Online mining of changes from data streams: Research problems and preliminary results”. In Proceedings of the 2003 ACM SIGMOD Workshop on Management and Processing of Data Streams, in cooperation with 2003 ACM-SIGMOD International Conference on Management of Data (SIGMOD’03), San Diego, CA, June 8, 2003.
- Y. Chen, G. Dong, J. Han, J. Pei, B. Wah, J. Wang. “Online Analytical Processing Stream Data: Is It Feasible?”. In Proceedings of 2002 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD’2002), Madison, Wisconsin, June 2, 2002.
- J. Pei, A. K. H. Tung, and J. Han. “Fault-tolerant frequent pattern mining: Problems and Challenges”. In Proceedings of the 2001 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD’01), Santa Barbara, CA, May 2001.
- J. Pei, J. Han, and R. Mao. “CLOSET: An efficient algorithm for mining frequent closed itemsets”. In
- Proceedings of the 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD’00), Dallas, TX, May, 2000.
Tutorials
- P. Cui and J. Pei. “Data-Centric Trustworthy AI”. In Proceedings of the Twenty-eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’22), Washington, DC, USA, August 14-18, 2022.
- L. Wu, P. Cui, J. Pei, L. Zhao, and X. Guo. “Graph Neural Networks: Foundation, Frontiers, and Applications”. In Proceedings of the Twenty-eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’22), Washington, DC, USA, August 14-18, 2022.
- L. Wu, P. Cui, J. Pei, L. Zhao, and X. Guo. “Graph Neural Networks: Foundation, Frontiers and Applica- tions”. In Proceedings of the Thirty-first International Joint Conference on Artificial Intelligence (IJCAI’22), Messe Wien, Vienna, Austria, July 23-29, 2022.
- J. Pei, Z. Cong, X. Luo, and F. Zhu. “Data and Model Pricing in the Pipeline of Machine Learning”. In Proceedings of the Twenty-seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’21), Singapore, August 14-18, 2021.
- L. Shou, M. Gong, J. Pei, X. Geng, X. Zhou, and D. Jiang. “Language Scaling: Applications, Chal- lenges and Approaches”. In Proceedings of the Twenty-seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’21), Singapore, August 14-18, 2021.
- Z. Zhou, L. Chu, L. Wang, J. Pei, and Y. Zhang. “Towards Fair Federated Learning”. In Proceedings of the Twenty-seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’21), Singapore, August 14-18, 2021.
- F. Zhu, H. Liu, X. Wu, and J. Pei. “Data Asset for Collaborative Intelligence”. In Proceedings of the Twenty-seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’21), Singapore, August 14-18, 2021.
- X. Hu, J. Pei, L. Chu, J. Bian, and W. Liu. “Deep Learning Model Complexity: Concepts and Approaches”. In Proceedings of the Twenty-first SIAM International Conference on Data Mining (SDM’21), April 29 – May 1, 2021, Alexandria, VA, USA.
- L. Shou, M. Gong, J. Pei, X. Geng, X. Zhou, and D. Jiang. “Scaling NLP Applications to 100+ Languages”. In Proceedings of the Thirtieth The Web Conference (WWW’21), Ljubljana, Slovenia, April 19-23, 2021.
- F. Zhu and J. Pei. “Data Asset and Governance: Opportunities and Challenges at the Frontier”. In Proceedings of the Twentieth IEEE International Conference on Data Mining (ICDM’20), Sorrento, Italy, November 17-20, 2020.
- J. Pei. “Data Pricing: From Economics to Data Science”. In Proceedings of the Twenty-sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD20), San Diego, CA, USA, August 23-27, 2020.
- F. Wang, P. Cui, J. Pei, Y. Song, and C. Zang. “Recent Advances on Graph Analytics and Its Applications in Healthcare”. In Proceedings of the Twenty-sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD20), San Diego, CA, USA, August 23-27, 2020.
- X. Huang, P. Cui, Y. Dong, J. Li, H. Liu, J. Pei, L. Song, J. Tang, F. Wang, H. Yang, and W. Zhu. “Learning From Networks: Algorithms, Theory, and Applications”. In Proceedings of the Twenty-fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’19), Anchorage, AK, USA, August 4-8, 2019.
- Z.-J. M. Shen, R. Yuan, D, Wu, and J. Pei. “Data Science in Retail-as-a-Service”. In Proceedings of the Twenty-fourth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’18), London, United Kingdom, August 19-23, 2018.
- P. Cui, J. Pei, W. Zhu, T. Berger-Wolf, I. Brugere, and B. Perozzi. “Modeling Data With Networks + Network Embedding: Problems, Methodologies and Frontiers”. In Proceedings of the Twenty-fourth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’18), London, United Kingdom, August 19-23, 2018.
- P. Cui, J. Pei, and W. Zhu. “Network Embedding-Enabling Network Analytics and Inference in Vector Space”. In Proceedings of the Twenty-third ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’17), Halifax, NS, Canada, August 13-17, 2017.
- J. Pei. “Mining Uncertain and Probabilistic Data for Big Data Analytics”. In Proceedings of the Twelfth IEEE International Conference on Data Mining (ICDM’12), Brussels, Belgium, December 10-13, 2012.
- D. Jiang[student], J. Pei, and H. Li. “Enhancing Web Search by Mining Search and Browse Logs”. InProceedings of the Thirty-fourth Annual ACM SIGIR Conference (SIGIR’11), Beijing, China, July 24-28, 2011.
- M. Hay, K. Liu, G. Miklau, J. Pei, and E. Terzi. “Privacy-aware Data Management in Information Networks”. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD’11), Athens, Greece, June 12-16, 2011.
- K. Liu, G. Miklau, J. Pei, and E. Terzi. “Privacy-aware Data Mining in Information Networks”. In Proceedings of the Sixteenth ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’10), Washington, DC, USA, July 25-28, 2010.
- D. Jiang[student], J. Pei, and H. Li. “Web Search/Browse Log Mining: Challenges, Methods, and Applications”. In Proceedings of the Sixteenth ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’10), Washington, DC, USA, July 25-28, 2010.
- D. Jiang[student], J. Pei, and H. Li. “Web Search/Browse Log Mining: Challenges, Methods, and Applications”. In Proceedings of the 33rd Annual ACM SIGIR Conference (SIGIR’10), Geneva, Switzerland, July 19-23, 2010.
- D. Jiang[student], J. Pei, and H. Li. “Web Search/Browse Log Mining: Challenges, Methods, and Applications”. In Proceedings of the Nineteenth International World Wide Web Conference (WWW’10), Raleigh, NC, USA, April 26-30, 2010.
- J. Han and J. Pei. “Preference Queries from OLAP and Data Mining Perspective”. In Proceedings of the Twenty-fifth IEEE International Conference on Data Engineering (ICDE’09), Shanghai, China, March 29 – April 4, 2009.
- J. Pei, M. Hua[student], Y. Tao, and X. Lin. “Mining Uncertain and Probabilistic Data: Problems, Challenges, Methods and Applications”. In Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’08), Las Vegas, NV, USA, August 24-27, 2008.
- J. Pei, M. Hua[student], Y. Tao, and X. Lin. “Query Answering Techniques on Uncertain and Probabilistic Data”. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD’08), Vancouver, Canada, June 9-12, 2008.
- J. Pei, B. Zhou[student], Z. Tang, and H. Huang[student]. “Data Mining Techniques for Web Spam Detection”. In Proceedings of the Twelfth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’08), Osaka, Japan, May 20-23, 2008.
- T. Xie and J. Pei. “Data Mining for Software Engineering”. In Proceedings of the Twelfth ACM SIGKDD International Conference on Data Mining (KDD’06), Philadelphia, USA, August 20-23, 2006.
- J. Pei, H. Wang and P. S. Yu. “Online Mining Data Streams: Problems, Applications and Progress”. In Proceedings of the Sixth International Conference on Web-Age Information Management (WAIM’05), Hangzhou, China, October 11-13, 2005.
- H. Wang, J. Pei and P. S. Yu. “Online Mining Data Streams: Problems, Applications and Progress”. InProceedings of the Twenty-first International Conference on Data Engineering (ICDE’05), Tokyo, Japan, April 5-8, 2005.
- J. Pei, H. Wang and P. S. Yu. “Online Mining Data Streams: Problems, Applications and Progress”. InProceedings of the Tenth ACM International Conference on Data Mining (KDD’04), Seattle, WA, August 22-25, 2004.
- J. Pei, S. J. Upadhyaya, F. Farooq and V. Govindaraju. “Data Mining for Intrusion Detection: Techniques, Applications and Systems”. In Proceedings of the Twentieth IEEE International Conference on Data Engineering (ICDE’04), Boston, MA, March 30-April 2, 2004.
- J. Han, L. V. S. Lakshmanan and J. Pei. “Mining Frequent Patterns: Methods and Applications”. In the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’2001), San Francisco, California, USA, August 26-29, 2001.
- J. Pei and J. Han. “Sequential Pattern Mining: From Shopping History Analysis to Weblog Mining and DNA Mining”. In the Fifth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’01), Hong Kong, China, April 16-18, 2001.