Detailed Program

Day 1
12 Dec 2016
Day 2
13 Dec 2016
Day 3
14 Dec 2016
Day 4
15 Dec 2016

Workshop Sessions

Full-Day Workshops

DaMNet Data Mining in Networks. Workshop Chair/s: Giuseppe Di Fatta. Room A2.
DMHAA Data Mining in Human Activity Analysis. Workshop Chair/s: Weifeng Liu. Room S3. 
DMS Data Mining for ServiceWorkshop Chair/s: Katsutoshi Yada and Shusaku Tsumoto. Room S5. 
DSBDA Data Science and Big Data AnalyticsWorkshop Chair/s: Xiaolong Jin. Room A1. 
MoDAT Data Market for Co-evolution of Sciences and BusinessWorkshop Chair/s: Jun Nakamura and Yukio Ohsawa. Room S7. 
SENTIRE Sentiment Elicitation from Natural Text for Information Retrieval and ExtractionWorkshop Chair/s: Erik Cambria. Room S4. 
SSTDM Spatial and Spatiotemporal Data MiningWorkshop Chair/s: Raju Vatsavai. Room S12. 

Half-day Workshops 

DAPS Data mining for the Analysis of Performance and SuccessWorkshop Chair/s: Luca Pappalardo, Paolo Cintia and Roberta Sinatra. Room S6. 
DMBIH Data Mining in Biomedical Informatics and HealthcareWorkshop Chair/s: José D. Martín-Guerrero and Paulo J. G. Lisboa (keynote speaker). Room S9. 
DMED I Data Mining in Emerging Domains IWorkshop Chair/s: Cèsar Ferri and Fabio Mazzarella. Room S11. 
  • DWA Data Wrangling Automation
  • MDDM Maritime Domain Data Mining
HDM High Dimensional Data MiningWorkshop Chair/s: Ata Kaban. Room A4. 
PDDM Privacy and Discrimination in Data MiningWorkshop Chair/s: Sara Hajian, Tamir Tassa and Yucel Saygin. Room A3. 
SOMERIS Social Media and RiskWorkshop Chair/s: Fragkiskos Malliaros and Michalis Vazirgiannis. Room S10. 

PhD Forum

Invited talk
"Challenges, problem selection, and impact: A biased perspective on PhD studies"
Alexandra Olteanu

Student presentations
“Subspace Clustering Ensembles through Tensor Decomposition”
Dominik Mautz, Christian Böhm, and Claudia Plant


“Clustering with the Levy Walk: ‘Hunting’ for Clusters”
Benjamin Schelling and Claudia Plant


“Infer Mobility Patterns and Social Dynamics for Modelling Human Behaviour”
Luca Luceri
10:30-11:00

Coffee Break

Workshop Sessions

Full-Day Workshops

DaMNet Data Mining in Networks. Workshop Chair/s: Giuseppe Di Fatta. Room A2.
DMHAA Data Mining in Human Activity Analysis. Workshop Chair/s: Weifeng Liu. Room S3. 
DMS Data Mining for ServiceWorkshop Chair/s: Katsutoshi Yada and Shusaku Tsumoto. Room S5. 
DSBDA Data Science and Big Data AnalyticsWorkshop Chair/s: Xiaolong Jin. Room A1. 
MoDAT Data Market for Co-evolution of Sciences and BusinessWorkshop Chair/s: Jun Nakamura and Yukio Ohsawa. Room S7. 
SENTIRE Sentiment Elicitation from Natural Text for Information Retrieval and ExtractionWorkshop Chair/s: Erik Cambria. Room S4. 
SSTDM Spatial and Spatiotemporal Data MiningWorkshop Chair/s: Raju Vatsavai. Room S12. 

Half-day Workshops 

DAPS Data mining for the Analysis of Performance and SuccessWorkshop Chair/s: Luca Pappalardo, Paolo Cintia and Roberta Sinatra. Room S6. 
DMBIH Data Mining in Biomedical Informatics and HealthcareWorkshop Chair/s: José D. Martín-Guerrero and Paulo J. G. Lisboa (keynote speaker). Room S9. 
DMED I Data Mining in Emerging Domains IWorkshop Chair/s: Cèsar Ferri and Fabio Mazzarella. Room S11. 
  • DWA Data Wrangling Automation
  • MDDM Maritime Domain Data Mining
HDM High Dimensional Data MiningWorkshop Chair/s: Ata Kaban. Room A4. 
PDDM Privacy and Discrimination in Data MiningWorkshop Chair/s: Sara Hajian, Tamir Tassa and Yucel Saygin. Room A3. 
SOMERIS Social Media and RiskWorkshop Chair/s: Fragkiskos Malliaros and Michalis Vazirgiannis. Room S10. 

PhD Forum

Invited talk
"Top Ten List of Things That I’ve Learned Advising PhD Students"
Tina Eliassi-Rad

Student presentations
“A Semi-Supervised Ensemble Approach for Multi-Label Learning”
Ouadie Gharroudi, Haytham Elghazel, and Alex Aussem

“Similarity Tree Pruning: a novel dynamic ensemble selection approach”
Anil Narassiguin, Haytham Elghazel, and Alex Aussem

“A Novel Bayesian Ensemble Pruning Method”
Zhengshen Jiang, Hongzhi Liu, Bin Fu, and Zhonghai Wu

“A novel pre-classification based kNN algorithm”
Huahua Xie, Dong Liang, Hao Jin, Zhaojing Zhang, and Shizhan Lan
13:00-14:30

Lunch Break

Workshop Sessions

Full-Day Workshops

DaMNet Data Mining in Networks. Workshop Chair/s: Giuseppe Di Fatta. Room A2.
DMHAA Data Mining in Human Activity Analysis. Workshop Chair/s: Weifeng Liu. Room S3. 
DMS Data Mining for ServiceWorkshop Chair/s: Katsutoshi Yada and Shusaku Tsumoto. Room S5. 
DSBDA Data Science and Big Data AnalyticsWorkshop Chair/s: Xiaolong Jin. Room A1. 
MoDAT Data Market for Co-evolution of Sciences and BusinessWorkshop Chair/s: Jun Nakamura and Yukio Ohsawa. Room S7. 
SENTIRE Sentiment Elicitation from Natural Text for Information Retrieval and ExtractionWorkshop Chair/s: Erik Cambria. Room S4. 
SSTDM Spatial and Spatiotemporal Data MiningWorkshop Chair/s: Raju Vatsavai. Room S12. 

Half-Day Workshops

CLOUDMINE + DMIoT. Workshop Chair/s: Vani Mandava, David Carrera and Soundar Srinivasan. Room A4. 
  • CLOUDMINE Data Mining Systems and their Applications on the Cloud
  • DMIoT Data Mining for Internet of Things
DINA Data Integration and Applications. Workshop Chair/s: Peter Christen, Osmar Zaiane and Luiza Antonie. Room S9.
DMED II Data Mining in Emerging Domains II. Workshop Chair/s: Alicia Troncoso, Irena Koprinska and Jeremiah Deng. Room S11. 
  • DaMEMO Data Mining for Energy Modelling and Optimization
  • DMiP Data Mining in Politics
DMCS Data Mining for Cyber Security. Workshop Chair/s: Nathalie Japkowicz. Room S6. 
OEDM Optimization Based Techniques for Emerging Data Mining. Workshop Chair/s: Zhensong Chen. Room A3. 
SERecSys Semantics-Enabled Recommender Systems. Workshop Chair/s: Ludovico Boratto and Giovanni Stilo. Room S10. 

PhD Forum

Tutorial
"The data scientist's guide for writing papers"
Nikolaj Tatti
16:10-16:30

Coffee Break

Workshop Sessions

Full-Day Workshops

DaMNet Data Mining in Networks. Workshop Chair/s: Giuseppe Di Fatta. Room A2.
DMHAA Data Mining in Human Activity Analysis. Workshop Chair/s: Weifeng Liu. Room S3. 
DMS Data Mining for ServiceWorkshop Chair/s: Katsutoshi Yada and Shusaku Tsumoto. Room S5. 
DSBDA Data Science and Big Data AnalyticsWorkshop Chair/s: Xiaolong Jin. Room A1. 
MoDAT Data Market for Co-evolution of Sciences and BusinessWorkshop Chair/s: Jun Nakamura and Yukio Ohsawa. Room S7. 
SENTIRE Sentiment Elicitation from Natural Text for Information Retrieval and ExtractionWorkshop Chair/s: Erik Cambria. Room S4. 
SSTDM Spatial and Spatiotemporal Data MiningWorkshop Chair/s: Raju Vatsavai. Room S12. 

Half-Day Workshops

CLOUDMINE + DMIoT. Workshop Chair/s: Vani Mandava, David Carrera and Soundar Srinivasan. Room A4. 
  • CLOUDMINE Data Mining Systems and their Applications on the Cloud
  • DMIoT Data Mining for Internet of Things
DINA Data Integration and Applications. Workshop Chair/s: Peter Christen, Osmar Zaiane and Luiza Antonie. Room S9.
DMED II Data Mining in Emerging Domains II. Workshop Chair/s: Alicia Troncoso, Irena Koprinska and Jeremiah Deng. Room S11. 
  • DaMEMO Data Mining for Energy Modelling and Optimization
  • DMiP Data Mining in Politics
DMCS Data Mining for Cyber Security. Workshop Chair/s: Nathalie Japkowicz. Room S6. 
OEDM Optimization Based Techniques for Emerging Data Mining. Workshop Chair/s: Zhensong Chen. Room A3. 
SERecSys Semantics-Enabled Recommender Systems. Workshop Chair/s: Ludovico Boratto and Giovanni Stilo. Room S10. 

PhD Forum

Student presentations
“Improving the Prediction Cost of Drift Handling Algorithms by Abstaining”
Pierre-Xavier Loeffel, Vincent Lemaire, Christophe Marsala, and Marcin Detyniecki


“Hidden Structures at a Glance”
Remy Dautriche, Alexandre Termier, Renaud Blanch, and Miguel Santana


“Centrality-based Approach for Supervised Term Weighting”
Niloofer Shanavas, Hui Wang, Zhiwei Lin, and Glenn Hawe


“Applying Deep Learning to Stereotypical Motor Movement Detection in Autism Spectrum Disorders”
Nastaran Mohammadian Rad and Cesare Furlanello


“Regression techniques for modelling conception in seasonally calving dairy cows”
Caroline Fenlon, Luke O’Grady, Michael Doherty, Stephen Butler, Laurence Shalloo, and John Dunnion


“Sacrificing overall classification quality to improve classification accuracy of well-sought classes”
Kevin Amaral, Ping Chen, Wei Ding, and Rajani Sadasivam


“Generating Informative Summary of Social Image Search Result”
Sheetal Takale and Prakash Kulkarni

Opening

Ricardo Baeza-Yates (NTENT, USA / UPF, Spain)
Zhi-Hua Zhou (Nanjing University)
Francesco Bonchi (ISI Foundation, Italy / Eurecat, Spain)
Josep Domingo-Ferrer (URV, Spain)

Keynote 1

Deep Learning 


Yoshua Bengio, University of Montreal



Session Chair: Ricardo Baeza (NTENT, USA / UPF, Spain)
10:30-11:00

Coffee Break

Session A1: Deep Learning

Session Chair: Wei Ding (University of Massachusetts Boston, USA)

REGULAR PAPERS:

DM558 "Regularizing Deep Convolutional Neural Networks with a Structured Decorrelation Constraint"
Wei Xiong, Bo Du, Lefei Zhang, and Dacheng Tao

DM613 "Traffic Speed Prediction and Congestion Source Exploration: A Deep Learning Method"
Jingyuan Wang, Qian Gu, Junjie Wu, Guannan Liu, and Zhang Xiong

DM714 "Model Accuracy and Runtime Tradeoff in Distributed Deep Learning: A Systematic Study"
Wei Zhang, Suyog Gupta, and Fei Wang

DM976 "Convolutional MKL Based Multimodal Emotion Recognition and Sentiment Analysis"
Iti Chaturvedi, Soujanya Poria, and Erik Cambria


SHORT PAPERS:

DM390 "Structure Selection for Convolutive Non-negative Matrix Factorization Using Normalized Maximum Likelihood Coding"
Atsushi Suzuki, Kohei Miyaguchi, and Kenji Yamanishi

DM679 "Product-based Neural Networks for User Response Prediction"
Yanru Qu, Han Cai, Kan Ren, Weinan Zhang, and Yong Yu

DM933 "Deep Convolutional Factor Analyser for Multivariate Time Series Modeling"
Chao Yuan

DM945 "Learning Deep Networks from Noisy Labels with Dropout Regularization"
Ishan Jindal, Matthew Nokleby, and Xuewen Chen

Session B1: Patterns

Session Chair: Jilles Vreeken (MPI, Germany)

REGULAR PAPERS:

DM257 "A Scalable and Generic framework to Mine Top-k Representative Subgraph Patterns"
Dheepikaa Natarajan and Sayan Ranu

DM267 "CoreScope: Graph Mining Using k-Core Analysis - Patterns, Anomalies, and Algorithms"
Kijung Shin, Tina Eliassi-Rad, and Christos Faloutsos

DM319 "Mining Graphlet Counts in Online Social Networks"
Xiaowei Chen and John C. S. Lui

DM912 "On Efficient External-Memory Triangle Listing"
Yi Cui, Di Xiao, and Dmitri Loguinov

DM377 "Bi-level Rare Temporal Pattern Detection"
Dawei Zhou, Jingrui He, Yu Cao, and Jae-sun Seo

SHORT PAPERS:

DM1056 "Direct Mining of Subjectively Interesting Relational Patterns"
Tias Guns, Achille Aknin, Jefrey Lijffijt, and Tijl De Bie

DM1106 "Mining Statistically Significant Attribute Associations in Attributed Graphs"
Jihwan Lee, Keehwan Park, and Sunil Prabhakar

Session C1: Urban and mobility data

Session Chair: Charu Aggarwal (IBM Research, USA)

REGULAR PAPERS:

DM525 "Unsupervised Exceptional Attributed Sub-graph Mining in Urban Data"
Anes Bendimerad, Marc Plantevit, and Céline Robardet

DM535 "POI Recommendation: A Temporal Matching between POI Popularity and User Regularity"
Zijun Yao, Yanjie Fu, Bin Liu, Yanchi Liu, and Hui Xiong

DM649 "The Optimal Distribution of Electric-Vehicle Chargers across A City"
Chen Liu, Ke Deng, Chaojie Li, Jianxin Li, Yanhua Li, and Jun Luo

DM1047 "Relief of Spatiotemporal Accessibility Overloading with Optimal Resource Placement"
Chien-Wei Chang, Hao-Yi Chih, Dean Chou, Yu-Chen Shu, and Kun-Ta Chuang

SHORT PAPERS:

DM365 "The Development of a Smart Taxicab Scheduling System: A Multi-Source Data Fusion Perspective"
Yang Wang, Binxin Liang, Zheng Wei, Liusheng Huang, and Hengchang Liu

DM291 "Regularized Content-Aware Tensor Factorization Meets Temporal-Aware Location Recommendation"
Defu Lian, Zhengyu Zhang, Yong Ge, Fuzheng Zhang, Nicholas Jing Yuan, and Xing Xie

DM268 "Modeling Real Estate for School District Identification"
Fei Tan, Chaoran Cheng, and Zhi Wei

DM1093 "House Price Modeling over Heterogeneous Regions with Hierarchical Spatial Functional Analysis"
Bang Liu, Borislalv Mavrin, Di Niu, and Linglong Kong

Tutorial 1

Mining smartphone and mobility data
13:00-14:30

Lunch Break

Session A2: Clustering

Session Chair: Pauli Mittinen (MPI, Germany)

REGULAR PAPERS:

DM737 "Generalized Independent Subspace Clustering"
Wei Ye, Samuel Maurus, Nina Hubig, and Claudia Plant

DM1021 "Robust Graph-theoretic Clustering Approaches Using Node-Based Resilience Measures"
John Matta, Jeffrey Borwey, Tayo Obafemi-Ajayi, Donald Wunsch, and Gunes Ercal

SHORT PAPERS:

DM396 "Multi-Type Co-clustering of General Heterogeneous Information Networks"
Xianchao Zhang, Haixin Li, Wenxin Liang, and Jiebo Luo

DM565 "Interpretable Clustering via Discriminative Rectangle Mixture Model"
Junxiang Chen, Yale Chang, Brian Hobbs, Peter Castaldi, Michael Cho, Edwin Silverman, and Jennifer Dy

DM577 "Self-Grouping Multi-Network Clustering"
Jingchao Ni, Wei Cheng, Wei Fan, and Xiang Zhang

DM669 "A Theoretical Analysis of the Fuzzy K-Means Problem"
Johannes Blömer, Sascha Brauer, and Kathrin Bujna

DM749 "Multi-View Clustering via Concept Factorization with Local Manifold Regularization"
Hao Wang, Yan Yang, and Tianrui Li

DM972 "Robust Convex Clustering Analysis"
Qi Wang, Pinghua Gong, Shiyu Chang, Thomas Huang, and Jiayu Zhou

Session B2: Recommender Systems

Session Chair: Nuria Oliver (Data-Pop Alliance, Spain)

REGULAR PAPERS:

DM554 "Efficient Rectangular Maximal-Volume Algorithm for Rating Elicitation in Collaborative Filtering"
Alexander Fonarev, Alexander Mikhalev, Gleb Gusev, Ivan Oseledets, and Pavel Serdyukov

DM635 "Recommending Packages to Groups"
Shuyao Qi, Nikos Mamoulis, Evaggelia Pitoura, and Panayiotis Tsaparas

DM663 "Fusing Similarity Models with Markov Chains for Sparse Sequential Recommendation"
Ruining He and Julian McAuley

SHORT PAPERS:

DM282 "Context-aware Sequential Recommendation"
Qiang Liu, Shu Wu, and Liang Wang

DM692 "Learning Compatibility Across Categories for Heterogeneous Item Recommendation"
Ruining He, Charles Packer, and Julian McAuley

DM773 "Time-Aware User Identification with Topic Models"
Clément Lesaege, François Schnitzler, Anne Lambert, and Jean-Ronan Vigouroux

DM818 "Whether this participant will attract you to this event? Exploiting Participant Influence for Event Recommendation"
Yi Liao, Xinshi Lin, and Wai Lam

Session C2: Events and Social

Session Chair: Leman Akoglu (CMU, USA)

REGULAR PAPERS:

DM544 "Multi-resolution Spatial Event Forecasting in Social Media"
Liang Zhao, Feng Chen, Chang-Tien Lu, and Naren Ramakrishnan

DM949 "Event Series Prediction via Non-Homogeneous Poisson Process Modelling"
James Goulding, Simon Preston, and Gavin Smith

SHORT PAPERS:

DM729 "Towards Scalable Network Delay Minimization"
Sourav Medya, Petko Bogdanov, and Ambuj K Singh

DM881 "Detecting Change Processes in Dynamic Networks by Frequent Graph Evolution Rule Mining"
Erik Scharwaechter, Marwan Hassani, Emmanuel Mueller, Thomas Seidl, and Jonathan Donges

DM892 "DeBot: Twitter Bot Detection via Warped Correlation"
Nikan Chavoshi, Hossein Hamooni, and Abdullah Mueen

DM1055 "Modeling Knowledge Diffusion with Time Lags in Citation Networks"
Tao-yang Fu, Zhen Lei, and Wang-Chien Lee

DM1062 "Event Grounding from Multimodal Social Network Fusion"
Hyunsouk Cho, Jinyoung Yeo, and Seung-won Hwang

DM1076 "Large-Scale Embedding Learning in Heterogeneous Event Data"
Huan Gui, Jialu Liu, Fangbo Tao, Meng Jiang, Brandon Norick, and Jiawei Han

Tutorial 1

Mining smartphone and mobility data
16:10-16:30

Coffee Break

Session A3: Unsupervised learning

Session Chair: Vagelis Papalexakis (UC Riverside, USA)

REGULAR PAPERS:

DM336 "Iteratively Reweighted Least Squares Algorithms for L1-Norm Principal Component Analysis"
Young Woong Park and Diego Klabjan

DM383 "Heterogeneous Representation Learning with Structured Sparsity Regularization"
Pei Yang and Jingrui He

DM389 "Triply Stochastic Variational Inference for Non-linear Beta Process Factor Analysis"
Kai Fan, Yizhe Zhang, and Katherine Heller

DM821 "Causal Inference by Compression"
Kailash Budhathoki and Jilles Vreeken

SHORT PAPERS:

DM332 "Factorizing Complex Discrete Data "with Finesse""
Samuel Maurus and Claudia Plant

DM337 "A Fast Factorization-based Approach to Robust Principal Component Analysis"
Chong Peng, Zhao Kang, and Qiang Cheng

Session B3: Crowds

Session Chair: Celine Robardet (INSA Lyon, France)

REGULAR PAPERS:

DM460 "A Bayesian Nonparametric Approach to Dynamic Dyadic Data Prediction"
Fengyuan Zhu, Guangyong Chen, and Pheng-Ann Heng

DM615 "Modeling Ambiguity, Subjectivity, and Diverging Viewpoints in Opinion Question Answering Systems"
Mengting Wan and Julian McAuley

DM890 "Group Preference Aggregation: A Nash Equilibrium Approach"
Hongke Zhao, Qi Liu, and Yong Ge

DM1064 "Binding Pairwise Preferences of the Crowd into Rankings: A Thurstonian Approach"
Xiaolong Wang, Jingjing Wang, Jie Luo, Yi Chang, and Chengxiang Zhai

SHORT PAPERS:

DM568 "Improved and Scalable Bradley-Terry Model for Collaborative Ranking"
Jun Hu and Ping li

DM875 "A Robust Framework for Classifying Evolving Document Streams in an Expert-Machine-Crowd Setting"
Muhammad Imran, Sanjay Chawla, and Carlos Castillo

Session C3: Text

Session Chair: Erik Cambria (NTU, Singapore)

REGULAR PAPERS:

DM1030 "Learning Hierarchically Decomposable Concepts with Active Over-Labeling"
Yuji Mo, Stephen Scott, and Doug Downey

DM1085 "L-EnsNMF: Boosted Local Topic Discovery via Ensemble of Nonnegative Matrix Factorization"
Sangho Suh, Jaegul Choo, Joonseok Lee, and Chandan Reddy

SHORT PAPERS:

DM292 "Mutual Reinforcement of Academic Performance Prediction and Library Book Recommendation"
Defu Lian, Qi Liu, Wenya Zhu, Xing Xie, and Hui Xiong

DM529 "Topic Discovery for Short Texts Using Word Embeddings"
Guangxu Xun, Vishrawas Gopalakrishnan, Fenglong Ma, Yaliang Li, Jing Gao, and Aidong Zhang

DM745 "Concept based Short Text Stream Classification with Topic Drifting Detection"
Peipei Li, Lu He, Xuegang Hu, Yuhong Zhang, Lei Li, and Xindong Wu

DM817 "A Scalable Framework for Stylometric Analysis Query Processing"
Sarana Nutanong, chenyun YU, Raheem Sarwar, Peter Xu, and Dickson Chow

DM897 "Incorporating Pre-Training in Long Short-Term Memory Networks for Tweets Classification"
Shuhan Yuan, Xintao Wu, and Yang Xiang

DM985 "Canonical Consistent Weighted Sampling for Real-Value Weighted Min-Hash"
Wei Wu, Bin Li, Ling Chen, and Chengqi Zhang

Tutorial 2

Network Representation Learning: A Revisit in Big Data Era

Panel

How We Can/Should Handle the Offline vs. Online Data Gap?

Panelists:
     Rakesh Agrawal (Data Insights Laboratories, USA),
     Ricardo Baeza-Yates (NTENT, USA & UPF, Spain),
     Tina Eliassi-Rad (Northeastern Univ., USA),
     Katherina Morik (TU Dortmund, Germany), and
     Nuria Oliver (DataPop Alliance, Spain)

Moderator: Arno Siebes (Utrecht University, The Netherlands)  

Keynote 2

Big Data – Security and Privacy 


Elisa Bertino, Purdue University



Session Chair: Josep Domingo-Ferrer (URV, Spain)
10:30-11:00

Coffee Break

Session A4: Theory

Session Chair: Aris Gionis (Aalto University, Finland)

REGULAR PAPERS:

DM969 "From Sets of Good Redescriptions to Good Sets of Redescriptions"
Janis Kalofolias, Esther Galbrun, and Pauli Miettinen

DM799 "What Will You Gain By Rounding: Theory and Algorithms for Rounding Rank"
Stefan Neumann, Rainer Gemulla, and Pauli Miettinen

DM960 "ADAGIO: Fast Data-aware Near-Isometric Linear Embeddings"
Jaroslaw Blasiok and Charalampos Tsourakakis

DM1003 "Aligned Matrix Completion: Integrating Consistency and Independency in Multiple Domains"
Linli Xu and Zaiyi Chen

DM1037 "Communities in Preference Networks: Refined Axioms and Beyond"
Gang Zeng, Yuyi Wang, Juhua Pu, Xingwu Liu, Xiaoming Sun, and Jialin Zhang

SHORT PAPERS:

DM603 "Foundations of Perturbation Robust Clustering"
Jarrod Moore and Margareta Ackerman

DM360 "Compressing Random Forests"
Amichai Painsky and Saharon Rosset

Session C4: Web

Session Chair: Carlos Castillo (Eurecat, Spain)

REGULAR PAPERS:

DM386 "Reliable Gender Prediction Based on Users' Video Viewing Behavior"
Jie Zhang, Kuang Du, Ruihua Cheng, Zhi Wei, Chenguang Qin, Huaxin You, and Sha Hu

DM733 "Efficient Extraction of Non-negative Latent Factors from High-dimensional and Sparse Matrices"
Xin Luo, Mingsheng Shang, and Shuai Li

DM862 "Inferring latent network from cascade data for dynamic social recommendation"
Qin Zhang, Jia Wu, Peng Zhang, Guodong Long, Ivor W. Tsang, and Chengqi Zhang

DM975 "Sparse Factorization Machines for Click-through Rate Prediction"
Zhen Pan, Qi Liu, Tong Xu, Enhong Chen, Haiping Ma, and Hongjie Lin

SHORT PAPERS:

DM533 "Meta Analyses for Dynamic Contextual Multi Arm Bandits in Display Advertisement"
Hongxia Yang and Quan Lu

DM575 "Selecting Valuable Customers for Merchants in E-commerce Platforms"
Yijun Wang, Le Wu, Zongda Wu, Enhong Chen, and Qi Liu

DM589 "Service Usage Analysis in Mobile Messaging Apps: A Multi-Label Multi-View Perspective"
Hui Xiong, Junming Liu, Xinjiang Lu, Yanjie Fu, and Xiaolin Li

Session B4: Supervised Learning 1

Session Chair: George Karypis (University of Minnesota, USA)

REGULAR PAPERS:

DM850 "Asynchronous Multi-Task Learning"
Inci Baytas, Ming Yan, Anil Jain, and Jiayu Zhou

DM883 "Interactive Multi-Task Relationship Learning"
Kaixiang Lin and Jiayu Zhou

SHORT PAPERS:

DM433 "Functional Regression with Mode-Sparsity Constraint"
Pei Yang and Jingrui He

DM379 "Learning Supervised Binary Hashing: Optimization vs Diversity"
Ramin Raziperchikolaei and Miguel Carreira-Perpinan

DM732 "Bayesian Rule Sets for Interpretable Classification"
Tong Wang, Cynthia Rudin, Finale Doshi-Velez, Yimin Liu, Erica Klampfl, and Perry MacNeille

DM789 "Probabilistic Formulations of Regression with Mixed Guidance"
Aubrey Gress and Ian Davidson

DM1005 "Background Check: A general technique to build more reliable and versatile classifiers"
Miquel Nieto, Telmo Silva Filho, Meelis Kull, and Peter Flach

DM1026 "Optimizing the Multi-class F-measure via Biconcave Programming"
Weiwei Pan, Harikrishna Narasimhan, Purushottam Kar, Pavlos Protopapas, and Harish Ramaswamy

DM1084 "A Rotation Invariant Latent Factor Model for Moveme Discovery from Static Poses"
Matteo Ruggero Ronchi, Joon Sik Kim, and Yisong Yue

Tutorial 3

The Evolution of Natural Language Understanding and Prediction Technologies: from Formal Grammars to Large Scale Machine Learning
13:00-14:30

Lunch Break

Session A5: Graphs 1

Session Chair: Nikolaj Tatti (Aalto University, Finland)

REGULAR PAPERS:

DM424 "Waddling Random Walk: Fast and Accurate Mining of Motif Statistics in Large Graphs"
Guyue Han and Harish Sethu

DM514 "Random Walk with Restart over Dynamic Graphs"
Weiren Yu and Weiren Yu

DM687 "Edge Weight Prediction in Weighted Signed Networks"
Srijan Kumar, Francesca Spezzano, V.S. Subrahmanian, and Christos Faloutsos

SHORT PAPERS:

DM481 "Personalized Ranking in Signed Networks using Signed Random Walk with Restart"
Jinhong Jung, Woojeong Jin, Sael Lee, and U Kang

DM864 "Asymptotic Analysis of Equivalences and Core-Structures in Kronecker-Style Graph Models"
Alex Chin, Timothy Goodrich, Michael O'Brien, Felix Reidl, Blair Sullivan, and Drew van der Poel

DM328 "Fractality of Massive Graphs: Scalable Analysis with Sketch-Based Box-Covering Algorithm"
Takuya Akiba, Kenko Nakamura, and Taro Takaguchi

DM442 "Cut Tree Construction from Massive Graphs"
Takuya Akiba, Yoichi Iwata, Yosuke Sameshima, Naoto Mizuno, and Yosuke Yano

Session B5: Feature Selection

Session Chair: Miguel A. Carreira-Perpinan (UC Merced, USA)

REGULAR PAPERS:

DM327 "Robust Multi-View Feature Selection"
Hongfu Liu, Haiyi Mao, and Yun Fu

DM787 "A Fast Iterative Algorithm for Improved Unsupervised Feature Selection"
Bruno Ordozgoiti, Sandra Gómez Canaval, and Alberto Mozo

DM1065 "Unsupervised Feature Selection for Outlier Detection by Modelling Hierarchical Value-Feature Couplings"
Guansong Pang, Longbing Cao, Ling Chen, and Huan Liu

SHORT PAPERS:

DM380 "Feature Grouping using Weighted l1 norm for High-Dimensional Data"
Bhanukiran Vinzamuri, Karthik Padthe, and Chandan Reddy

DM877 "Toward Time-Evolving Feature Selection on Dynamic Networks"
Jundong Li, Xia Hu, Ling Jian, and Huan Liu

DM997 "Online Unsupervised Multi-view Feature Selection"
Weixiang Shao, Lifang He, Chun-Ta Lu, Xiaokai Wei, and Philip Yu

DM1097 "ExploreKit: Automatic Feature Generation and Selection"
Gilad Katz, Richard Shin, and Dawn Song

Session C5: Privacy

Session Chair: Tamir Tassa (Open University, Israel)

REGULAR PAPERS:

DM364 "Beyond Points and Paths: Counting Private Bodies"
Maryam Fanaeepour and Benjamin Rubinstein

DM436 "Differentially Private Regression Diagnostics"
Yan Chen, Ashwin Machanavajjhala, Jerome Reiter, and Andres Barrientos

DM767 "Auditing Black-box Models for Indirect Influence"
Philip Adler, Casey Falk, Sorelle Friedler, Gabriel Rybeck, Carlos Schedegger, Brandon Smith, and Suresh Venkatasubramanian

DM769 "College Student Scholarships and Subsidies Granting: A Multi-Modal Multi-Label Approach"
Han-Jia Ye, De-Chuan Zhan, Xiaolin Li, Zhen-Chuan Huang, and Yuan Jiang

SHORT PAPERS:

DM245 "Differential Location Privacy for Sparse Mobile Crowdsensing"
Leye Wang, Daqing Zhang, Dingqi Yang, Brian Y. Lim, and Xiaojuan Ma

DM434 "Scalable Block Scheduling for Efficient Multi-Database Record Linkage"
Thilina Ranbaduge, Dinusha Vatsalan, and Peter Christen

Tutorial 3

The Evolution of Natural Language Understanding and Prediction Technologies: from Formal Grammars to Large Scale Machine Learning

Social Event & Gala Dinner

Wine tasting, tour and dinner at Caves Codorniu.

Keynote 3

Big Data or Big Garbage? A Tale of a Research Journey for Real-Time Business Intelligence 


Rakesh Agrawal, Data Insights Laboratories



Session Chair: Francesco Bonchi (ISI Foundation, Italy / Eurecat, Spain)
10:30-11:00

Coffee Break

Session C6: Anomalies and Outliers

Session Chair: Takashi Washio (Osaka University, Japan)

REGULAR PAPERS:

DM273 "Probabilistic-Mismatch Anomaly Detection: Do one's Medications Match with the Diagnoses?"
Lingxiao Zhang, Xiang Li, Heifeng Liu, Jing Mei, Gang Hu, Junfeng Zhao, Bing Xie, and Guotong Xie

DM274 "Regularized Large Margin Distance Metric Learning"
Ya Li, Xinmei Tian, and Dacheng Tao

DM806 "Subspace Outlier Detection in Linear Time with Randomized Hashing"
Saket Sathe and Charu Aggarwal

SHORT PAPERS:

DM537 "Sequential Ensemble Learning for Outlier Detection: A Bias-Variance Perspective"
Shebuti Rayana, Wen Zhong, and Leman Akoglu

DM925 "Outlier Detection from Network Data with Subnetwork Interpretation"
Xuan-Hong Dang, Arllei Silva, Ambuj K Singh, Prithwish Basu, and Ananthram Swami

DM1044 "Sparse Gaussian Markov Random Field Mixtures for Anomaly Detection"
Tsuyoshi Ide, Ankush Khandelwal, and Jayant Kalagnanam

DM1058 "Low-Rank Sparse Feature Selection for Patient Similarity Learning"
Mengting Zhan, Shilei Cao, Shiyu Chang, Jishang Wei, and Buyue Qian

DM254 "Structure-Preserved Multi-Source Domain Adaptation"
Hongfu Liu, Ming Shao, and Yun Fu

DM938 "Gaussian Component based Index for GMMs"
Linfei Zhou, Bianca Wackersreuther, Frank Fiedler, Claudia Plant, and Christian Boehm

Session A6: Sequences and Time Series

Session Chair: Adytia Prakash (Virginia Tech, USA)

REGULAR PAPERS:

DM567 "Time Series Motifs: Exploiting a Novel Algorithm and GPUs to break the one Hundred Million Barrier"
Yan Zhu, Zachary Zimmerman, Nader Shakibay Senobari, Chin-Chia Michael Yeh, Gareth Funning, Abdullah Mueen, Philip Brisk, and Eamonn Keogh

DM248 "Visualization of Salient Subsequences in Time Series"
Chin-Chia Michael Yeh, Helga Van Herle, and Eamonn Keogh

DM920 "Fast Warping Distance for Sparse Time Series"
Abdullah Mueen, Nikan Chavoshi, Noor Abu-El-Rub, Hossein Hamooni, and Amanda Minnich

SHORT PAPERS :

DM295 "Frequent Sequence Mining with Subsequence Constraints"
Kaustubh Beedkar and Rainer Gemulla

DM559 "Second-order Online Active Learning and Its Applications"
Shuji Hao, Peilin Zhao, Jing Lu, Steven C.H. Hoi, and Chunyan Miao

DM1007 "HIVE-COTE: The Hierarchical Vote Collective of Transformation-based Ensembles for Time Series Classification"
Jason Lines, Sarah Taylor, and Anthony Bagnall

DM221 "All Pairs Similarity Joins for Time Series Subsequences"
Chin-Chia Michael Yeh, Yan Zhu, Liudmila Ulanova, Nurjahan Begum, Yifei Ding, Hoang Anh Dau, Diego Silva, Abdullah Mueen, and Eamonn Keogh

DM261 "Prefix and Suffix Invariant Dynamic Time Warping"
Diego Silva, Gustavo E.A.P.A. Batista, and Eamonn Keogh

DM378 "Dynamic Poisson Factor Analysis"
Yizhe Zhang, Ricardo Henao, and Lawrence Carin

Session B6: Biomedical

Session Chair: Shinici Morishita (University of Tokyo, Japan)

REGULAR PAPERS:

DM807 "Predicting COPD Failure by Modeling Hazard in Longitudinal Clinical Data"
Jianfei Zhang, Shengrui Wang, Josiane Courteau, Lifei Chen, and Alain Vanasse

DM1060 "Measuring Patient Similarities via A Deep Architecture with Medical Concept Embedding"
Zihao Zhu, Changchang Yin, Yu Cheng, Jishang Wei, and Buyue Qian

DM1104 "New Probabilistic Multi-Graph Decomposition Model to Identify Consistent Human Brain Network Modules"
Heng Huang

DM373 "Transfer Learning for Survival Analysis via Efficient L2,1-norm Regularized Cox Regression"
Yan Li, Lu Wang, Jie Wang, Jieping Ye, and Chandan Reddy

DM932 "New Robust Clustering Model for Identifying Cancer Genome Landscapes"
hongchang gao, Xiaoqian Wang, and Heng Huang

SHORT PAPERS:

DM489 "Efficient Algorithms for the Three Locus Problem in Genome-wide Association Study"
Sanguthevar Rajasekaran and Subrata Saha

DM626 "Patterns in Cognitive Rehabilitation of Traumatic Brain Injury Patients: A Text Mining Approach"
Alejandro Garcia-Rudolph, Alberto Garcia-Molina, Eloy Opisso, and Josep Maria Tormos

Tutorial 4

Core Decomposition of Networks: concepts, algorithms and applications
13:00-14:30

Lunch Break

Session A7: Graphs 2

Session Chair: Esther Galbrun (INRIA, France)

REGULAR PAPERS:

DM422 "On Dense Subgraphs in Signed Network Streams"
Jose Cadena, Anil Vullikanti, and Charu Aggarwal

DM1059 "Graph-Structured Sparse Optimization for Connected Subgraph Detection"
Baojian Zhou and Feng Chen

DM762 "Hyperbolae Are No Hyperbole: Modelling Communities That Are Not Cliques"
Saskia Metzler, Stephan Günnemann, and Pauli Miettinen

SHORT PAPERS:

DM223 "Efficient and scalable detection of overlapping communities in big networks"
Tianshu Lyu, Lidong Bing, Zhao Zhang, and Yan Zhang

DM593 "Mining Summaries for Knowledge Graph Search"
Qi Song, Yinghui Wu and Xin Luna Dong

DM490 "MeGS: Partitioning Meaningful Subgraph Structures using Minimum Description Length"
Sebastian Goebl, Annika Tonch, Christian Böhm, and Claudia Plant

DM1028 "Learning from your network of friends: a trajectory representation learning model based on online social ties"
Basma Alharbi and Xiangliang Zhang

Session B7: Distributed and High Performance Computing

Session Chair: Alexandre Termier (University of Rennes, France)

REGULAR PAPERS:

DM453 "Partition Aware Connected Component Computation in Distributed Systems"
Ha-Myung Park, Namyong Park, Sung-Hyon Myaeng, and U Kang

DM886 "Efficient Distributed SGD with Variance Reduction"
Soham De and Thomas Goldstein

DM931 "HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent"
Huan Zhang, Cho-Jui Hsieh, and Venkatesh Akella

SHORT PAPERS:

DM1066 "Spectrum-Revealing Cholesky Factorization for Kernel Methods"
Jianwei Xiao and Ming Gu

DM421 "Efficient and Distributed Algorithms for Large-Scale Generalized Canonical Correlations Analysis"
Xiao Fu, Kejun Huang, Evangelos Papalexakis, Hyun Ah Song, Partha Talukdar, Nicholas Sidiropoulos, Christos Faloutsos, and Tom Mitchell

DM703 "One-pass Logistic Regression for Label-drift and Large-scale Classification on Distributed Systems"
Vu Nguyen, Tu Dinh Nguyen, Trung Le, Dinh Phung, and Svetha Venkatesh

DM708 "A Fast Distributed Classification Algorithm for Large-scale Imbalanced Data"
Huihui Wang, Yang Gao, Yinghuan Shi, and Hao Wang

Session C7: Supervised Learning 2

Session Chair: Ricard Gavalda (UPC, Spain)

REGULAR PAPERS:

DM966 "Binary Classifier Calibration using an Ensemble of Near Isotonic Regression Models"
Mahdi Pakdaman Naeini and Gregory Cooper

DM1024 "Fixing the Convergence Problems in Parallel Asynchronous Dual Coordinate Descent"
Huan Zhang and Cho-Jui Hsieh

SHORT PAPERS:

DM347 "Learning Task Relational Structure for Multi-Task Feature Learning"
De Wang, Feiping Nie, and Heng Huang

DM562 "Scalable Discrete Supervised Hash Learning with Asymmetric Matrix Factorization"
Shifeng Zhang, Jianmin Li, Jinma Guo, and Bo Zhang

DM691 "Budgeted Batch Bayesian Optimization"
Vu Nguyen, Santu Rana, Sunil Gupta, Cheng Li, and Svetha Venkatesh

DM694 "Faster Kernels for Graphs with Continuous Attributes via Hashing"
Christopher Morris, Nils M. Kriege, Kristian Kersting, and Petra Mutzel

DM922 "Efficient Sampling-Based Kernel Mean Matching"
Swarup Chandra, Ahsanul Haque, Latifur Khan, and Charu Aggarwal

DM1117 "Sublinear Dual Coordinate Ascent for Regularized Loss Minimization"
liu liu and Dacheng Tao

Tutorial 4

Core Decomposition of Networks: concepts, algorithms and applications
16:10-16:30

Coffee Break

Session A8: Social

Session Chair: Neil Shah (CMU, USA)

REGULAR PAPERS:

DM505 "Vote-and-Comment: Modeling the Coevolution of User Interactions in Social Voting Web Sites"
Alceu Ferraz Costa, Agma Juci Machado Traina, Caetano Traina Jr., and Christos Faloutsos

DM661 "Homophily, Structure, and Content Augmented Network Representation Learning"
Daokun Zhang, Jie Yin, Xingquan Zhu, and Chengqi Zhang

DM805 "TO BE OR NOT TO BE FRIENDS: Exploiting Social Ties for Venture Investments"
Hao Zhong, Chuanren Liu, Xinjiang Lu, and Hui Xiong

SHORT PAPERS:

DM287 "Steering Social Media Promotions with Effective Strategies"
Kun Kuang, Meng Jiang, Peng Cui, and Shiqiang Yang

DM464 "HLGPS: A Home Location Global Positioning System in Location-Based Social Networks"
Yulong Gu, Jiaxing Song, Weidong Liu, and Lixin Zou

DM666 "A combinatorial approach to role discovery"
Albert Arockiasamy, Aristides Gionis, and Nikolaj Tatti

DM995 "HNP3: A Hierarchical Nonparametric Point Process for modeling Content Diffusion Over Social Media"
SeyedAbbas Hosseini, Ali Khodadadi, Ali Arabzadeh, and Hamid R. Rabiee

Session B8: Streaming

Session Chair: Albert Bifet (Télécom ParisTech, France)

REGULAR PAPERS:

DM418 "KNN classifier with self adjusting memory for heterogeneous concept drift"
Viktor Losing, Barbara Hammer, and Heiko Wersing

DM706 "Streaming Model Selection via Online Factorized Asymptotic Bayesian Inference"
Liu Chunchen, Feng Lu, and Ryohei Fujimaki

DM727 "ConTrack: A Scalable Method For Tracking Multiple Concepts In Large Scale Multidimensional Data"
Ali Zonoozi, Qirong Ho, Shonali Krishnaswamy, and Gao Cong

SHORT PAPERS:

DM602 "ROM: A Robust Online Multi-Task Learning Approach"
CHI ZHANG, PEILIN ZHAO, Shuji Hao, YENG CHAI SOH, and BU SUNG LEE

DM676 "Multi-Label Learning with Emerging New Labels"
Yue Zhu, Kai-Ming Ting, and Zhi-Hua Zhou

DM243 "EXTRACT: Strong Examples from Weakly-Labeled Sensor Data"
Davis Blalock and John Guttag

DM456 "Spell: Streaming Parsing of System Event Logs"
Min Du and Feifei Li

Session C8: Semi-supervised and Active Learning

Session Chair: Min-Ling Zhang (Southeast University, China) REGULAR PAPERS:
DM500 "Adaptive Neighborhood Propagation by Joint L2,1-norm Regularized Sparse Coding for Representation and Classification"
Lei Jia, Zhao Zhang, Lei Wang, Weiming Jiang, and Mingbo Zhao

DM856 "An Augmented LSTM Framework to Construct Medical Self-diagnosis Android"
Chaochun Liu, Huan Sun, Nan Du, Shulong Tan, Hongliang Fei, Wei Fan, Tao Yang, Hao Wu, Yaliang Li, and Chenwei Zhang

SHORT PAPERS:

DM446 "Semi-Supervised Multi-Label Dimensionality Reduction"
Baolin Guo, Chenping Hou, Feiping Nie, and Dongyun Yi

DM471 "Reliable Semi-supervised Learning"
Junming Shao, Chen Huang, Qinli Yang, and Guangchun Luo

DM601 "A Semi-supervised AUC Optimization Method with Generative Models"
Akinori Fujino and Naonori Ueda

DM725 "A Novel Uncertainty Sampling Algorithm for Cost-sensitive Multiclass Active Learning"
Kuan-Hao Huang and Hsuan-Tien Lin

DM748 "Can Active Learning Experience Be Transferred?"
Hong-Min Chu and Hsuan-Tien Lin

DM893 "Incorporating Expert Feedback into Active Anomaly Discovery"
Shubhomoy Das, Weng-Keen Wong, Thomas Dietterich, Alan Fern, and Andrew Emmott

18:10

Closing

ICDM 2016