6th Workshop on Data Mining for Medicine and Healthcare

April 29, 2017, Houston, TX

To be held in conjunction with 17th SIAM International Conference on Data Mining (SDM 2017)


In virtually every country, the cost of healthcare is increasing more rapidly than the willingness and the ability to pay for it. At the same time, more and more data is being captured around healthcare processes in the form of Electronic Health Records (EHR), health insurance claims, medical imaging databases, disease registries, spontaneous reporting sites, and clinical trials. As a result, data mining has become critical to the healthcare world. 

On the one hand, EHR offers the data that gets data miners excited, however on the other hand, is accompanied with challenges such as 1) the unavailability of large sources of data to academic researchers, and 2) limited access to data-mining experts. Healthcare entities are reluctant to release their internal data to academic researchers and in most cases there is limited interaction between industry practitioners and academic researchers working on related problems.

The objectives of this workshop are to:

1. Bring together researchers (from both academia and industry) as well as practitioners to present their latest problems and ideas.

2. Attract healthcare providers who have access to interesting sources of data and problems but lack the expertise in data mining to use the data effectively.

3. Enhance interactions between data mining, text mining and visual analytics communities working on problems from medicine and healthcare.

SDM is a unique venue for this workshop as leading researchers and practitioners from academia and industry will be able to participate. A workshop where healthcare professionals can have an audience, present and discuss their problems, views and ideas on the field as well as pose research challenges will attract them to SDM. The organizers of this proposed workshop have continuous and in-depth contact with people working on healthcare applications of data mining and healthcare professionals in the US and Europe which will attract a broad and varied set of participants. We believe that this workshop will serve as a bridge between the traditional SDM community and healthcare professionals - two groups of participants that have a lot to learn from and share with each other.

Topics of Interest

Topic areas for the workshop include (but are not limited to) the following: 

• Statistical analysis and characterization of healthcare data
• Text mining - mining free text in electronic medical records
• Visual analysis and exploration of longitudinal clinical trial data
• Meaningful use of healthcare data for improved patient care and cost-reduction
• Data quality assessment and improvement: preprocessing, cleaning, missing data treatment etc.
• Pattern detection and hypothesis generation from observational data
• Visualization of prescriptions drugs and interactions
• Privacy and security issues in healthcare
• Information fusion and knowledge transfer in healthcare
• Evolutionary and longitudinal patient and disease models
• Medical fraud detection
• Help with ICD 9 to ICD 10 conversions
• Health Information exchanges

Program Committee (Tentative)

Riccardo Bellazzi, University of Pavia
Joyce Ho, Emory University
Andreas Holzinger, Medical University Graz
Ying Li, IBM T.J. Watson Research Center
Mykola Pechenizky, Technical University Eindhoven
Chandan Reddy, Virginia Tech
Niels Peek, University of Manchester
Igor Pernek, Research Studios Austria
Yiye Zhang, Cornell University
Jiayu Zhou, Samsung Research America

Important Dates

Paper Submission: Jan. 15, 2017 (extended)

Notification of Acceptance: Feb. 1, 2017

Camera Ready Paper Due: Mar. 1, 2017

Workshop: Apr. 29, 2017

Submission Information

Selected papers will be recommended to a special issue on Journal of Health Informatics Research.

All submissions must be made electronically at https://easychair.org/conferences/?conf=sdmdmmh2017.

Papers submitted to this workshop must not have been accepted or be under review by another conference with a published proceedings or by a journal. The work may be either theoretical or applied.

The workshop accepts short (4-6 pages) and long papers (up to 9 pages) with US Letter (8.5" x 11") paper size (single-spaced, 2 column, 10 point font, and at least 1" margin on each side). Papers must have an abstract with a maximum of 300 words and a keyword list with no more than 6 keywords.

We would like to encourage you to prepare your paper in LaTeX2e. Papers should be formatted using the SIAM SODA macro, which is available through the SIAM website. You can access it at http://www.siam.org/proceedings/macros.php. The filename is soda2e.all. Make sure you use the macros for SODA and Data Mining Proceedings; papers prepared using other proceedings macros will not be accepted.

For Microsoft Word users, please convert your document to the PDF format.  Since there is no Microsoft Word Template, please visit http://www.siam.org/proceedings/ to view the format of previous papers.

All submissions should clearly present the author information including the names of the authors, the affiliations and the emails.

Workshop Schedule

08:00 AM – 12:00 PMApril 29th, 2017
Room: Tanglewood

Session Chair: Xia “Ben” Hu
Texas A&M University, USA

08:00 – 08:15 Opening Ceremony

08:15 – 09:15 Keynote Talk: Mixed Graphical Models with Applications to Integrative Cancer Genomics
Prof. Genevera Allen, Rice University

09:20 – 09:40 Oral Presentation: A Data Adaptive Categorical Time Series Representation for Supervised Learning
Hande Cakin, Mustafa Baydoğan, Kerem Tuncel, Boğaziçi University; Na Zou, Texas A&M University; and Jing Li, Arizona State University

09:40 – 10:00 Oral Presentation: Subtyping Patients with Parkinson's Disease Using Long-Short Term Memory Model
Xi Zhang, Fei Wang, Cornell University

10:00 – 10:15 Coffee Break

10:15 – 11:15 Keynote Talk: Lost in Data – Finding the Way Out
Prof. Xiaoning Qian, Texas A&M University

11:20 – 11:40 Oral Presentation: Switching-State Dynamical Modeling of Daily Behavioral Data
Randy Ardywibowo, Texas A&M University; Shuai Huang, University of Washington; Shupeng Gui, University of Rochester; Cao Xiao, Yu Cheng, IBM T.J. Watson Research Center; Ji Liu, University of Rochester; and Xiaoning Qian, Texas A&M University.

11:40 – 12:00pm Oral Presentation: On Comprehensive Mass Spectrometry Data Analysis for Proteome Profiling of Human Blood Samples
Sameer Manchanda, Mikaela Meyer, Purdue University; Qianqian Li, Chinese Academy of Sciences; Nan Kong, Purdue University; Kai Liang and Yan Li, Chinese Academy of Sciences.

12:00 -- 12:20 Oral Presentation: Identifying Monitoring Sets for Epidemic Surveillance in Urban Areas
Jose Cadana. Virginia Tech.

Invited Speakers


Workshop Chairs

 Honorary Chair

Zoran Obradovic
Temple University

General Chairs

Nitesh Chawla
University of Notre Dame
Gregor Stiglic
University of Maribor

Program Chairs

Fei Wang
Cornell University
Xia Hu
Texas A&M University

Note: for inquiries please send e-mail to feiwang03@gmail.com

Previous DMMH Workshops

First Workshop on Data Mining for Medicine and Healthcare was organized at KDD 2011 conference in San Diego, CA. The workshop was implemented as a full-day workshop with 2 invited speakers, 6 full papers and 4 short papers.

Keynote lectures and the panel are available at http://videolectures.net/datamining2011_san_diego/.

Information on the 2nd, 3rd and 4th Workshop on Data Mining for Medicine and Healthcare can be found at DMMH-SDM 2013DMMH-SDM 2014DMMH-SDM 2015 and DMMH-SDM 2016 websites.