KDD-2004
The Tenth ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining

KDD-2004 Workshop on
Data Mining Standards, Services and Platforms (DM-SSP 04)

Sunday, August 22, 2004
Seattle, WA

Web Site: www.ncdm.uic.edu/workshops/dm-ssp04.htm



 
Program
Location: Baker Room
 
Time Author Title
9:00-10:00
Jamie MacLennan, Microsoft
Invited Talk. Vectors on Data Mining: How standards and platforms will impact  the near future of Data Mining
10:00-10:30
Break
10:30-12:00
Session 1: Standards
Stanley R. M. Oliveira and Osmar R. Zaiane, University of Alberta
Toward Standardization in Privacy-Preserving Data Mining   full text
Mark F. Hornick, Oracle
Java Data Mining (JSR-73): Status and Overview   full text
Gregor Meyer, IBM, and Robert Grossman, University of Illinois at Chicago and Open Data Partners
A Simple Strategy for Composing Data Mining Operations   full text
12:00-1:30 Lunch
1:30-3:00 Session 2: Services and Platforms
Robert Chu, SAS Web Services Standards for Data Mining   full text
Bill Hosken and Bernard Scherer, SPSS
Distributed Scoring Using PMML   abstract
Robert Grossman, University of Illinois at Chicago and Open Data Partners and David Hanley University of Illinois at Chicago
Experimental Studies Scaling Web Services For Data Mining   full text
3:00-3:30
Break
3:30-4:45
Session 3: Proposals 3.1:
Stefan Raspl, IBM
PMML Version 3.0 - Overview and Status   full text
Data Mining Group PMML Working Group
Proposals for PMML Version 3.1

Proceedings of the Second Annual Workshop on Data Mining Standards, Services and Platforms (pdf).
 
Workshop Description
 

Various data mining standards have matured and are now being deployed in a variety of products. With the maturity of standards, a variety of standards based data mining services and platforms can now be much more easily developed. Related fields such as data grids, web services, and the semantic web have also developed standards based infrastructures and services relevant to KDD. These new standards and standards based services and platforms have the potential for changing the way the data mining is used.

Talks in the workshop will cover current and emerging standards and standards based services for statistical and data mining models, for data preparation, for building models, for scoring, for workflow, and for related topics. In addition, the workshop will include talks on requirements and on standards based data mining services and platforms.

Talks on new and proposed requirements for standards are also welcome.

In addition, talks on closed related topics in fields such as grids, web services, and the semantic web, which have produced infrastructures and services relevant to KDD, are also welcome.

 
Workshop Format
 
The workshop will consist of both invited and contributed talks. Some of the invited talks will provide an update on PMML and emerging web service standards for data mining.
 
All papers should be submitted by email in pdf format to workshop at ncdm dot uic dot edu with the subject line "DM-SSP 04 Workshop submission" by May 26, 2004. Please use the prescribed formatting guidelines of KDD (Springer LNCS).
 
The workshop proceedings will be published by the ACM and distributed during the workshop. They will also be available on the workshop's home page. The full version of the accepted papers will be considered for publication in an edited proceedings after a second round of reviews, pending approval.
 
Papers should be no longer than 12 pages (or 5,000 words) inclusive of all references and figures. All papers must be submitted in either PDF (preferred) or postscript. Please ensure that any special fonts used are included in the submitted documents. All papers must be original, and have not been published elsewhere.
 
For questions, please email workshop A T ncdm uic edu with the subject line "DM-SSP 04 Workshop question".
SUBMIT PAPERS TO: workshop A T ncdm uic edu with the subject line "DM-SSP 04 Workshop submission".
 
 
Important Dates
 
Electronic submission of titles and abstracts May 26, 2004
Electronic submission of full papers June 2, 2004
Acceptance notification June 27, 2004
Camera-ready papers due (hard deadline) July 9, 2004
Workshop in Seattle August 22, 2004
 
Workshop Topics
 

Topics appropriate for the workshop include the following topics and closely related topics.

  • Maturing Standards
    • Predictive Model Markup Language (PMML)
    • XML for Analysis
    • SQL/MM Part 6: Data Mining
    • Java Data Mining (JDM) - Java Specification Request 73 (JSR-73)
    • CRoss Industry Standard Process for Data Mining (CRISP-DM)
    • OMG Common Warehouse Metadata (CWM) for Data Mining
  • Related Standards
    • Semantic Web Standards (RDF, OWL, etc.)
    • Web services (SOAP/XML, WSDL, UDDI, etc)
    • Grid services (WSRF, OGSI, etc.)
  • Emerging Standards
    • standards for KDD workflow
    • standards for data transformations
    • standards for real time data mining
    • standards for data webs
  • Standards Based Data Mining Services
    • Scoring services
    • Analysis services
    • Data exploration services
    • Statistical modeling services
  • Standards Based Platforms
    • Web service based platforms
    • Data grid platforms
    • Data web platforms
    • Knowledge grid platforms
  • Standards Based Open Source Efforts
    • R
    • Weka
    • GNU Octave
 
 
Organizing Committee
 
Robert Grossman (chair) University of Illinois at Chicago
and Open Data Partners
Robert Chu SAS
Mark Hornick Oracle
Dustin Hux Elder Research Inc.
Dave Selinger Amazon.com
Zhaohui Tang Microsoft
Kurt Thearling Capital One