
2008 / 869 pages + index / CD / ISBN: 9780898716542 / List Price $191.00 / Member Price $133.70 / Order Code PR130
Symposium held in Atlanta, GA, April 24-26, 2008.
Contents
Message from the Conference Co-Chairs; Preface; SDM 2008 Conference Organization; Program Committee; External Reviewers; Semi-Supervised Clustering via Matrix Factorization; Creating a Cluster Hierarchy under Constraints of a Partially Known Hierarchy; Constrained Co-clustering of Gene Expression Data; DATA PEELER: Constraint-Based Closed Pattern Mining in n-ary Relations; SpaRClus: Spatial Relationship Pattern-Based Hierarchical Clustering; Mining Tree Patterns with Almost Smallest Supertrees; Maximal Quasi-Bicliques with Balanced Noise Tolerance: Concepts and Co-clustering Applications; CISpan: Comprehensive Incremental Mining Algorithms of Closed Sequential Patterns for Multi-Versional Software Mining; Mining Association Rules of Simple Conjunctive Queries; Discovering Relational Item Sets Efficiently; A Stagewise Least Square Loss Function for Classification; Semi-Supervised Learning Based on Semiparametric Regularization; Roughly Balanced Bagging for Imbalanced Data; An Efficient Local Algorithm for Distributed Multivariate Regression in Peer-to-Peer Networks; Aerosol Optical Depth Prediction from Satellite Observations by Multiple Instance Regression; Feature Selection with the logRatio Kernel; A RELIEF Based Feature Extraction Algorithm; Deterministic Latent Variable Models and Their Pitfalls; Massive-Scale Kernel Discriminant Analysis: Mining for Quasars; Dynamic Non-Parametric Mixture Models and Recurrent Chinese Restaurant Process: With Applications to Evolutionary Clustering; Latent Variable Mining with Its Applications to Anomalous Behavior Detection; Similarity Measures for Categorical Data: A Comparative Evaluation; Gaussian Process Learning for Cyber-Attack Early Warning; Practical Private Computation and Zero-Knowledge Tools for Privacy-Preserving Distributed Data Mining; A Spamicity Approach to Web Spam Detection; Semantic Smoothing for Bayesian Text Classification with Small Training Data; Clustering from Constraint Graphs; Efficiently Mining Closed Subsequences with
Gap Constraints; Semi-Supervised Classification with Universum; Finding Subgroups Having Several Descriptions: Algorithms for Redescription Mining; The PageTrust Algorithm: How to Rank Web Pages When Negative Links Are Allowed?; A Pattern Mining Approach toward Discovering Generalized Sequences Signatures; The Asymmetric Approximate Anytime Join: A New Primitive with Applications to Data Mining; Preemptive Measures against Malicious Party in Privacy-Preserving Data Mining; A Range Query Approach for High Dimensional Euclidean Space Based on EDM Estimation; A Bayesian Technique for Estimating the Credibility of Question Answerers; Semi-supervised Multi-label Learning by Solving a Sylvester Equation; Exploiting Structured Reference Data for Unsupervised Text Segmentation with Conditional Random Fields; Graph Mining with Variational Dirichlet Process Mixture Models; Direct Density Ratio Estimation for Large-scale Covariate Shift Adaptation; ROC-tree: A Novel Decision Tree Induction Algorithm Based on Receiver Operating Characteristics to Classify Gene Expression Data; Semi-supervised Learning of a Markovian Metric; Mining Abnormal Patterns from Heterogeneous Time-Series with Irrelevant Features for Fault Event Detection; Outlier Detection with Uncertain Data; Randomization of Real-Valued Matrices for Assessing the Significance of Data Mining Results; Theoretical Analysis of Subsequences Time-Series Clustering from a Frequency-Analysis Viewpoint; Active Learning with Model Selection in Linear Regression; A Feature Selection Algorithm Capable of Handling Extremely Large Data Dimensionality; Generic Methods for Multicriteria Evaluation; A New Method for Rule Finding via Bootstrapped Confidence Intervals; Mining and Ranking Generators of Sequential Patterns; Type Independent Correction of Sample Selection Bias via Structural Discovery and Rebalancing; Exploration and Reduction of the Feature Space by Hierarchical Clustering; On the Dangers of Cross-Validation.
An Experimental Evaluation; Mining Complex, Maximal and Complete Subgraphs and Sets of Correlated Variables with Applications to Feature Subset Selection; Spatio-Temporal Partitioning for Improving Aerosol Prediction Accuracy; On Indexing High Dimensional Data with Uncertainty; Efficient Distribution Mining Classification; Mining Sequence Classifiers for Early Prediction; Exact and Approximate Reverse Nearest Neighbor Search for Multimedia Data; Finding a Haystack in Haystacks—Simultaneous Identification of Concepts in Large Bio-Medical Corpora; Learning Markov Network Structure Using Few Independence Tests; Statistical Density Prediction in Traffic Networks; Proximity Tracking on Time-Evolving Bipartite Graphs; Integration of Multiple Networks for Robust Label Propagation; Spatial Scan Statistics for Graph Clustering; Randomizing Social Networks: A Spectrum Preserving Approach; Efficient Maximum Margin Clustering via Cutting Plane Algorithm; Robust Clustering in Arbitrarily Oriented Subspaces; The Relevant-set Correlation Model for Data Clustering; Cluster Ensemble Selection; Weighted Consensus Clustering; A General Framework for Estimating Similarity of Datasets and Decision Trees: Exploring Semantic Similarity; A General Model for Multiple View Unsupervised Learning; Unsupervised Segmentation of Conversational Transcripts; Large-Scale Many-Class Learning; Simultaneous Unsupervised Learning of Disparate Clusterings; Author Index.