Contact The MIT Press Information on how to order from The MIT Press Access your saved shopping cart, e-mail list subscriptions, order history, address book, and other info in the Your Profile area MIT Press Home Page


September 2006
8 x 10, 528 pp., 98 illus.
$52.00/£38.95 (CLOTH)
Short

ISBN-10:
0-262-03358-5
ISBN-13:
978-0-262-03358-9

Other Editions
Paper (2010)
Series
Adaptive Computation and Machine Learning
Related Links
Open this site in a new browser window.
Find this book in a library
< BACK
Semi-Supervised Learning
Edited by Olivier Chapelle, Bernhard Schölkopf and Alexander Zien

Series Forewordxi
Prefacexiii
1.Introduction to Semi-Supervised Learning
Download Chapter as PDF Sample Chapter - Download PDF (211 KB)
1
1.1Supervised, Unsupervised, and Semi-Supervised Learning1
1.2When Can Semi-Supervised Learning Work?4
1.3Classes of Algorithms and Organization of This Book8
I.Generative Models13
2.A Taxonomy for Semi-Supervised Learning Methods
Matthias W. Seeger
15
2.1The Semi-Supervised Learning Problem15
2.2Paradigms for Semi-Supervised Learning17
2.3Examples22
2.4Conclusions31
3.Semi-Supervised Text Classification Using EM
N. C. Nigam, Andrew McCallum and Tom Mitchell
33
3.1Introduction33
3.2A Generative Model for Text35
3.3Experminental Results with Basic EM41
3.4Using a More Expressive Generative Model43
3.5Overcoming the Challenges of Local Maxima49
3.6Conclusions and Summary54
4.Risks of Semi-Supervised Learning
Fabio Cozman and Ira Cohen
57
4.1Do Unlabled Data Improve or Degrade Classification Performance?57
4.2Understanding Unlabeled Data: Asymptotic Bias59
4.3The Asymptotic Analysis of Generative Smei-Supervised Learning63
4.4The Value of Labeled and Unlabeled Data67
4.5Finite Sample Effects69
4.6Model Search and Robustness70
4.7Conclusion71
5.Probabilistic Semi-Supervised Cluster with Constraints
Sugato Basu, Mikhail Bilenko, Arindam Banerjee and Raymond J. Mooney
73
5.1Introduction74
5.2HMRF Model for Semi-Supervised Clustering75
5.3HMRF-KMeans Algorithm81
5.4Active Learning for Constraint Acquistion93
5.5Experimental Results96
5.6Related Work100
5.7Conclusions101
II.Low-Density Separation103
6.Transductive Support Vector Machines
Thorsten Joachims
105
6.1Introduction105
6.2Transductive Support Vector Machines108
6.3Why Use Margin on the Test Set?111
6.4Experiments and Applications of the TSVMs112
6.5Solving the TSVM Optimization Problem114
6.6Connection to Related Approaches116
6.7Summary and Conclusions116
7.Semi-Supervised Learning Using Semi-Definite Programming
Tijl De Bie and Nello Cristianini
119
7.1Relaxing SVM transduction119
7.2An Approximation for Speedup126
7.3General Semi-Supervised Learning Settings128
7.4Empirical Results129
7.5Summary and Outlook133
Appendix:
The Extended Schur Complement Lemma
134
8.Gaussian Processes and the Null-Category Noise Model
Neil D. Lawrence and Michael I. Jordan
137
8.1Introduction137
8.2The Noise Model141
8.3Process Model and the Effect of the Null-Category143
8.4Posterior Inference and Prediction145
8.5Results147
8.6Discussion149
9.Entropy Regularization
Yves Grandvalet and Yoshua Bengio
151
9.1Introduction151
9.2Derivation of the Criterion152
9.3Optimization Algorithms155
9.4Related Methods158
9.5Experiments160
9.6Conclusion166
Appendix
Proof of Theorem 9.1
166
10.Data-Dependent Regularization
Adrian Corduneanu and Tommi S. Jaakkola
169
10.1Introduction169
10.2Information Regularization on Metric Spaces174
10.3Information Regularization and Relational Data182
10.4Discussion189
III.Graph-Based Models191
11.Label Propogation and Quadratic Criterion
Yoshua Bengio, Olivier Delalleau and Nicolas Le Roux
193
11.1Introduction193
11.2Label Propogation on a Similarity Graph194
11.3Quadratic Cost Criterion198
11.4From Transduction to Induction205
11.5Incorporating Class Prior Knowledge205
11.6Curse of Dimensionality for Semi-Supervised Learning206
11.7Discussion215
12.The Geometric Basis of Semi-Supervised Learning
Vikas Sindhwani, Misha Belkin and Partha Niyogi
217
12.1Introduction217
12.2Incorporating Geometry in Regularization220
12.3Algorithms224
12.4Data-Dependent Kernels for Semi-Supervised Learning229
12.5Linear Methods for Large-Scale Semi-Supervised Learning231
12.6Connections to Other Algorithms and Related Work232
12.7Future Directions234
13.Discrete Regularization
Dengyong Zhou and Bernhard Schölkopf
237
13.1Introduction237
13.2Discrete Analysis239
13.3Discrete Regularization245
13.4Conclusion249
14.Semi-Supervised Learning with Conditional Harmonic Mixing
Christopher J. C. Burges and John C. Platt
251
14.1Introduction251
14.2Conditional Harmonic Mixing255
14.3Learning in CHM Models256
14.4Incorporating Prior Knowledge261
14.5Learning the Conditionals261
14.6Model Averaging262
14.7Experiments263
14.8Conclusions273
IV.Change of Representation275
15.Graph Kernels by Spectral Transforms
Xiaojin Zhu, Jaz Kandola, John Lafferty and Zoubin Ghahramani
277
15.1The Graph Laplacian278
15.2Kernels by Spectral Transforms280
15.3Kernel Alignment281
15.4Optimizing Alignment Using QCQP for Semi-Supervised Learning282
15.5Semi-Supervised Kernels with Order Restraints283
15.6Experimental Results285
15.7Conclusion289
16.Spectral Methods for Dimensionality Reduction
Lawrence K. Saul, Kilian Weinberger, Fei Sha and Jihun Ham
293
16.1Introduction293
16.2Linear Methods295
16.3Graph-Based Methods297
16.4Kernel Methods303
16.5Discussion306
17.Modifying Distances
Alon Orlitsky and Sajama
309
17.1Introduction309
17.2Estimating DBD Metrics312
17.3Computing DBD Metrics321
17.4Semi-Supervised Learning Using Density-Based Metrics327
17.5Conclusions and Future Work329
V.Semi-Supervised Learning in Practice331
18.Large-Scale Algorithms
Olivier Delalleau, Yoshua Bengio and Nicolas Le Roux
333
18.1Introduction333
18.2Cost Approximations334
18.3Subset Selection337
18.4Discussion340
19.Semi-Supervised Protein Classification Using Cluster Kernels
Jason Weston, Christina Leslie, Eugene Ie and William Stafford Noble
343
19.1Introduction343
19.2Representation and Kernels for Protein Sequences345
19.3Semi-Supervised Kernels for Protein Sequences348
19.4Experiments352
19.5Discussion358
20.Prediction of Protein Function from Networks
Hyunjung Shin and Koji Tsuda
361
20.1Introduction361
20.2Graph-Based Semi-Supervised Learning364
20.3Combining Multiple Graphs366
20.4Experiments on Function Prediction of Proteins369
20.5Conclusion and Outlook374
21.Analysis of Benchmarks377
21.1The Benchmark377
21.2Application of SSL Methods383
21.3Results and Discussion390
VI.Perspectives395
22.An Augmented PAC Model for Semi-Supervised Learning
Maria-Florina Balcan and Avrim Blum
397
22.1Introduction398
22.2A Formal Framework400
22.3Sample Complexity Results403
22.4Algorithmic Results412
22.5Related Models and Discussion416
23.Metric-Based Approaches for Semi-Supervised Regression and Classification
Dale Schuurmans, Finnegan Southey, Dana Wilkinson and Yuhong Guo
421
23.1Introduction421
23.2Metric Structure of Supervised Learning423
23.3Model Selection426
23.4Regularization436
23.5Classification445
23.6Conclusion449
24.Transductive Inference and Semi-Supervised Learning
Vladimir Vapnik
453
24.1Problem Settings453
24.2Problem of Generalization in Inductive and Transductive Inference455
24.3Structure of the VC Bounds and Transductive Inference457
24.4The Symmetrization Lemma and Transductive Inference458
24.5Bounds for Transductive Inference459
24.6The Structural Risk Minimization Principle for Induction and Transduction460
24.7Combinatorics in Transductive Inference462
24.8Measures of Size of Equivalence Classes463
24.9Algorithms for Inductive and Transductive SVMs465
24.10Semi-Supervised Learning470
24.11Conclusion:
Transductive Inference and the New Problems of Inference
470
24.12Beyond Transduction: Selective Inference471
25.A Discussion of Semi-Supervised Learning and Transduction473
References479
Notation and Symbols499
Contributors503
Index
Online Index
509
 
Join an E-mail Alert List


 
 
TECHNOLOGY PARTNER: Azility, Inc. TERMS OF USE | PRIVACY POLICY | COPYRIGHT © 2009