Zien, A.: Semi-Supervised Support Vector Machines and Application to Spam Filtering. ECML Discovery Challenge Workshop (09/22/2006)
 
Link: http://www.ecmlpkdd2006.org/challenge.html
 
Abstract:

The semi-supervised support vector machine (also known as TSVM, for "transductive SVM") is introduced, and a few popular training strategies are briefly presented. The assumptions underlying semi-supervised learning are then reviewed. Finally, two modern TSVM optimization techniques are applied to the spam filtering data sets of the workshop; they are shown to achieve excellent results, provided that the non-i.i.d. nature of the data is handled properly.