Architecture
L
1
training
data
L
1
learner
L
1
model
training
data
L
2
training
data
L
2
learner
L
2
model
unlabeled
data
•
B&M co-training handles only binary classification
•
Handles distribution skewing