13 Nov 2004
WIDM 04: Lee et al. Co-training Web Block Classification
9
PARCELS
•
PAR
ser for
C
ontent
E
xtraction &
L
ayout
S
tructure
•
•
Goals:
–
Coarse-grained classification
–
Fine-grained information extraction
–
Work on a variety of sources
–
Open-source, reference implementation
–