a) GROUPING
No. audio vocal segments > No. text lines
b) PARITIONING
No. audio vocal segments < No. text lines
END GOAL
- FORCED ALIGNMENT
No. audio vocal segments = No. text lines
System Integration