In this paper, we have shown evidence that program analysis can improve source code categorization performance. We have proposed various feature sets, which fall broadly into two groups: context-free and context-sensitive features.
In particular, we have examined JavaScript categorization, a new, functionality-based categorization task that has not been attempted by others before. We have also developed tools for extracting features from JavaScript.