|
|
|
|
|
|
|
|
|
|
|
|
|
• |
Formal English
is not
|
|
|
SMS text
|
|
|
|
– |
Closer to
chatroom
|
|
|
language
|
|
|
• |
Most published
research
|
|
uses English text
|
|
|
|
– |
Lack of publicly
available
|
|
|
corpora
|
|
|
|
|
|
|
|
|
|
|
|
NUS SMS corpus
|
|
|
• |
Medium scale
(10K)
|
|
|
messages
|
|
|
• |
Demonstrates
breadth
|
|
|
and depth
|
|
|
• |
Corpus of
messages from
|
|
college students
|
|
|
|