The following principles stem from issues encountered during previous attempts to analyze and detect hate speech online. Successful solutions should focus on mitigating these issues.
- Minimize false-positives - Keyword spotting is not enough. Users tend to obscure and obfuscate words or phrases that may trigger automated filters and lead to many false positives.
- Plan for the future - Racial and minority insults should be continuously updated to reflect socio-cultural or political changes, stereotypes or slang.
- Account for complexity - Abusive statements may cross sentence boundaries; although the meaning of individual sentences in a comment may not be offensive or hateful, its overall meaning may be.
- Watch out for context - Detecting sarcasm requires knowledge of the context or even the individual emitters in an online context