BA/app
Contents
Primary sources
Computer linguistics: CL intro
Genetic Algorithms: An introduction to genetic algorithms
Resources
Text classification
Natural language classification with Python:Book, especially learning to classify text
With machine learning: with tensorflow and generally nns: [1] Machine Learning, NLP: Text Classification using scikit-learn, python and NLTK Working with scikit and text data
Error Detection
error detection using local word bigram and trigram + some others
Somewhat similar problems being solved
Cross-cultural Deception Detection. It uses unigrams + LIWC (which is more psychological and less relevant)
Deception detection -- has examples of extracted features which I might use [2] -- lie detector Linguistic Cues to Deception Assessed by Computer Programs: A Meta-Analysis -- also ideas of possible features that might be interesting to look into.
Random
Natural Language Annotation for Machine Learning ebook, seems to cover quite a lot [http://lit.eecs.umich.edu/downloads.html#Cross-Cultural%20Deception downloads and demos -- incl datasets for CL lying detection -- generally interesting