Quantitative Text Analysis: Supervised Methods and Text Classification (few places remaining)

Date:

28/05/2013

Organised by:

London School of Economics and Political Science

Presenter:

Professor Kenneth Benoit

Level:

Intermediate (some prior knowledge)

Contact:

John Fyson j.a.fyson@lse.ac.uk

Location:

View in Google Maps  (WC2A 2AE)

Venue:

London School of Economics and Political Science, Houghton Street, London

Description:

This topic introduces methods for placing documents on continuous dimensions or ‘scales’, using a semi-supervised approach method known as Wordscores. This algorithmic applies a probability model of words given texts that can be used to estimate their characteristics along a latent dimension. We then generalize this method to the Naïve Bayes classifier, a method that permits the automatic classification of texts in a test set following machine learning from a training set.

Readings:

  • Manning, Raghavan, and Schütze (2009, Chapter 13)â�¨Statsoft, “Naive Bayes Classifier Introductory Overview,” http://www.statsoft.com/textbook/ naive-bayes-classifier/.â�¨
  • Bionicspirit.com, 9 Feb 2012, “How to Build a Naive Bayes Classifier,” http://bionicspirit. com/blog/2012/02/09/howto-build-naive-bayes-classifier.html.
  • An online article by Paul Graham on classifying spam e-mail. http://www.paulgraham.com/spam.html 

Cost:

Free

Website and registration:

Register for this course

Region:

Greater London

Keywords:

Quantitative Data Handling and Data Analysis, Quantitative Software, Quantitative Text Analysis , Text Analysis , Supervised Methods , Text Classification


Related publications and presentations from our eprints archive:

Quantitative Data Handling and Data Analysis
Quantitative Software

Back to the training database