About Applied Text Analysis in Python course
This course introduces the basics of text mining and text manipulation. The course starts with an understanding of how text is processed by python, the structure of text for both machines and humans, and an overview of the nltk framework for text manipulation. The second week covers the general needs of text manipulation, including regular expressions (text searching), text cleaning, and preparing text for use in machine learning processes. In the third week, we apply basic natural language processing techniques to text and demonstrate how text classification is performed. The final week covers more advanced methods for discovering topics in documents and grouping them by similarity (topic modeling).
This course should be taken after: Introduction to Data Science in Python, Applied Plotting, Graphing, and Presenting Data in Python, and Applied Machine Learning in Python.