Person using a mouse on a laptop computer
Back to All Events
John M. Olin Library, Instruction Room 3

Introduction to Text Analysis in Python Series

This four-session course will provide participants with an introduction to the quantitative analysis of textual data using Python. The course will cover the basics of representing text as numeric data (including text preprocessing, cleaning, stemming and tokenizing). We will then work through some of the more popular methods for text analysis, including topic modeling, word embeddings, and clustering.

This course is intended for graduate students, faculty and staff from any field at WashU who are interested in learning about quantitative text analysis and would like to become familiar with the main libraries and functions used to work with textual data in Python. 

Participants MUST have basic python skills to take this class. Participants who have taken TRIADS beginner Introduction to Python course are encouraged to continue their skills development through this course.

This class will be fully in-person, and participants will use their own laptops.

Dates of Introduction to Text Analysis in Python Series (all held in Olin Library, Instruction Room 3 from 11:30 am–1 pm):

  • Monday, November 3
  • Wednesday, November 5
  • Monday, November 10
  • Wednesday, November 12

DataLab Workshops 

DataLab is a collaboration between Data Services and TRIADS, Bernard Becker Medical Library, TechDen, and DI2 to provide a breadth of workshops from the basics of understanding data to working with data tools. These workshops are open to all WashU affiliates and are held in the fall and spring semesters.