Sep 17, 2024  
2024-2025 Undergraduate Catalog 
    
2024-2025 Undergraduate Catalog
Add to Portfolio (opens a new window)

DSCI 330 - Management of Unstructured Data


(3 credits)

This course will give students an overview of the issues related to the management of unstructured data, i.e. data not stored in a table or database. Sources of unstructured data include bodies of text, social media applications, images, and audio. In the process of exploring various forms of unstructured data, students will be exposed to new programming languages and tools that are useful for managing this type of data (e.g. Python and bash). Other topics covered in the course include web scraping, natural language processing, and manipulating images/videos/audio files. Prerequisites: DSCI 326 - Data Science at Scale  and CS 250 - Algorithms and Problem-Solving II , or instructor permission. Grade or P/NC. Offered alternate years.


Course Registration



Add to Portfolio (opens a new window)