Data Wrangling using Twitter API:

Project information

  • Link: Project description and code on Github.

In this project real Twitter messages are fetched using the Twitter API tweepy.

The focus is on data wrangling including rigorous assessment and cleaning: we check missing data/completeness, tidiness, and data specific cleaning needs. Each cleaning activity follows a three-step approach to define, code, and test.

Challenges: handle limitations fetching data using the Twitter API. Combining multiple datasets. And doing all the wrangling needed as typical for working with real-world data.