Assignment 7
Learning Objectives
- scrape data from HTML through a toolkit
- specify search parameters through a URL
Data Files
None
Tasks
The objective of this assignment is to use a web scraping toolkit to import data from the web into R. After reviewing and working through the import.io user guide, pick a website to scrape.
- (40 Points) Set up an extractor for your website.
- (20 Points) Run the extractor and import the data into R (either via the API or by loading an exported file).
- (40 Points) Perform some calculation or text analysis of the extracted data in R.
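As a rough illustration of the import and analysis steps above, the following base R sketch builds a search URL from parameters, loads extractor output as a CSV, and runs a simple calculation and word count. Everything specific here is an assumption made for illustration: the `example.com` address, the `q` and `page` parameter names, and the `title` and `price` columns are hypothetical, and the inline CSV text stands in for a file you would export from your own extractor.

```r
# Build a search URL from named parameters.
# The base address and parameter names are hypothetical placeholders.
base_url <- "https://example.com/search"
params <- c(q = "used books", page = "2")
query <- paste0(
  names(params), "=",
  vapply(params, utils::URLencode, character(1), reserved = TRUE),
  collapse = "&"
)
search_url <- paste0(base_url, "?", query)

# Stand-in for an exported extractor file (hypothetical columns and values).
# With a real export you would call read.csv("your_file.csv") instead.
csv_text <- "title,price\nIntro to R,25.50\nData Mining,40.00\nWeb Scraping Basics,19.99\n"
extracted <- read.csv(text = csv_text, stringsAsFactors = FALSE)

# A simple calculation: the mean of a numeric column.
mean_price <- mean(extracted$price)

# A simple text analysis: word frequencies across the title column.
words <- tolower(unlist(strsplit(extracted$title, "\\s+")))
word_counts <- table(words)

print(search_url)
print(mean_price)
print(word_counts)
```

Your own submission should replace the placeholder URL and data with the website and extractor you chose, and the analysis should fit the data you actually collected.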
Deliverables & Submission Instructions
Submit a single zip file containing the .rmd and .md.html files plus a PDF that includes pictures, screenshots, data file extracts, charts, and anything else that shows your use of the toolkit.
Scoring
Total Number of Earnable Points: 100
Approximate Time to Complete: 2-3 hours
Due Date: see Calendar or Blackboard