6 Web-Scraping Effects
Pages 37-42

The Chapter Skim interface presents what we've algorithmically identified as the most significant single chunk of text within every page in the chapter.


From page 37...
... The first stage is to detect a hog disease outbreak by scraping disease report repository websites, such as the Swine Disease Global Surveillance Project (SDGSP),1 and ...
Footnote 1: SDGSP is a project sponsored by the University of Minnesota Swine Center to monitor hog disease outbreaks on an international scale.
From page 38...
... NLP extracts information from the related news using the four steps of information extraction: time normalization, word normalization, keyword identification, and named entity recognition. Wei provided an example of the first stage, identifying an outbreak of African swine fever in Vietnam.
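
The four extraction steps named above can be illustrated with a minimal sketch. It is not the pipeline described at the workshop: the keyword list, the place-name gazetteer, the function names, the sample sentence, and the publication date are all assumptions made up for illustration, and simple dictionary lookup stands in for a real named entity recognizer.

```python
import re
from datetime import date, timedelta

# Hypothetical keyword list and place gazetteer (illustrative assumptions only).
DISEASE_KEYWORDS = {"african swine fever", "outbreak", "hog", "swine"}
KNOWN_PLACES = {"vietnam", "china"}

ISO_DATE = r"\d{4}-\d{2}-\d{2}"


def normalize_time(text, published):
    """Step 1: replace relative time words with ISO dates based on the publication date."""
    relative = {"today": published, "yesterday": published - timedelta(days=1)}
    for word, day in relative.items():
        text = re.sub(rf"\b{word}\b", day.isoformat(), text, flags=re.IGNORECASE)
    return text


def normalize_words(text):
    """Step 2: lowercase the text and split it into tokens, keeping ISO dates whole."""
    return re.findall(rf"{ISO_DATE}|[a-z]+", text.lower())


def identify_keywords(tokens):
    """Step 3: report which (possibly multi-word) keywords occur in the token stream."""
    joined = " ".join(tokens)
    return sorted(k for k in DISEASE_KEYWORDS
                  if re.search(rf"\b{re.escape(k)}\b", joined))


def recognize_entities(tokens):
    """Step 4: a gazetteer lookup standing in for named entity recognition."""
    return sorted({t for t in tokens if t in KNOWN_PLACES})


def extract(text, published):
    """Run the four steps in order and collect the results."""
    tokens = normalize_words(normalize_time(text, published))
    return {
        "dates": [t for t in tokens if re.fullmatch(ISO_DATE, t)],
        "keywords": identify_keywords(tokens),
        "places": recognize_entities(tokens),
    }


if __name__ == "__main__":
    # Made-up sentence and publication date, used only to exercise the sketch.
    sample = "Officials confirmed an outbreak of African swine fever in Vietnam yesterday."
    print(extract(sample, date(2020, 1, 2)))
```

Run on the made-up sentence, the sketch resolves "yesterday" to 2020-01-01, finds the keywords "african swine fever", "outbreak", and "swine", and tags "vietnam" as a place. A production system would presumably replace the gazetteer with a trained entity recognizer and handle a far wider range of time expressions.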
From page 39...
... that will be potentially useful in spatial disease modeling and mapping, it provides information to understand the time course of the spread, and it provides external documentation confirming disease and response to the outbreak. It could provide information to the pre-board, the Agricultural Statistics Board, or other experts.
From page 40...
... Lee Schulz asked about the accuracy of news as a variable when it is always changing, being updated, and occasionally redacted. He wondered whether it could be used to construct a variable accurate enough for possible input to a model, referring to the discussion of the accuracy issues related to trade expectations.
From page 41...
... Wikle asked about the potential for others to manipulate this type of information, especially if NASS scrapes blogs and sites where people might report incorrect information once they know how it is being used. He asked about a mechanism for detecting false placement of key indicators.


This material may be derived from roughly machine-read images, and so is provided only to facilitate research.