Skip to main content

Currently Skimming:

7 Summarizing Day 1
Pages 73-80

The Chapter Skim interface presents what we've algorithmically identified as the most significant single chunk of text within every page in the chapter.
Select key terms on the right to highlight them within pages of the chapter.


From page 73...
... Census Bureau, but was actually a page set up by a Bureau employee in public GitHub space. Abowd added that they are trying to establish exactly what the rules are in GitHub, and then the U.S.
From page 74...
... Census Bureau but there are not nearly enough to go around and their expertise is put into very specific systems, so the system that Sienkiewicz talked about the day before got Vilhuber's expertise and that of two other contractors who understood how to build a metadata database and how to make an early version of a data link work. Abowd added that another piece of this low-hanging fruit is the curation of code bases for confidential data analyses.
From page 75...
... He is making sure this is not the case for the 2020 Census. Levenstein said the easy changes that she thinks of in terms of what the research community wants (which is true for most of the statistical agencies)
From page 76...
... Separate from the code that will produce census products and internal code that might analyze or produce working papers within the Bureau, there might be external researchers who use census data. Thus, there may be a way to extend that pipeline, Stodden said.
From page 77...
... She suggested creating a system of trusted data stewards who can participate in a crowdsourcing effort to improve administrative data. It may be necessary to think about how to build that in because otherwise there is too much data out there to handle within the federal statistical community.
From page 78...
... Abowd responded that political science journals would not publish papers written in the RDCs. Thompson asked if the need is for certification for publication, or if there really is a problem concerning the veracity of research published using census data.
From page 79...
... John Eltinge had two very brief comments: some of the earlier discussion advocated having this cost be part of risk management. One way to think about this in a resource constrained environment is to try to spell that out in greater depth and, if documented, it could mean that basic prudent management, as well as Abowd's point code curation and software engineering processes, are all part of customary practice in computer science.
From page 80...
... A number of business decisions were made to keep that informa­ ion tightly held, which led to t a code universe that was not best practice. Trading off what is best practice from a computer science perspective versus what is best practice from these business decisions that are made in a highly charged political envi­ onment, r is something that can be taken into consideration, even though most current U.S.


This material may be derived from roughly machine-read images, and so is provided only to facilitate research.
More information on Chapter Skim is available.