%0 Book %A National Academies of Sciences, Engineering, and Medicine %T Refining the Concept of Scientific Inference When Working with Big Data: Proceedings of a Workshop—in Brief %D 2016 %U https://nap.nationalacademies.org/catalog/23616/refining-the-concept-of-scientific-inference-when-working-with-big-data %> https://nap.nationalacademies.org/catalog/23616/refining-the-concept-of-scientific-inference-when-working-with-big-data %I The National Academies Press %C Washington, DC %G English %K Math, Chemistry, and Physics %K Engineering and Technology %K Policy for Science and Technology %K Surveys and Statistics %P 4 %R doi:10.17226/23616 %X Big Data – broadly considered as datasets whose size, complexity, and heterogeneity preclude conventional approaches to storage and analysis – continues to generate interest across many scientific domains in both the public and private sectors. However, analyses of large heterogeneous datasets can suffer from unidentified bias, misleading correlations, and increased risk of false positives. In order for the proliferation of data to produce new scientific discoveries, it is essential that the statistical models used for analysis support reliable, reproducible inference. The National Academies of Sciences, Engineering, and Medicine convened a workshop to discuss how scientific inference should be applied when working with large, complex datasets.