Empowering the Defense Acquisition Workforce to Improve Mission Outcomes Using Data Science (2021)

Chapter: Appendix D: Skills for Data Science Mastery

Suggested Citation:"Appendix D: Skills for Data Science Mastery." National Academies of Sciences, Engineering, and Medicine. 2021. Empowering the Defense Acquisition Workforce to Improve Mission Outcomes Using Data Science. Washington, DC: The National Academies Press. doi: 10.17226/25979.

D

Skills for Data Science Mastery

The National Academies of Sciences, Engineering, and Medicine 2018 report Data Science for Undergraduates: Opportunities and Options (The National Academies Press, Washington, DC) emphasized that a critical task in the education of future data scientists is to instill basic data acumen. This requires exposure to key concepts in data science, real-world data and problems that can reinforce the limitations of tools, and ethical considerations that permeate many applications. The following are key concepts involved in developing basic data science acumen.

  • Mathematical foundations. Key mathematical concepts/skills that would be important for all students in their data science programs and critical for their success in the workforce are the following:
    • Set theory and basic logic,
    • Multivariate thinking via functions and graphical displays,
    • Basic probability theory and randomness,
    • Matrices and basic linear algebra,
    • Networks and graph theory, and
    • Optimization.

    Some data scientists and programs require a deeper understanding of mathematical underpinnings. This might include the following:
    • Partial derivatives (to understand interactions in a model),
    • Advanced linear algebra (i.e., properties of matrices, eigenvalues, decompositions),
    • “Big O” notation and analysis of algorithms, and
    • Numerical methods (e.g., approximation and interpolation).
  • Computational foundations. While it would be ideal for all data scientists to have extensive coursework in computer science, new pathways may be needed to establish appropriate depth in algorithmic thinking and abstraction in a streamlined manner. This might include the following:
    • Basic abstractions,
    • Algorithmic thinking,
    • Programming concepts,
    • Data structures, and
    • Simulations.
  • Statistical foundations. Important statistical foundations might include the following:
    • Variability, uncertainty, sampling error, and inference;
    • Multivariate thinking;
    • Non-sampling error, design, experiments (e.g., A/B testing), biases, confounding, and causal inference;
    • Exploratory data analysis;
    • Statistical modeling and model assessment; and
    • Simulations and experiments.
  • Data management and curation. Key data management and curation concepts/skills that would be important for all students in their data science programs and critical for their success in the workforce are the following:
    • Data provenance;
    • Data preparation, especially data cleansing and data transformation;
    • Data management (of a variety of data types);
    • Record retention policies;
    • Data subject privacy;
    • Missing and conflicting data; and
    • Modern databases.
  • Data description and visualization. Key data description and visualization concepts/skills that would be important for all students in their data science programs and critical for their success in the workforce are the following:
    • Data consistency checking,
    • Exploratory data analysis,
    • Grammar of graphics,
    • Attractive and sound static and dynamic visualizations, and
    • Dashboards.
  • Data modeling and assessment. Key data modeling and assessment concepts/skills that would be important for all students in their data science programs and critical for their success in the workforce are the following:
    • Machine learning (e.g., supervised, unsupervised, and deep learning),
    • Multivariate modeling and supervised learning,
    • Dimension reduction techniques and unsupervised learning,
    • Deep learning,
    • Model assessment and sensitivity analysis, and
    • Model interpretation (particularly for “black box” models).
  • Workflow and reproducibility. Key workflow and reproducibility concepts/skills that would be important for all students in their data science programs and critical for their success in the workforce are the following:
    • Workflows and workflow systems,
    • Documentation and code standards,
    • Source code (version) control systems,
    • Reproducible analysis, and
    • Collaboration.
  • Communication and teamwork. Key communication and teamwork concepts/skills that would be important for all students in their data science programs and critical for their success in the workforce are the following:
    • Ability to understand client needs,
    • Clear and comprehensive reporting,
    • Conflict resolution skills,
    • Well-structured technical writing without jargon, and
    • Effective presentation skills.
  • Domain-specific considerations. Effective application of data science to a domain requires knowledge of that domain. Grounding data science instruction in substantive contextual examples (which will require the development of judgment and background in those areas) will help ensure that data scientists develop the capacity to pose and answer questions with data. Reinforcing skills and capacities developed in data science courses in the context of a specific domain will help students see the entire data science process.

  • Ethical problem solving. Key aspects of ethics needed for all data scientists (and for that matter, all educated citizens) include the following:
    • Ethical precepts for data science and codes of conduct,
    • Privacy and confidentiality (both in the spirit and letter of the law),
    • Responsible conduct of research (e.g., human subjects),
    • Ability to identify “junk” science, and
    • Ability to detect algorithmic and human bias.
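
As an illustration of how several of the statistical and computational concepts listed above fit together (sampling variability, experiments such as A/B testing, and simulation), the following is a minimal sketch rather than a prescribed method. It is not taken from the report: the function name permutation_test, the conversion rates, and the sample sizes are all hypothetical, and the data are simulated. It estimates by random shuffling how often a difference in group means at least as large as the observed one would arise if group labels did not matter.

```python
import random


def permutation_test(group_a, group_b, n_permutations=2000, seed=0):
    """Estimate a two-sided p-value for a difference in means by simulation.

    Hypothetical helper for illustration only; not from the report.
    """
    rng = random.Random(seed)
    observed = sum(group_b) / len(group_b) - sum(group_a) / len(group_a)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    n_b = len(pooled) - n_a
    extreme = 0
    for _ in range(n_permutations):
        # Shuffle the pooled outcomes, then split them into two
        # pseudo-groups of the original sizes.
        rng.shuffle(pooled)
        diff = sum(pooled[n_a:]) / n_b - sum(pooled[:n_a]) / n_a
        if abs(diff) >= abs(observed):
            extreme += 1
    return observed, extreme / n_permutations


# Simulated (assumed) A/B data: 1 = success, 0 = no success.
data_rng = random.Random(42)
control = [1 if data_rng.random() < 0.10 else 0 for _ in range(500)]
variant = [1 if data_rng.random() < 0.16 else 0 for _ in range(500)]

observed_diff, p_value = permutation_test(control, variant)
print(f"observed difference in rates: {observed_diff:.3f}")
print(f"estimated two-sided p-value:  {p_value:.3f}")
```

A small p-value here suggests only that the observed difference would be unusual under random relabeling. Whether the difference matters in practice, and whether the experiment was designed without bias or confounding, are separate questions of the kind raised in the statistical foundations and ethics items above.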

The effective use of data science - the science and technology of extracting value from data - improves, enhances, and strengthens acquisition decision-making and outcomes. Using data science to support decision making is not new to the defense acquisition community; its use by the acquisition workforce has enabled acquisition and thus defense successes for decades. Still, more consistent and expanded application of data science will continue improving acquisition outcomes, and doing so requires coordinated efforts across the defense acquisition system and its related communities and stakeholders. Central to that effort is the development, growth, and sustainment of data science capabilities across the acquisition workforce.

At the request of the Under Secretary of Defense for Acquisition and Sustainment, Empowering the Defense Acquisition Workforce to Improve Mission Outcomes Using Data Science assesses how data science can improve acquisition processes and develops a framework for training and educating the defense acquisition workforce to better exploit the application of data science. This report identifies opportunities where data science can improve acquisition processes, the relevant data science skills and capabilities necessary for the acquisition workforce, and relevant models of data science training and education.
