ACI Federal is seeking a Health Data Scientist to join our vibrant team at the National Institutes of Health (NIH) supporting the National Institute of Environmental Health Sciences (NIEHS) located in Durham, NC.


    Overall Position Summary and Objectives

    The NIEHS is seeking an individual who can develop, test, and execute software and data pipelines to process, transform, integrate, and analyze climate- and health-related data sets. The work involves handling different data formats and standards, linking data sets that have different spatial and temporal dimensions, and working with data sets that may be large and complex. The individual will work closely with experts in data science, geospatial methods, statistics, and biomedical research who will help guide and inform the work.

    Deliverables:

    Work Details:

    • Prepare roadmaps, schedules, status reports, plans and recommendations. 1
    • Provide written technical descriptions of implemented algorithms; contribute to the publication of the lab’s software and research. 2
    • Work with staff on data retrieval from various data sources. 3
    • Implement improvements to development activities to speed development of applications, databases, or software tools. 4
    • Work with staff on applying, extending and/or developing statistical methods to address project-relevant problems identified by staff. 5
    • Provide the technical experience needed to assist in reviewing, updating, analyzing and modifying existing programming systems.
    • Use advanced knowledge in machine learning, statistics, text mining, natural language processing, computational semantics, computer vision, and data science to develop creative solutions to complex real-world problems.
    • Work with staff to develop, test and refine novel programming methods, toolkits, and algorithms.
    • Conceive of and create data processing pipelines that allow for the efficient movement, pre-processing and display of data.
    • Provide technical experience reviewing, analyzing, and modifying existing programming systems.
    • Conceive of and conduct approaches to process, integrate, and analyze geospatial-based exposures data sets and health data sets.
    • Write, test, debug and install new programs using various programming languages.
    • Develop scientific applications using R, Shiny and/or Python.
    • Collaborate with staff to develop, test, refine and apply advanced statistical and computational methods.
    • Test and maintain software products to ensure strong functionality and optimization.
    • Develop and schedule data backups, security patches, upgrades, etc.
    • Collaborate with staff to develop database applications and tools for supporting bioinformatics and scientific computing research projects.
    • Document all assignments and create reports as needed.
    • Write and maintain program documentation.
    • Document programming problems and resolutions for future reference.
    • Manage and troubleshoot deployments and image builds.
    • Develop new code and refine/troubleshoot existing code.
    • Troubleshoot scripts and programs to ensure they can be used successfully.
    • Document in-house software for training and reference purposes.
    • Assist with planning, building and maintaining applications to meet end users' needs.
    • Provide support for development of methods in data science.
    • Work with staff on literature reviews of analytic methods that have been applied across a broad range of scientific fields in order to identify the most useful methods to be applied.
    • Troubleshoot issues and problems and implement appropriate corrective actions.
    • Provide guidance and problem resolution for users.
    • Evaluate the impact of programming modifications.
    • Refine data and format the final product.

    The numbers 1 through 5 at the end of a bullet are priority rankings, where 1 is the highest priority and 5 is the lowest priority of those ranked.



    Minimum Education

    Bachelor's degree

    Additional Qualifications:

    Field of Study

    • Biology
    • Computer Science
    • Statistics and Decision Science

    Software

    • R
    • Python
    • Linux
    • Shiny
    • HTML


    Skills

    • Scientific data analysis
    • Data visualization
    • Data processing
    • Data integration
    • Machine learning

    Apply now!