Staff Scientist (Informatics Algorithms Engineer), Division of Preclinical Innovation, Informatics Core
Description
NCATS, a major research component of NIH, seeks applications from outstanding candidates to fill a staff scientist (informatics algorithms engineer) position. The position is in the Informatics (IFX) Core in the center’s Division of Preclinical Innovation (DPI).
The IFX Core’s mission is to produce data-driven guidance and speed translation. The core uses innovative informatics methods and software and deep domain expertise in informatics and various translational research data types. The IFX Core has a team-science culture and collaborations that span DPI, extramural NCATS programs, other federal institutions, industry and biotechnology, and national and international academic institutions.
IFX efforts and contributions are broadly organized into five areas: (1) Translational Data Analytics; (2) Rare Diseases Informatics; (3) Building Standards, Knowledge Sources and Software; (4) Metabolomics and Multi-omics; and (5) Project Management and Outreach. These efforts are deeply connected: Outcomes from one category inform and are expanded into others. For more information, visit NCATS’ Informatics webpage.
Core Responsibilities
The staff scientist (informatics algorithms engineer) position will focus on Translational Data Analytics. The selected candidate will develop custom workflows and novel methods to handle translational research data and will deploy and maintain related tools internally and publicly.
The successful candidate will develop novel artificial intelligence (AI), deep learning (DL) and machine learning (ML) methods. They also will design a robust computational framework that allows access to and interpretation of translational research data. This framework will involve data parser and analytics development for NCATS-generated and public data, knowledge source and database (relational and graph) maintenance and development, API access, interactive visualization, and more. This framework will be largely publicly available to the broader research community.
The selected candidate will interact closely with the IFX Core Director, computational and noncomputational IFX Core and NCATS scientists (e.g., biologists, chemists, clinicians), and our academic collaborators. They will contribute to the strategic vision and mentoring of trainees and staff. The selected candidate will be encouraged to seek NCATS, NIH or external collaborations to further disseminate our data, methods and software. The candidate will help aggregate and clean in-house translational research data (e.g., high-throughput screens; multi-omic; knowledge sources on chemical, biological or disease annotations). They also will leverage these data in collaborative translational research projects. To support translational research efforts, the selected candidate will be part of a broader effort to standardize chemical, biological, disease and clinical annotations and promote the integration of these related lines of information.
NCATS expects the selected candidate to create and maintain detailed documentation on all work products to facilitate reuse and collaboration with IFX members, other researchers and DevOps. The candidate will contribute to and follow best practices in software and application development, including producing clean code and architecture. They also will be an owner/point of contact for the computational framework and its individual components. They generally will make the tools easier to maintain and make them more modular through appropriate containerization (e.g., Docker).
The selected candidate, with DPI staff, will help build a consolidated scientific computing environment, integrating libraries and software and ensuring they are interoperable. They also will apply or develop appropriate, state-of-the art methods to support large-scale data-acquisition efforts within DPI. The position requires the selected candidate to actively participate in IFX Core and NCATS committees and meetings and, as appropriate, showcase and collect feedback (e.g., user feedback) on the framework. Participation will include best practices for team development, such as design review and peer code review. The selected candidate will draft manuscripts and publish results in high-impact, peer-reviewed scientific journals and present results at internal and external scientific meetings.
Qualifications
The ideal candidate should possess a doctoral degree (M.D., Ph.D. or equivalent) in bioinformatics, computer science and engineering, or a relevant field and be a recognized expert. Applicants must have a broad range of informatics experience that includes at least some of the following:
- ML and/or DL (Bayesian inference, probabilistic programming, NLP, TensorFlow, etc.)
- Graph, NoSQL and/or RDBMS databases and semantic data standards (e.g., OWL, RDF)
- Scripting programming for high-throughput data processing and visualization (e.g., R, Python)
- Linux environments, high-computing servers, cloud computing
- Versioning systems (GitHub), full-stack software development
Applicants with knowledge in biology and/or chemistry and an understanding of data provenance (e.g., cell biology, methods for measuring multi-omic data, high throughput screens, biochemistry, genetics) are strongly preferred. Candidates should have experience in analyzing a broad range of translational research data types across different diseases and exploiting these data for therapeutic development. The selected candidate must have excellent interpersonal, verbal and written communication skills. They also must possess excellent analytical, organizational and time management skills. The desire and drive to seek out and master novel technologies and develop new ones is expected.
Salary/Benefits
The salary will be based on qualifications, experience and accomplishments. A full civil service benefits package is available. It includes retirement; health, life and long-term care insurance; and participation in a Thrift Savings Plan (401[k] equivalent).
How to Apply
Submit the following information to Ewy A. Mathé, Ph.D., at ewy.mathe@nih.gov: (1) a cover letter describing your career goals and interest in the position, including a research summary (one to two pages); (2) a current curriculum vitae with a complete bibliography; and (3) contact information for at least three references.
The review of applications will begin immediately and continue until the position is filled. Learn more about informatics research at NCATS.
Additional Information
A preappointment process (e.g., background investigation, verification of qualifications and job requirements, completion of onboarding forms, submission of required documents) may determine employment after an offer has been made and accepted.
At your supervisor’s discretion, you may be eligible for workplace flexibilities, which may include remote work or telework options and/or flexible work scheduling. These flexibilities may be requested in accordance with the NIH Workplace Flexibilities policy.