Back to all jobs

Staff Data Scientist, Machine Learning

Work from home Full-time role Hiring

About Us

Valo Health is a human-centric, AI-enabled biotechnology company working to make new drugs for patients faster. The company’s Opal Computational Platform transforms drug discovery and development through a unique combination of real-world data, AI, human translational models and predictive chemistry. Our talented team of biologists, chemists and engineers, armed with advanced AI/ML tools, work together to break down traditional R&D silos and accelerate the speed and scale of drug discovery and development. Valo is committed to hiring diverse talent, prioritizing growth and development, fostering an inclusive environment, and creating opportunities to bring together a group of different experiences, backgrounds, and voices to work together. We embrace new ways of learning, solve complex problems and welcome diverse perspectives that can help us advance patient-centric innovation. Valo is headquartered in Lexington, MA, with additional offices in New York, NY and Tel Aviv, Israel. To learn more, visit www.valohealth.com.

About the Role

As a Staff Data Scientist, Machine Learning, you will be a core member of a team of data scientists and engineers building a powerful computational platform for advancing the research and development of new medicines. As part of the Translational Platform Engineering team, you will help design, develop, and apply machine learning (ML) models, methods, and pipelines for scientific problems involving clinical and biomedical data. Successful candidates will work with a diverse set of data scientists, biological scientists, epidemiologists, and software engineers in ways that cut across traditional industry boundaries. What You’ll Do…

  • Propose, design, and develop ML approaches on high dimensional electronic health records and omics data leveraging Valo’s proprietary platform (data assets and data science packages).
  • Design, develop, and support ML pipelines, workbenches, and dashboards to enable users to solve scientific problems.
  • Develop well-designed, tested, and documented software packages.
  • Collaborate with cross-functional teams and stakeholders to derive user requirements, maintain alignment, and ensure the relevance and impact of models, analyses, and pipelines.
  • Be an active team member in code, design, and analysis review.

What You Bring...

  • Degree in a quantitative field with 7+ (BS), 5+ (MS), or 3+ (PhD) years of post-degree experience or equivalent
  • Broad experience in ML including supervised learning, unsupervised learning, dimensionality reduction, clustering, metrics, model selection, feature selection, and explainability (3+ years required).
  • Demonstrated experience with ML on electronic health records (2+ years required).
  • Proficient in Python (5+ years required) and experience with ML and data science packages (e.g., scikit-learn, statsmodels, scipy, MLlib).
  • Experience with MLops methodology such as workflow orchestration (e.g., Airflow, Prefect), experiment tracking (e.g., MLflow), containerization (e.g., Docker), and reproducible research (3+ years required).
  • Experience with collaborative software development using source control management (e.g., git, unit testing, code review, CI/CD) (3+ years required).
  • Experience with large-scale data analytics engines (e.g., Spark or Dask) and working in cloud environments (e.g., AWS) (2+ years required).
  • Experience with statistical methods such as hypothesis testing, longitudinal modeling, and time to event analysis.
  • Strong work ethic with a bias for execution and an ability to manage multiple priorities, ambiguity, and tight timelines. Ability to work effectively in teams or independently.
  • Experience with omics data is a plus.
  • Familiarity with the drug discovery and development process is a plus.

Remote Salary Range $175,000—$227,000 USD Apply tot his job

More remote roles to explore

Director of Data Engineering

Work from home Full-time role

Data Analyst III (Healthcare Analytics)

Work from home Full-time role

Data Scientist Senior - Compliance Analytics

Work from home Full-time role

Head of Analytics & Insights (Data & AI Corporate Vice President)

Work from home Full-time role

Product Manager II, Data Science - Enterprise Tools

Work from home Full-time role

Geospatial Data Scientist

Work from home Full-time role

Jr Data scientist with Tensorflow (Remote)

Work from home Full-time role

Data Science Consultant

Work from home Full-time role

Data Science Manager - Supply Chain & Manufacturing

Work from home Full-time role

Senior Data Analyst (Marketing Science)

Work from home Full-time role

Experienced Chat Support Associate – Remote Customer Service Representative for Dynamic Event and Ticketing Industry

Work from home Full-time role

Frontend Engineer (US, East Coast)

Work from home Full-time role

Experienced Remote Data Entry Specialist – Full-Time Opportunity for Detail-Oriented Individuals at blithequark

Work from home Full-time role

Experienced Virtual Assistant - Data Entry Specialist for blithequark in a Remote Work Environment

Work from home Full-time role

Experienced Technical Support Specialist, Mechatronics and Sustainable Packaging – Robotics and Automation Expert

Work from home Full-time role

Behavior Technician, Willing to Train

Work from home Full-time role

Project Coordinator, Custom Productions Unit

Work from home Full-time role

Sr. Product Manager - Tech, Amazon Customer Service

Work from home Full-time role

Senior Data Engineer (Denodo & BigQuery)

Work from home Full-time role

Mobile Phlebotomist - PRN

Work from home Full-time role