Senior Applied AI Data Scientist

Flatiron HealthNew York, NY
16h

About The Position

We’re looking for a Senior Data Scientist to help us accomplish our mission to improve and extend lives by learning from the experience of every person with cancer. Are you ready to be the next changemaker in cancer care? Flatiron Health is a healthtech company using data for good to power smarter care for every person with cancer, around the world. Flatiron partners with cancer centers in the US, Europe and Asia to transform patients’ real-life experiences into real-world evidence and create a more modern, connected oncology ecosystem. Our multidisciplinary teams include oncologists, data scientists, data analysts, software engineers, research scientists, product experts and more. Flatiron Health is an independent affiliate of the Roche Group. What You'll Do At Flatiron, we’re advancing the use of machine learning and generative AI to extract clinically relevant information from unstructured medical notes to create de-identified oncology research datasets. The Discovery team is helping to build these next generation research data products, developing and applying ML & LLMs to capture a complete picture of the patient journey. The Discovery has team members spanning many different fields, from ML engineers and data scientists, to product management and oncologists. As part of our team, you will apply existing internal and off-the-shelf external AI systems and validate AI generated data sets that are used by clinicians and researchers to evolve cancer research, generate clinical insights, and learn from the experience of millions of people living with cancer. Engaging with a cross-functional group of stakeholders both within Discovery and across the company, you will contribute to the build out of our data sets from scoping through to validation, productionization and delivery. In addition, you'll also: Work with our clinical stakeholders to apply existing AI systems to turn raw clinical data into high quality research data Become a subject matter expert on our data and its capabilities and collaborate closely across the team to understand data needs and provide analytical support that enhances model development and deployment. Work with research scientists and oncologists to validate that our team’s models can be used to generate sound scientific insights, including full dataset performance analyses Work closely with subject matter experts & ML researchers to define requirements for training and evaluation datasets, and maintain software pipelines for the generation of these sets. Provide analytic support and create custom data outputs for cross-functional teams such as our team of clinical experts. Interface with internal scientific & clinical stakeholders to understand what data they need to conduct high quality research. Work cross-functionally with software engineers to productionize, scale, and monitor our team’s models. Who You Are You're a product-focused senior data scientist, with creative analytical problem-solving skills ready to tackle the problems of measuring the performance of complex datasets & the systems that build them. You’re excited to learn about oncology from our clinical stakeholders and work with them to apply AI to extract nuanced clinical concepts from the medical record and validate the fitness-for-use of that data for oncology research. You’re a kind, passionate and collaborative problem-solver who seeks and gives candid feedback, and values the chance to make an important impact.

Requirements

  • You have 5+ years of relevant working experience as an applied data scientist or similar technical data-oriented role, including relevant applied work in a graduate program. Some prior experience with ML or LLMs is preferred.
  • You understand how machine learning and AI systems are measured and can analyze an existing system to understand the quality of its output, assess where improvements are needed, and communicate the impact of those improvements to stakeholders
  • You have collaborated with other technical team members in a production development environment using formal version control, Python (including data manipulation in pandas, polars or a similar framework), and SQL.
  • You’re impact-oriented, and care deeply about creating real change for customers, users, and ultimately patients. You choose the right, rather than the flashiest, method available to reach your goals.
  • You are a clear and confident communicator who can break down complex data analyses to tell a compelling story.
  • You have led cross-functional initiatives and excel at influencing decision-making without authority.

Nice To Haves

  • You have experience working with data in a healthcare setting.
  • You have experience with the risks of bias in machine learning, health equity research/analysis or have worked with underrepresented groups in a clinical research setting.
  • You have experience working in dbt or other ETL frameworks
  • You have experience with deep learning and traditional NLP methods.

Responsibilities

  • Work with our clinical stakeholders to apply existing AI systems to turn raw clinical data into high quality research data
  • Become a subject matter expert on our data and its capabilities and collaborate closely across the team to understand data needs and provide analytical support that enhances model development and deployment.
  • Work with research scientists and oncologists to validate that our team’s models can be used to generate sound scientific insights, including full dataset performance analyses
  • Work closely with subject matter experts & ML researchers to define requirements for training and evaluation datasets, and maintain software pipelines for the generation of these sets.
  • Provide analytic support and create custom data outputs for cross-functional teams such as our team of clinical experts.
  • Interface with internal scientific & clinical stakeholders to understand what data they need to conduct high quality research.
  • Work cross-functionally with software engineers to productionize, scale, and monitor our team’s models.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service