The Position Genentech, Inc. seeks a Principal Data Scientist at its South San Francisco, CA location. Duties: Apply statistical theory and methods to lead projects to design, develop, and program methods, processes, and systems to consolidate and analyze unstructured and diverse “big data” sources to generate actionable, persuasive information and insights and innovative solutions for client services and product enhancement. Use statistical and visualization techniques to inform feature engineering, model selection, and optimization of LLM-based applications. Design data pipelines to curate, preprocess, and structure datasets that improve LLM-based algorithms performance and reduce biases, with a focus on data quality and diversity. Perform thorough data exploration to understand dataset characteristics, uncover patterns, detect biases, and identify data quality issues. Lead research on scientific approach and utilize state-of-art methodologies to analyze complex datasets and interpret analysis of results. Develop and code software programs, algorithms, and automated processes to cleanse, integrate, and evaluate large datasets from multiple disparate sources. Provide methodical and implementation guidance as well as hands-on support and be accountable for the development and implementation of Data Science products. Collaborate with AI engineers, product owners, business analysts, and other developers in Agile teams to integrate LLMs into scalable, robust, fair, and ethical end-user applications, focusing on user experience, relevance, and real-time performance. Design, develop, customize, optimize, and fine-tune LLM-based and other AI-infused algorithms tailored to specific use cases such as text generation, summarization, information extraction, chatbots, AI agents, code generation, document analysis, sentiment analysis, and data analysis, among others. Evaluate the pros and cons of different approaches and Generative AI platforms with comprehensive quantitative and qualitative analysis. Collaborate within global Agile teams in the Informatics business and foundational domains to develop products that provide the highest value to both Pharma and Diagnostics business stakeholders. Serve as a technical expert and resource for the team and clients, contributing to building and cultivating a data-driven decision-making culture. Telecommuting permitted up to 3 days per week.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Principal
Education Level
Ph.D. or professional degree