Kickstart Your Career at Vertex!

Are you ready to make a real impact? At Vertex, our mission is to tackle serious diseases and to change lives, for the better, for the future. Our aim is to give you the skills, insights, and career guidance to be an important part of that future; to turn your potential into progression. As a Vertex intern or co-op, you’ll work on meaningful projects, collaborate with talented teams, and learn from industry leaders. We’re passionate about innovation, inclusion, and supporting your growth, inside and outside the lab.

Why Vertex?

Real Projects: You’ll work on assignments that make a real impact, not just busy work.
Mentorship & Networking: Connect with leaders and peers who want to see you succeed through professional networks, connections, and collaborations that will shape your longer-term career.
Flexible & Supportive: We offer flexible work options with Flex @ Vertex and prioritize your wellbeing.
Inclusive Culture: Collaboration and inclusion are embedded in everything we do.
Career Launchpad: Build skills, explore career paths, and get guidance for your future career.

Ready to apply? Submit your application and let’s turn possibilities into reality!

Your Impact

The Vertex Statistical Programming internship program is a multi-week experiential training program for students currently working towards an advanced degree in Statistics, Biostatistics, Data Science, Computer Science, Applied Mathematics, Biomedical Engineering, or a related field. If you are passionate, collaborative, and growth-minded, an internship at Vertex will help you gain meaningful experience in our Statistical Programming functional areas and serve as a launchpad for your career.

Important Notice Regarding Internship and Co-op Inquiries

At Vertex Pharmaceuticals, we are committed to providing a fair and structured recruitment process for all students interested in internship and co-op opportunities.
To ensure consistency and equity, all student applications must go through our Early Talent Acquisition Team. Due to the high volume of interest, we are unable to respond to individual solicitations. Direct solicitation of Vertex employees, including senior leaders, via email will result in removal from the recruiting process. We appreciate your enthusiasm and interest in Vertex. To be considered for internship or co-op roles, please apply directly through our official application channels (https://www.vrtx.com/careers/career-growth-and-opportunities/internships/). Thank you for respecting our process and helping us maintain a fair experience for all candidates.

What you will be doing:

We are seeking an intern to contribute to the development of an LLM-based agent designed to enhance the automation of clinical Table, Figure, and Listing (TFL) generation workflows. The primary responsibilities of the intern will include:

Parsing Clinical TFL Shell Documents: Extracting structured specifications such as titles, population definitions, variables, footnotes, and programming notes from RTF/DOCX files.
Extracting Structured Specifications: Transforming unstructured text into structured formats for downstream processing.
Interpreting User Prompts: Understanding and processing user inputs to guide the automation workflow.
Mapping Specifications to R Function/SAS Macro Libraries: Matching extracted specifications to appropriate R functions/SAS macros within an existing library.
Triggering TFL Generation Workflows: Automating the execution of TFL generation workflows.

The system will leverage a combination of prompt-driven reasoning, structured document parsing, retrieval-augmented generation (RAG), and tool/function calling to integrate LLM outputs with deterministic R/SAS code execution.

The intern will be responsible for delivering the following Technical Deliverables:

Shell-to-Function Matching Agent: Input: clinical TFL shell text and user prompts. Output: structured function calls to the R library.
Schema-Constrained LLM Output: Generate JSON mappings for table types, population definitions, variables, and grouping.
Validation Layer: Develop mechanisms to ensure LLM outputs align with predefined function signatures and constraints.
Prompt Optimization: Enhance prompt engineering to improve reliability and minimize hallucinations in LLM outputs.
System Integration: Integrate the LLM agent with the existing automation system to enable TFL generation workflows.

This role offers a unique opportunity to work at the intersection of statistical programming, machine learning, and clinical analytics. The successful candidate will gain hands-on experience in developing AI-driven automation tools that have a direct impact on the efficiency and accuracy of clinical reporting processes.

This role is not focused on:

Researching or training new LLMs.
Fine-tuning large foundation models.

Instead, this role emphasizes:

Building applied AI systems.
Developing production-oriented tools.
Designing hybrid workflows that combine deterministic and probabilistic methods.
Supporting statistical programmers in automating TFL generation.
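For candidates wondering what the Schema-Constrained LLM Output and Validation Layer deliverables might look like in practice, here is a minimal Python sketch. It is an illustration under stated assumptions, not Vertex's actual design: the function name demog_table, its parameters, the registry layout, and the JSON shape with "function" and "args" keys are all hypothetical. The idea is simply to check an LLM's JSON output against a known function signature before any R or SAS code is triggered.

```python
import json

# Hypothetical registry of library functions and their parameters.
# The name "demog_table" and its arguments are invented for illustration.
FUNCTION_SIGNATURES = {
    "demog_table": {
        "required": {"population": str, "variables": list},
        "optional": {"group_by": str},
    },
}


def validate_call(raw_json: str) -> list[str]:
    """Return a list of problems; an empty list means the call passes."""
    try:
        call = json.loads(raw_json)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(call, dict):
        return ["top-level value must be an object"]

    name = call.get("function")
    if not isinstance(name, str) or name not in FUNCTION_SIGNATURES:
        return [f"unknown function: {name!r}"]

    args = call.get("args", {})
    if not isinstance(args, dict):
        return ["'args' must be an object"]

    spec = FUNCTION_SIGNATURES[name]
    allowed = {**spec["required"], **spec["optional"]}
    problems = []
    # Every required argument must be present.
    for key in spec["required"]:
        if key not in args:
            problems.append(f"missing required argument: {key}")
    # Every supplied argument must be known and of the expected type.
    for key, value in args.items():
        if key not in allowed:
            problems.append(f"unexpected argument: {key}")
        elif not isinstance(value, allowed[key]):
            problems.append(
                f"wrong type for {key}: expected {allowed[key].__name__}"
            )
    return problems
```

In a design like this, a call that returns an empty list would be passed on to the deterministic code path, while a non-empty list could be fed back to the model as a correction prompt or surfaced to the statistical programmer.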
Job Type
Full-time
Career Level
Intern