Scientific Lead, Applied Intelligence for Discovery (Remote)

Other Jobs To Apply

No other job posts for this day.

Scientific Lead, Applied Intelligence for DiscoveryLocation: US, San Francisco CATime Type: Full timeJob DescriptionAt Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world.The OpportunityWe are building something unprecedented, an AI foundation that will fundamentally change how drug discovery research is conducted.The Applied Intelligence for Discovery (AI4D) team is a newly formed group within Lilly Research Laboratories that operates at the intersection of scientific delivery and core platform development. AI4D’s mission is to connecting scientists to petabyte-scale data through natural language interfaces, automated analysis workflows, and intelligent search — and to convert early deployments into repeatable system standards and evaluation practices that scale across therapeutic areas.As a Generative AI Engineer, you will design, build, and operate the core AI systems that power this transformation: retrieval-augmented generation over internal scientific documents, text-to-SQL over complex omics databases, agentic workflows that automate multi-step analyses, and the evaluation infrastructure that able the next-generation of medicines for patients.Key ResponsibilitiesDesign, build, and optimize RAG pipelines over internal publications, study reports, electronic lab notebooks, and other scientific documentsBuild hybrid retrieval systems combining vector search with structured metadata, knowledge graphs, and ontology-aware filteringBuild and optimize text-to-SQL systems over Lilly’s databases, enabling scientists to query gene expression, proteomics, pathway, and variant data through natural languageDevelop schema documentation, semantic annotations, and gold-standard question/SQL pairs that bridge how scientists think about data and how it is storedImplement multi-step reasoning approaches (chain-of-thought, self-correction, Reflexion loops) to improve accuracy on complex scientific queriesDesign agentic AI workflows that chain database queries, bioinformatics tools, literature search, and visualization into automated multi-step scientific analysesEvaluate and integrate emerging orchestration frameworks (LangGraph, CrewAI, custom architectures) for scientific use casesBuild evaluation frameworks measuring accuracy, reliability, and scientific validity of AI outputsBasic QualificationsPhD in Computer Science, Data Science, or a related technical field with 0-3+ years of experience; or equivalent experience building production LLM systems; MS in Computer Science, Data Science, or a related technical field with 5+ years of experience; or equivalent experience building production LLM systemsAdditional Skills/PreferencesExperience building LLM-powered applications, including at least two of: RAG systems, text-to-SQL, agentic workflows, or fine-tuning pipelinesStrong software engineering skills in Python with experience building production-grade systemsDeep familiarity with the modern LLM ecosystem: embedding models, vector databases, and orchestration frameworksExperience designing evaluation frameworks for LLM systems — systematic approaches to measuring accuracy, detecting hallucinations, and tracking regressionsComfort working with complex, heterogeneous data — databases with hundreds of tables, specialized schemas, or domain-specific vocabulariesFamiliarity with cloud computing environments (AWS preferred), containerization (Docker), and CI/CD practicesExperience in pharmaceutical, biotech, or life sciences environmentsFamiliarity with biomedical data types (omics, clinical, molecular) or scientific databasesExperience with MLOps/LLMOps tooling: experiment tracking, model registries, prompt versioning, A/B testing for AI systemsKnowledge of biomedical ontologies (Gene Ontology, MeSH, ChEBI) or experience integrating domain-specific knowledge into LLM systemsExperience building for regulated environments where auditability, reproducibility, and explainability are requirementsLilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form ( for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.Lilly is proud to be an EEO Employer and does not discriminate on the basis of age, race, color, religion, gender identity, sex, gender expression, sexual orientation, genetic information, ancestry, national origin, protected veteran status, disability, or any other legally protected status.Our employee resource groups (ERGs) offer strong support networks for their members and are open to all employees. Our current groups include: Africa, Middle East, Central Asia Network, Black Employees at Lilly, Chinese Culture Network, Japanese International Leadership Network (JILN), Lilly India Network, Organization of Latinx at Lilly (OLA), PRIDE (LGBTQ+ Allies), Veterans Leadership Network (VLN), Women’s Initiative for Leading at Lilly (WILL), enAble (for people with disabilities). Learn more about all of our groups.Actual compensation will depend on a candidate’s education, experience, skills, and geographic location. The anticipated wage for this position is$166,500 - $266,200Full-time equivalent employees also will be eligible for a company bonus (depending, in part, on company and individual performance). In addition, Lilly offers a comprehensive benefit program to eligible employees, including eligibility to participate in a company-sponsored 401(k); pension; vacation benefits; eligibility for medical, dental, vision and prescription drug benefits; flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts); life insurance and death benefits; certain time off and leave of absence benefits; and well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities).Lilly reserves the right to amend, modify, or terminate its compensation and benefit programs in its sole discretion and Lilly’s compensation practices and guidelines will apply regarding the details of any promotion or transfer of Lilly employees.#WeAreLilly

Back to blog

Common Interview Questions And Answers

1. HOW DO YOU PLAN YOUR DAY?

This is what this question poses: When do you focus and start working seriously? What are the hours you work optimally? Are you a night owl? A morning bird? Remote teams can be made up of people working on different shifts and around the world, so you won't necessarily be stuck in the 9-5 schedule if it's not for you...

2. HOW DO YOU USE THE DIFFERENT COMMUNICATION TOOLS IN DIFFERENT SITUATIONS?

When you're working on a remote team, there's no way to chat in the hallway between meetings or catch up on the latest project during an office carpool. Therefore, virtual communication will be absolutely essential to get your work done...

3. WHAT IS "WORKING REMOTE" REALLY FOR YOU?

Many people want to work remotely because of the flexibility it allows. You can work anywhere and at any time of the day...

4. WHAT DO YOU NEED IN YOUR PHYSICAL WORKSPACE TO SUCCEED IN YOUR WORK?

With this question, companies are looking to see what equipment they may need to provide you with and to verify how aware you are of what remote working could mean for you physically and logistically...

5. HOW DO YOU PROCESS INFORMATION?

Several years ago, I was working in a team to plan a big event. My supervisor made us all work as a team before the big day. One of our activities has been to find out how each of us processes information...

6. HOW DO YOU MANAGE THE CALENDAR AND THE PROGRAM? WHICH APPLICATIONS / SYSTEM DO YOU USE?

Or you may receive even more specific questions, such as: What's on your calendar? Do you plan blocks of time to do certain types of work? Do you have an open calendar that everyone can see?...

7. HOW DO YOU ORGANIZE FILES, LINKS, AND TABS ON YOUR COMPUTER?

Just like your schedule, how you track files and other information is very important. After all, everything is digital!...

8. HOW TO PRIORITIZE WORK?

The day I watched Marie Forleo's film separating the important from the urgent, my life changed. Not all remote jobs start fast, but most of them are...

9. HOW DO YOU PREPARE FOR A MEETING AND PREPARE A MEETING? WHAT DO YOU SEE HAPPENING DURING THE MEETING?

Just as communication is essential when working remotely, so is organization. Because you won't have those opportunities in the elevator or a casual conversation in the lunchroom, you should take advantage of the little time you have in a video or phone conference...

10. HOW DO YOU USE TECHNOLOGY ON A DAILY BASIS, IN YOUR WORK AND FOR YOUR PLEASURE?

This is a great question because it shows your comfort level with technology, which is very important for a remote worker because you will be working with technology over time...