Data Scientist, Machine Learning
Passionate about making a difference in the world of cancer genomics?
With the advent of genomic sequencing, we can finally measure and process our genetic makeup. We now have more data than ever before but providers often don't have the infrastructure or expertise required to easily extract the valuable insights that exist in said data. Here at Tempus, we believe the greatest promise for the detection and treatment of cancer lies in building a deep understanding of the interaction between molecular activity and clinical treatment, through the discovery of response patterns and unique biomarkers.
We're on a mission to connect an entire ecosystem to redefine how genomic data is used in clinical settings. We are looking for data scientists who are passionate about developing and applying state of the art techniques to processing and analyzing vast amounts of clinical, genomic, and molecular data. Data scientists will collaborate with product, research, and business development teams to build the most advanced data platform in cancer care.
What You'll Do
- Design and prototype novel data visualization and analysis tools and algorithms
- Wrangle and analyze large diverse sparse datasets, extract insights, and drive further research opportunities
- Interrogate analytical results for robustness and validity, and out of sample stability
- Document, summarize, and present research findings to a group of peers and stakeholders
Qualifications
- Degree in computer science, software engineering, statistics, machine learning, bioinformatics or related technical field
- Experience building and validating predictive models on structured or unstructured data
- Proficient in Python, SQL
- Experience working in a Linux / Mac environment
- Experience with the following: Pandas, NumPy, SciPy, Scikit-learn, Jupyter Notebooks
- Experience with supervised and unsupervised machine learning algorithms, and ensemble methods, such as: K-Means, PCA, Regression, Neural Networks, Decision Trees, Gradient Boosting
- Outstanding programming and problem solving skills
- Self-driven and work well in an interdisciplinary team with minimal direction
- A strong desire to understand why things work the way they do
- Thrive in a fast-paced environment and willing to shift priorities seamlessly
- Experience with communicating insights and presenting concepts to a diverse audience of engineers, clinicians, laboratory scientists and business development professionals
Nice to Haves
- Kaggle.com competitions and/or kernels track record
- Experience with AWS architecture
- Experience working with clinical and/or genomic data
- Experience with: Git, matplotlib, seaborn, HTML5, CSS3, JavaScript, D3, Plot.ly, Flask
- Experience in agile environments and comfort with quick iterations
- Experience in Big Data technologies such as Spark, Hadoop