Senior Data Scientist, NLP at Ascent
Who We Are
Ascent is a Chicago-based startup that closed its Series B funding round of $19.3M in Fall 2019. Founded only four years ago, the team has since grown to almost 50 full time employees, led and supported by an executive team and board of directors with extensive experience in technology, regulatory compliance, sales and business operations. Ascent serves global financial institutions such as ING and CommBank, and is also working directly with regulators around the world in order to continually improve and advance our product.
A first mover in building proprietary RegulationAI™, Ascent delivers market leading regulatory knowledge as a groundbreaking new way for financial firms to manage compliance. It is our mission to help our customers protect their business from regulatory and reputational risk while reducing their overall cost to comply.
We prioritize diversity, equity, and inclusion and believe strongly that a team with different backgrounds and perspectives produces better results. Together, we are solving a $64 billion global problem in regulatory compliance. Watch this video to learn more about what we do.
Who You Are
We are building an intelligent compliance platform that enables compliance professionals to easily track and understand their compliance obligations and related regulation. To support that platform, we are also building a full data science platform and team to help improve our efficiency at processing regulations and unlock new features using fundamentally creative approaches.
We are looking for experienced, passionate data scientists to help us build and maintain models that help solve a wide variety of problems.
As a Senior Data Scientist working on NLP at Ascent, your day-to-day work will include working closely with business users, our regulation content team, and the rest of our data scientists and engineers to: a) use state-of-the-art NLP to automate significant sections of the workflow involved in onboarding and managing regulations; b) uncover novel features and information from text to enable new product features (e.g. finding related groups of regulation across regulators); c) deploy, monitor, and maintain models to actually solve problems in production; and d) experiment with creative solutions to problems that use new research and tools and disseminate new knowledge to the team. We primarily use the Python data ecosystem, including both scikit-learn and Keras+Tensorflow, but we are open to all tools. We use all kinds of models, including deep learning and non-deep learning; we prefer to use the simplest tool that accomplishes the goal.
- Work with non-technical colleagues to design and build machine learning models that accomplish specific tasks, with a heavy focus on NLP – splitting, summarization, classification, entity recognition, similarity scoring, recommendation, etc
- Cross validate and test models prior to deployment, and monitor model accuracy in production
- Employ production-quality coding standards / best-practices during model training, prototyping, and deployment
- Help educate others in the company about machine learning and data science so that they can think productively about possible solutions to their business problems
- Stay current on machine learning research and tools
- Prioritize minimum viable models / solutions over complicated models / solutions
- Think about NLP fundamentals and use your intuition to apply cutting-edge NLP research to our most difficult problems
- Mentor less experienced data scientists
Minimum Skills and Experience
- 4+ years solving business problems using data science
- 2+ years building NLP models that were used to transform text in production pipelines
- 1+ year using sequential deep learning models or similar on text problems
- Experience with both supervised and unsupervised learning approaches
- A thorough understanding of the mathematical underpinnings of common models
- Experience developing creative modeling solutions given a set of business requirements and delivering that value to the business
- Ability to work productively on small teams and manage workstreams independently if needed
- Proficient in SQL, *nix CLI tools (grep/sed/awk/BASH, etc), and Python
- Experience deploying and maintaining code using git-based tools and operating in a continuous deployment/integration environment
- Experience writing thorough tests and documentation for maintainable code-bases
Preferred Skills and Experience
- 3+ years building NLP models that were used to transform text in production pipelines
- 2+ years of experience using NLP to drive process automation
- Uncommonly strong statistical and mathematical background
- Educational background in Computational Linguistics or similar
Ascent employees enjoy many benefits and perks, including:
- Competitive compensation
- Medical, dental, and vision insurance; premiums paid 95% for the individual
- Medical premiums paid 50% for covered dependents
- Life insurance
- Commuter benefits
- Unlimited PTO
- Professional development stipend
- The opportunity to work with smart people on challenging problems!