| Chicago
Sorry, this job was removed at 1:28 p.m. (CST) on Tuesday, July 21, 2020
Find out who's hiring in Chicago.
See all Data + Analytics jobs in Chicago
Apply now
By clicking continue you agree to Built In’s Privacy Policy and Terms of Use.



Cat Digital is the digital and technology arm of Caterpillar Inc., responsible for bringing world class digital capabilities to our products and services. With almost one million connected assets worldwide, we're focused on using IoT and other data, technology, advanced analytics and AI capabilities to help our customers build a better world. 


Cat Digital’s Advanced Data Quality team is looking for a talented and motived Principal Data Scientist to drive platform data quality improvements by developing and delivering ML/AI models to address the most challenging data quality issues.  As a Principal Data Scientist, you will apply machine learning and other analytics techniques on a very large set of diverse data from IoT connected assets and our integrated network of dealers.


JOB DUTIES: As a Principal Data Scientist, you will contribute to the design, development, deployment, and quality of Caterpillar’s state-of-the-art digital platform by leading the development of advanced Data Quality methods and routines.

  • Competent to perform all programming, project management, and development assignments without close supervision; normally assigned the more complex aspects of systems work.
  • Lead role in complex projects spanning across multiple system components.
  • Work in all phases of product creation process including creating technical requirements, project planning, identifying dependencies, system architecture and development.
  • Investigation and root cause analysis of software and system defects.
  • Focus on productivity, quality and competitiveness of major technology initiatives.
  • Apply knowledge and skills to solve most complex data engineering and quality problems.
  • Organize and drive configuration management activities of the development process.
  • Works directly on complex application/technical problem identification and resolution, including responding to off-shift and weekend support calls.
  • Works independently on complex systems or infrastructure components that may be used by one or more applications or systems.
  • Drives data pipeline development focused around delivering high quality data.
  • Mentor and assist software engineers, providing technical assistance and direction as needed.
  • Maintains high standards of software quality by establishing good practices and habits.
  • Identifies and encourage areas for growth and improvement.
  • Communicate with peer engineering teams to help direct development, debugging, and testing of data for accuracy, integrity, interoperability, and completeness.
  • Performs integrated testing and customer acceptance testing of components that requires careful planning and execution to ensure timely, quality results.







Basic qualifications

  • MS or PhD degree in quantitative discipline such as mathematics, statistics, data science, computer science, engineering
  • 7+ years of experience in designing and implementing data processing and machine learning frameworks
  • 7+ years of experience with Python, NoSQL and relational databases
  • 3+ years of experience as principal engineer
  • 3+ years of experience with AWS stack


Top candidates will also have:

  • Proven experience in most of the following:
    • Compiling and standardizing diverse, non-sanitized datasets.
    • Working with structured and unstructured data.
    • Developing classification and regression models.
    • Unsupervised learning algorithms.
    • Natural language processing.
    • Customized statistical algorithm development and deployment.
    • Experience integrating analytical models with existing data pipelines.
  • Proven experience with AWS full-stack development and services such as Athena, CloudFormation, DynamoDB, Fargate, EC2, EMR, Lambda, RDS, S3, SageMaker.
  • Thorough knowledge of statistical approaches, quantitative analytic methods, data management techniques, and/or related digital technologies, and the ability to handle complex issues.
  • Experience with dashboard development and design using data visualization tools such as Tableau, Power BI, Kibana
  • Experience in some of the following:
    • Designing, developing, deploying and maintaining software at scale.
    • Experience delivering productionized software solutions.
    • Deploying software using CI/CD tools such as Jenkins, GoCD, Azure DevOps etc.
    • Deploying and maintaining software using public clouds such as AWS
    • Developing software applications using relational and NoSQL databases.
    • Experience working within an Agile framework
  • Solid knowledge of computer science fundamentals like data structures and algorithms.
  • Exhibit strong initiative and teamwork skills, and a demonstrated track record of growing and learning through experience.
  • Demonstrate strong communication and presentation skills, with the ability to articulate conclusions to customers who have limited knowledge and experience with quantitative analytical methods. 
  • Ability to work under pressure and within time constraints.
  • Passion for technology and an eagerness to contribute to a team-oriented environment.
  • Challenges include meeting expectations in delivering results, learning to refine solutions to better fit complex situations, making timely decisions, and communicating effectively with all project stakeholders. 
Read Full Job Description
Apply now
By clicking continue you agree to Built In’s Privacy Policy and Terms of Use.

Technology we use

  • Engineering
  • Sales & Marketing
    • JavaLanguages
    • JavascriptLanguages
    • PythonLanguages
    • RLanguages
    • ScalaLanguages
    • SqlLanguages
    • ReactLibraries
    • ReduxLibraries
    • AngularJSFrameworks
    • Backbone.jsFrameworks
    • Ember.jsFrameworks
    • HadoopFrameworks
    • Node.jsFrameworks
    • Ruby on RailsFrameworks
    • SparkFrameworks
    • SpringFrameworks
    • Amazon Web ServicesFrameworks
    • AWS ElasticSearchFrameworks
    • AWS Code PipelineFrameworks
    • DockerFrameworks
    • ApigeeFrameworks
    • FlinkFrameworks
    • AkkaFrameworks
    • Amazon ECSFrameworks
    • MySQLDatabases
    • OracleDatabases
    • PostgreSQLDatabases
    • DynamoDBDatabases
    • RDSDatabases
    • SalesforceCRM


In the heart of Chicago's lively West Loop area, we have easy access to public transport, great bars and restaurants, and an awesome office roof deck.
Caterpillar Does Digital: The Machine Learning Behind the Machines

An Insider's view of Cat Digital

What are some social events your company does?

Whether we’re working virtually or in-person, we are always looking for ways to have fun and grow as a team. Team dinners, coffee chats, ax throwing, chess club, and virtual happy hours are just a few of the activities we do to make work more fun and connect with colleagues around the world.



What kinds of technical challenges do you and your team face?

It’s amazing to be able to work in an architectural framework where we can negotiate between speed to market and a solid application – software that is well-built, well-designed, well-tested. I find this negotiation both challenging and exhilarating.


Lead Software Engineer

How does the company support your career growth?

I’ve been with Caterpillar for 20 years now, and I’ve been lucky to work on teams that have different focuses. I’ve worked on everything from engineering applications to the latest and greatest digital technology applications.


Digital Product Manager

How do you make yourself accessible to the rest of the team?

The team should be comfortable approaching me with any kind of issue — like improving a process, getting rid of unnecessary ceremonies or something else — and know that I will address it. I believe a manager should be the first line of defense against bugs and conflicting priorities, and my team needs to know that I have their back.


Software Engineering Manager

What projects are you most excited about?

Deep learning algorithms, popularized in the past five years, allow us to scan huge volumes of data from Caterpillar's fleet of connected engines and machines for unusual patterns. We're now able to make sophisticated predictions that wouldn’t have been possible 20 years ago.


Analytics Director

What are Cat Digital Perks + Benefits

Volunteer in local community
Caterpillar Inc. participates in local volunteer activities such as the Chase Corporate Challenge
Partners with Nonprofits
Open door policy
Team owned deliverables
Team based strategic planning
Open office floor plan
Documented equal pay policy
Unconscious bias training
Someone's primary function is managing the company’s diversity and inclusion initiatives
Health Insurance & Wellness Benefits
Flexible Spending Account (FSA)
Disability Insurance
Dental Benefits
Vision Benefits
Health Insurance Benefits
Life Insurance
Pet Insurance
Wellness Programs
Onsite Gym
Retirement & Stock Options Benefits
401(K) Matching
Company Equity
Performance Bonus
Match charitable contributions
Child Care & Parental Leave Benefits
Generous Parental Leave
Flexible Work Schedule
Remote Work Program
Family Medical Leave
Adoption Assistance
Vacation & Time Off Benefits
Generous PTO
Paid Volunteer Time
Paid Holidays
Paid Sick Days
Perks & Discounts
Casual Dress
Game Room
Recreational Clubs
Professional Development Benefits
Job Training & Conferences
Tuition Reimbursement
Diversity Program
Lunch and learns
Cross functional training encouraged
Promote from within
Time allotted for learning
Online course subscriptions available
Customized development tracks
More Jobs at Cat Digital12 open jobs
All Jobs
Data + Analytics
Dev + Engineer
Data + Analytics
Data + Analytics
Data + Analytics
Data + Analytics
Data + Analytics
Data + Analytics
Apply now
By clicking continue you agree to Built In’s Privacy Policy and Terms of Use.
Save jobView Cat Digital's full profileSee more Cat Digital jobs