Data Engineer, Python at GoHealth
GoHealth is looking for Data Engineers who will be responsible for the design, development, and delivery of data transformation tasks used in transforming data into a format that can be easily analyzed. We are seeking candidates who have experience in data analysis, collection, and optimization for the purpose of informing business decisions. The Data Engineer will work with other team members in owning data pipelines including execution, documentation, maintenance, and metadata management. In this role, you will also support the development of the data infrastructure necessary for full scale data science, predictive analytics and machine learning.
- Design, develop and deploy optimal extraction, transformation, and loading of data from various GoHealth and external data sources.
- Monitor, execute and report on all data pipeline tasks while working with appropriate teams to take corrective action quickly, in case of issues.
- Perform unit testing, system integration testing and assist with user acceptance testing.
- Adapt data components to accommodate changes in source data and new business requirements.
- Create and maintain documentation of the technical detail design, operational support and maintenance procedures for all data pipeline tasks.
- Ensure data quality and compliance with development, architecture, reporting, and regulatory standards throughout entire data pipeline.
- Collaborate with the rest of the Data Engineering Team, subject matter experts and department leaders to understand, analyze, build and deliver new data-related processes and/or reports.
Skills and Experience:
- Bachelor’s Degree in computer science or equivalent experience required.
- 2+ years of experience in the design and development of data pipelines and tasks.
- Good understanding of data warehousing concepts and dimensional data modeling.
- Hands-on experience with troubleshooting performance issues and fine tuning SQL queries.
- Experience in Python including in modules/libraries such as pandas, numpy, Flask, scikit-learn, and sci-py.
- Proven experience extracting data from structured data sources (SQL, Excel, CSV files, Couchbase) and unstructured data sources
- (Splunk, log files) both on-premise and in the cloud.
- Experience consuming data from web services, REST and SOAP, HTML, XML and JSON.
- Knowledge of version control systems using Git, Bitbucket, SVN, or Team Foundation.
- Experience in Microsoft SQL Server, SSIS, SSRS, Power BI, or Azure is preferred but not required.
- Familiar with other data warehouse platforms like AWS Redshift or AWS Data Pipeline.
Benefits and Perks:
- Open vacation policy
- 401k program with company match
- Medical, dental, vision, and life insurance benefits
- Flexible spending accounts
- Commuter and transit benefits
- Professional growth opportunities
- Casual dress code
- Generous employee referral bonuses
- Happy hours, ping-pong tournaments, and more company-sponsored events
- Subsidized gym memberships
- GoHealth is an Equal Opportunity Employer