PhysIQ is looking for a Lead Data Engineer to join our growing team! Our Lead Data Engineer will lead and grow a team of engineers focused on collecting, storing, processing, analyzing, and serving huge data sets in our real-time healthcare platform. The data engineering team will be 100% focused on ensuring physIQ has a cutting edge, rock solid, real-time analytics platform.
Initial responsibilities include taking ownership of existing data processing systems, developing a 12-month roadmap, and begin executing on that plan. Longer term, this role will be responsible for curating a world class data engineering team and ensure the physIQ analytics platform is second to none in Healthcare.
Responsibilities
- Design new streaming and batch data processing solutions to meet business needs
- Implement and maintain solutions with hands-on leadership of data engineering team
- Collaborate with architect to develop roadmap for data systems and development plans
- Collaborate with architect to develop architectural and technical decisions around data systems
- Ensure proper technical documentation for all data systems
- Own data availability and integrity within platform
- Ensure production data systems are both scalable and fault tolerant
- Drive quality through test driven development and testing best practices
- Support verification and validation of data systems per physIQ Quality Management System
- Proactively identify issues & risks with data systems and implement preventative solutions
- Work collaboratively with our data scientists to understand problems and implement optimal solution
- Productize offline analytic models into our platform as real-time systems
- Work collaboratively with our DevOps & Security teams to ensure regulatory security compliance
- Ensure data systems are properly instrumented for real-time monitoring
- Work collaboratively with our Cloud Operations team to meet disaster recovery requirements
- Leverage modern open source technologies and extend/improve as needed
- Constant evaluation of new technology solutions
Required Skills
- 8+ years of experience in a hands-on, data centric role (architecture, streaming, warehousing)
- 3-5 years of professional experience with Java and Python
- Proven track record of Innovation and expertise in Data Engineering
- Deep understanding of streaming data architectures and technologies
- Experience with modern data processing platforms (Hadoop, MapReduce, Spark, Flink, etc)
- Extensive experience working with distributed systems (Kafka, Cassandra, Elasticsearch, etc)
- Strong understanding of software engineering best practices
- Demonstrated leadership capabilities with the ability to work across functional boundaries
- Ability to coach, develop, and lead others
- Ability to work with broad parameters in complex situations
- Excellent organizational skills and ability to manage multiple projects in a fast-paced environment
- Experience with agile development and continuous integration
- Self-starter with excellent communication skills
Nice to have:
- Designing, deploying, and managing large scale application infrastructures
- Experience with a major cloud platform, such as AWS or GCE
- Production experience with Flink
- Production experience with Spark
- Production experience with Cassandra
- Development experience with Gradle
- Development experience with Kotlin
To apply, send your resume and cover letter to [email protected]