Data Engineer
Full Time - Exempt
Onsite, Columbus, OH
AndHealth is on a mission to radically improve access and outcomes for the most challenging chronic health conditions, with the goal of making world-class specialty care accessible and affordable to all. We partner with health systems, community health centers, plans, and employers to remove barriers to care so that all people have access to the care they deserve.
Do you thrive on building robust data pipelines that unlock valuable insights? We're seeking a highly motivated Data Engineer to play a critical role in establishing and maintaining integrations with our Community Health Center (CHC) and health system partners. You'll be responsible for the entire data engineering lifecycle, from designing and building secure data pipelines to ensuring that high-quality, standardized data is readily available for analysis and application integration. This hands-on role demands a deep understanding of data manipulation and cloud platforms, along with a passion for improving healthcare outcomes through data. It offers a unique opportunity to apply your data engineering expertise to healthcare delivery: you'll be instrumental in building the data foundation that supports data-driven decision-making and ultimately improves patient outcomes.
Responsibilities:
- Healthcare Data Integration:
  - Collaborate with CHC and health system partners to design and develop secure data pipelines for ingesting data from their Electronic Health Records (EHRs) and other disparate sources.
  - Analyze CHC-specific EHR data models and collaborate with partners to identify the best available data fields.
  - Use scripting languages (Python) to automate extract, transform, and load (ETL) processes.
- Cloud Platform Expertise:
  - Use BigQuery and other Google Cloud Platform (GCP) tools to build efficient and scalable data pipelines.
  - Manage data storage and ensure data security and compliance with relevant healthcare regulations (HIPAA, etc.).
- Data Quality Management:
  - Implement robust data quality checks to identify and rectify inconsistencies in healthcare data.
  - Standardize data formats for seamless integration with internal systems and analytics tools.
  - Develop data pipelines that are maintainable, scalable, and performant.
- Reference Data Management:
  - Automate the import and maintenance of reference data tables, including 340B pricing, HRSA data, CMS.gov Public Use Files (PUFs), Medicaid spending data, and FDA Orange Book data.
Desired Skills & Qualifications:
- 2+ years of experience in data engineering or a related field.
- Expert proficiency in SQL for data manipulation and querying.
- Strong understanding of data modeling concepts and best practices.
- Experience with cloud platforms such as Google Cloud Platform, particularly BigQuery.
- Proven ability to design and implement efficient data pipelines using scripting languages (Python preferred).
- Experience with data quality tools and methodologies.
- Familiarity with healthcare data standards and regulations (HIPAA, etc.) is a plus.
- Excellent communication and collaboration skills to work effectively with internal and external partners.
- Strong problem-solving skills and a proactive approach to identifying and resolving data issues.