Senior ML Ops / Data Engineer

About xCures:

Launched in 2018, xCures provides a paradigm shift to improve patient outcomes via comprehensive, intelligent access to healthcare data on an AI-assisted platform. Delivered using a Software as a Service (SaaS) model,  xCures enables healthcare organizations to quickly find and use clinical insights. We continue to enhance our platform to provide significant new capabilities for patient care.

About the role:

The ML Ops / Data Engineer is a member of a highly motivated team, and reports to the VP of Engineering.  The individual will work closely with Data Science, Engineering, Customer Success, and Medical Affairs personnel to design, analyze, develop, test and document machine learning algorithms and data integration tools in the development and enhancement of an integrated electronic medical record comprising data from multiple providers. They will leverage key development, database, ML, reporting and analytic skills across a diverse and multi-faceted application landscape. The position involves working on multiple projects simultaneously and performing research to explore new offerings and technologies to improve workload productivity.

This job is right for you if you like:

  • Solving problems that make a real difference in people’s lives and wellbeing.
  • Rapid growth and the ability to make a personal, directional impact on strategy/execution.
  • Guiding innovative products, services, and processes from idea to user adoption
  • Rockstar teammates: an unparalleled team with decades of prior work experience in artificial intelligence, software systems, molecular biology, clinical oncology, clinical and regulatory operations, and related fields

Responsibilities

Essential Duties and Responsibilities include the following.  Other duties may be assigned. 

  • Design, build, and deploy machine learning models into development and production environments
  • Work with data science team to streamline model training, validation, integrated QA metrics, and inference pipelines
  • Implement and manage scalable infrastructure for ML model training and deployment using cloud services (principally AWS)
  • Ensure adherence to data privacy, security, and regulatory standards
  • Implement and maintain proper logging, monitoring, and alerting systems to safeguard development and production ML infrastructure
  • Lead the design, planning, and execution of the integration of data science and ML tools into the xCures solution, ensuring seamless interoperability between different EMR systems and our central platform
  • Develop data mapping and transformation strategies to align EMR data with the xCures database schema, ensuring data consistency and accuracy
  • Conduct thorough testing and validation of integrated systems to ensure data integrity, system reliability, and adherence to regulatory requirements
  • Use quality assurance principles to ensure data accuracy and integrity.
  • Troubleshoot data migration issues in partnership with the data science team
  • Collaborate with internal development teams, product managers, and other stakeholders to align integration efforts with overall product development and business objectives
  • Provide technical guidance in software design and development activities
  • Participate in scrum activities, perform code reviews, contribute to a high performing, growing team
  • Ensure new software meets quality standards including writing unit tests and automated tests
  • Provide technical support to resolve integration issues, troubleshoot errors, and optimize data exchange processes to minimize downtime and improve efficiency

Worksite Location

Fully remote; occasional travel.

Qualifications:

To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

Education and/or Experience:

Bachelor’s degree from four-year college or university and 4 years’ experience; or comparable experience and/or training; or equivalent combination of education and experience. BS in Computer Science or HealthCare Informatics preferred.

Required Skills and Qualifications:

  • Strong operational knowledge of ML Ops, including AWS Sagemaker, plus tools for integrating with LLMs and various ML algorithms, such as scikit-learn, langchain, and related packages
  • Good working knowledge of EMR systems (e.g., Epic, Cerner, Allscripts), their APIs, coding ontologies, data formats (HL7, FHIR), and integration methods
  • Familiarity with healthcare data standards and protocols (e.g., HIPAA, DICOM, IHE)
  • Proficiency in programming languages such as Java, Python, or JavaScript for developing integration interfaces.
  • HL7, CDA, X12, FHIR – Electronic Medical Record Integration experience
  • Strong SQL skills, including performance tuning in a SQL environment
  • Use ETL tools/processes to load data repositories and create data stores
  • AWS ecosystem familiarity (Lambdas, S3, SQS, RedShift)
  • Expertise in data structures and common methods to transform data for downstream analysis
  • Knowledge of SQL, Redshift, etc., to extract, transform and load data
  • Experience with interpreting data trends, conducting complex data analysis, and reporting results
    Must reside in the United States.
  • Must have authorization to work in the United States.

Preferred Skills and Qualifications:

  • Experience with Electronic Medical Records and/or Practice Management Systems
  • Experience deploying cloud solutions with Infrastructure as Code tools such as CloudFormation, CDK, and Terraform
  • Extensive experience with databases (e.g., SQL, NoSQL) and database design principles.
  • Strong knowledge of RESTful APIs and microservices architecture.
  • Experience with cloud platforms (e.g., AWS, Azure, or GCP) and containerization technologies (Docker, Kubernetes).
  • Experience developing serverless applications leveraging AWS Lambda
  • Experience writing and maintaining unit tests
  • Understanding of AWS ElasticSearch / Opensearch
  • Experience working with large-scale databases, collection, and organization of real-time event streaming data.
  • Working with Dimensional, Entity-Relationship, Tabular models, and OLAP data modeling.
  • A proven track record in delivering in an agile environment, while managing multiple priorities.
  • Practical experience with Continuous Integration/Continuous Deployment (CI/CD) pipelines
  • Experience in constructing engineering and architectural patterns

Notes

This is a big list. Don’t worry if you do not meet every qualification or wishlist item. If you are passionate, ambitious, adept, and mission-aligned, then we want to hear from you — even if you don’t check every box listed here. True talent shines through and transcends a list of bullet points.

Comp & Benefits:

  • Salary range : 135K to 200K annually
  • Medical, Dental, Vision insurance
  • 401k

xCures acknowledges that equal opportunity for all persons is a fundamental human value.Each employee and applicant will be considered on the basis of individual ability and merit, without regard to race, color, religion, age, sex, sexual orientation, gender identity, gender expression, pregnancy, national origin, marital status, physical disability, mental disability, medical condition, genetic information, protected military or veteran status, or any other characteristics.

To apply, send your resume to eng-jobs@xcures.com