Details

Location
Cleveland OH

Division
I.T.

Contact

Shane Paynter
614.643.0704
shane@ringsidetalent.com

Job Description

We are looking for an AI/ML Data Engineer to work for our client. The ideal candidate aligns with the responsibilities and qualifications outlined below.

About the Role

Our client is seeking an AI/ML Data Engineer to design, build, and scale modern data and machine learning platforms. You’ll partner with data scientists and product/engineering teams to develop reliable data pipelines, productionize ML models, and enable robust MLOps practices—driving measurable business impact from data and AI.

Responsibilities

Design and build cloud-scale data pipelines (batch & streaming) using tools such as Spark/Databricks, Airflow/Prefect, Kafka/Kinesis, and DBT
Develop feature engineering workflows, feature stores, and reusable data assets for ML
Productionize models with ML pipelines (training, evaluation, deployment) using MLflow/SageMaker/Vertex AI/Azure ML
Implement MLOps best practices: CI/CD for data & ML, automated testing, model registry, canary/blue‑green deployments
Build monitoring for data quality, model performance (drift, bias, accuracy), and platform health
Optimize data storage and compute performance (partitioning, Z‑ordering, indexing, caching)
Enforce data governance & security (IAM, secrets management, lineage, PII handling)
Collaborate with data scientists, analysts, and software engineers to translate requirements into scalable solutions
Create documentation and provide enablement for stakeholders and downstream consumers

Qualifications

5+ years of experience in data engineering or ML engineering, including production systems
Strong skills in Python and SQL; experience with Spark (PySpark) and Databricks or similar platforms
Hands-on experience with one major cloud (AWS, Azure, or GCP) and services for data & ML (e.g., S3/ADLS/GCS, EMR/Databricks, SageMaker/Azure ML/Vertex AI)
Workflow orchestration (e.g., Airflow, Prefect, Dagster) and CI/CD (GitHub Actions, Azure DevOps, GitLab CI)
Containerization and deployment with Docker; Kubernetes experience is a plus
Streaming data experience (e.g., Kafka, Kinesis, Pub/Sub) strongly preferred
Knowledge of MLOps frameworks (MLflow, model registries), testing (unit/integration), and observability (metrics, logging)
Familiarity with data quality frameworks (Great Expectations, Deequ) and governance (Lakehouse/medallion architecture, lineage)

What Our Client Offers

Competitive compensation with performance bonus
Modern data stack (Databricks/Spark, Airflow/Prefect, MLflow) and green‑field build opportunities
High visibility to leadership and ownership over ML platform decisions
Professional development budget (certifications, conferences, courses)