Senior Data Engineer
Company overview:
Blue Orange Digital is a boutique data & AI consultancy that delivers enterprise-grade results. We design and build modern data platforms, analytics, and ML/AI Agent solutions for mid‑market and enterprise clients across Private Equity, Financial Services, Healthcare, and Retail.
Our teams work with technologies like Databricks, Snowflake, dbt, and the broader Microsoft ecosystem to turn messy, real-world data into trustworthy, actionable insight.
We’re a builder‑led, client‑first culture that prizes ownership, clear communication, and shipping high‑impact work.
Note: Please submit your resume in English, as all application materials must be in English for review and consideration.
Position Overview
Blue Orange Digital is seeking a Senior Data Engineer for a full‑time, 22‑week engagement (with options to extend or convert) on a large‑scale enterprise data warehouse implementation in the global commodities trading and supply chain industry.
You will design and build ingestion pipelines, medallion‑layer transformations, and Gold‑layer data products on Microsoft Fabric—covering Trader P&L, Global Position, Risk Metrics (VaR), and Financial Consolidation—as part of a project‑based delivery team. Working closely with the Solutions Architect/Tech Lead and a co‑invested client engineering team, you will contribute across all four engagement phases from Day 1 through stabilization and handover.
Responsibilities
Design and implement Bronze‑layer ingestion pipelines from regional ERP systems, CTRM instances (including Irely), and SAP S/4HANA source feeds into OneLake using Microsoft Fabric Data Factory.
Build Silver‑layer transformation logic in Fabric notebooks using PySpark and SQL against Delta Lake tables, covering data cleansing, standardization, deduplication, and business rule application.
Develop Gold‑layer data product tables serving Trader P&L, Global Position, VaR risk metrics, and Financial Consolidation reporting domains.
Implement and maintain the OneLake medallion architecture following BOD design standards and engagement coding conventions.
Perform schema design, data modeling, and query performance optimization across Fabric Lakehouse and Warehouse layers.
Collaborate with the Analytics/Power BI Specialist to publish clean, analytics‑ready Gold‑layer outputs aligned to semantic model requirements.
Participate in code reviews, enforce development standards, and contribute to CI/CD pipeline configuration for the Fabric workspace.
Work with client‑side Fabric/Azure data engineers as a technical peer—aligning on ingestion patterns, shared schemas, and pipeline governance.
Requirements
7+ years of data engineering experience with a strong portfolio of end‑to‑end pipeline implementations.
Hands‑on proficiency with Microsoft Fabric (OneLake, Lakehouse, Notebooks, Data Factory, Warehouse)—the full core stack for this engagement.
Strong SQL and PySpark skills with production‑level experience building and maintaining Delta Lake pipelines.
Experience building multi‑hop pipelines across Bronze/Silver/Gold medallion architectures.
Familiarity with ERP or CTRM source system integration (SAP S/4HANA or similar strongly preferred).
Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent experience).
Preferred Qualifications
Exposure to commodity trading, financial risk, or P&L reporting data environments.
Microsoft Fabric or Azure Data Engineer certification (DP‑700 or DP‑203).
Experience setting up CI/CD for data pipelines using Azure DevOps or GitHub Actions.
Understanding of data governance, lineage tracking, and schema evolution best practices.
Salary: $ 6200 - 8500 [ USD ] monthly
Background checks may be required for certain positions/projects.
Blue Orange Digital is an equal-opportunity employer.
- Department
- Engineering
- Role
- Senior Data Engineer
- Locations
- Multiple locations
- Remote status
- Fully Remote
- Monthly salary
- $6,200 - $8,500
About Blue Orange Digital
Blue Orange Digital is a data and AI consulting firm that helps companies turn complex data into real business outcomes. We partner with organizations across industries to design and deploy scalable data infrastructure, advanced analytics, and AI-powered solutions. Our team is fully remote, globally distributed, and driven by curiosity, impact, and innovation.