Job Description
We help the world run better
At SAP, we keep it simple: you bring your best to us, and we'll bring out the best in you. We're builders touching over 20 industries and 80% of global commerce, and we need your unique talents to help shape what's next. The work is challenging – but it matters. You'll find a place where you can be yourself, prioritize your wellbeing, and truly belong. What's in it for you? Constant learning, skill growth, great benefits, and a team that wants you to grow and succeed.
What you will build
As a Data Engineer on the Service & Support Data Lake team, you will help modernize our data engineering capabilities. We run an in-house data lake that powers AI services for SAP customer support and generates insights through project and agent mining. You will join a collaborative team evolving from isolated pipelines to a unified intelligent platform for self-service analytics and autonomous data agents. Data governance underpins every solution we deliver end to end.
Responsibilities
- Design and maintain scalable pipelines with clear architecture and high quality standards.
- Build real-time and batch ingestion from diverse sources with automated validation.
- Write production-grade code with strong testing discipline (unit/integration tests), code review quality, and maintainable modular design.
- Troubleshoot complex data issues end to end, including root-cause analysis, performance bottlenecks, and reliability incidents.
- Implement anonymization and PII controls aligned with governance requirements.
- Develop metadata pipelines for schema profiling, business context extraction, and lineage tracking.
- Contribute to semantic layers that map technical fields to business terminology for self-service and natural-language analytics use cases.
- Use AI-assisted development practices to accelerate delivery and maintenance; prior autonomous-agent experience is a plus, not a requirement.
- Establish monitoring, alerting, and data quality controls to ensure secure, reliable analytical assets, and evaluate their AI/ML readiness against data science requirements.
- Partner with the Tech Lead and Architect to convert requirements into production-ready systems.
What you bring
- Bachelor's degree or equivalent practical experience.
- 3+ years of experience coding in Python (pandas, pytest) and SQL.
- 3+ years of experience with Spark / Big Data processing (PySpark): transformations, partitioning, performance optimization.
- 3+ years designing and deploying data pipelines, including managing data schemas and processing high-volume workflows.
- Strong software engineering fundamentals: clean code, modular design, debugging, version control, and maintainable documentation.
- Strong SQL and data modeling capabilities (normalized and denormalized patterns, data contracts, schema evolution).
- Hands-on testing and release practices: unit/integration testing, CI/CD pipelines, and safe production rollout.
- Experience in observability and operations: metrics, logging, alerting, and on-call-friendly troubleshooting.
- Experience with SQL databases (PostgreSQL preferred, others acceptable) and with non-relational and lakehouse storage (Elasticsearch and Delta Lake required).
- Experience with real-time and batch ingestion (APIs, both polling and push; Kafka streaming).
- Experience with workflow orchestration (Kubeflow Pipelines, Airflow, Prefect, or Dagster).
- Experience with data governance: data redaction, anonymization, and PII handling.
- Proficiency in Git workflows: branching strategies, code review, CI/CD integration.
Preferred Qualifications
- 5+ years designing enterprise-scale data platforms and analytics infrastructure.
- Familiarity with LLM-enabled data applications, including RAG, embeddings, vector search, and evaluation from a data platform perspective.
- Experience building data services and data APIs that support AI-related applications and analytics products.
- Experience productionizing data and feature pipelines that support machine learning and intelligent applications.
- Strong interest in AI-native data engineering and agentic data workflows, including orchestration, tool integration, evaluation, and workflow automation.
- Understanding of MLOps/LLMOps principles to ensure scalable and reliable deployment of text processing and redaction pipelines.
- Experience with monorepo or shared library architecture patterns.
- Ability to operate across ambiguity and influence cross-functional technical decisions.
- Demonstrated learning agility in adopting emerging data concepts (for example, data agents and semantic layer patterns).
What you'll get
Technical Growth
- Build an end-to-end data platform from consolidation to metadata intelligence to AI agents.
- Deepen expertise in metadata pipelines, semantic layers, and data-agent foundations.
- Master data quality frameworks with testing patterns, quality gates, and large-scale validation.
Team & Impact
- Join as a foundation team member whose work shapes platform direction.
- Collaborate directly with the Tech Lead and Data Scientists.
- Enable next-generation capabilities: self-service analytics, intelligent agents, and automated insights.
Long-Term Vision
- Grow at the intersection of data engineering and AI agents.
- Help evolve the platform from raw data to intelligent autonomous systems.
- Drive scalable impact across datasets, pipelines, and downstream analytics.
Where you belong
Culture:
- Shared ownership model: any engineer can maintain any pipeline
- AI-augmented workflows: tools like Claude Code support migration and scaffolding
- Quality-first: 100% branch coverage, automated quality gates enforced
- Collaborative learning: show & tell demos, pattern documentation, peer reviews
Bring out your best
SAP innovations help more than four hundred thousand customers worldwide work together more efficiently and use business insight more effectively. Originally known for leadership in enterprise resource planning (ERP) software, SAP has evolved to become a market leader in end-to-end business application software and related services for database, analytics, intelligent technologies, and experience management. As a cloud company with two hundred million users and more than one hundred thousand employees worldwide, we are purpose-driven and future-focused, with a highly collaborative team ethic and commitment to personal development. Whether connecting global industries, people, or platforms, we help ensure every challenge gets the solution it deserves. At SAP, you can bring out your best.
We win with inclusion
SAP’s culture of inclusion, focus on health and well-being, and flexible working models help ensure that everyone – regardless of background – feels included and can run at their best. At SAP, we believe we are made stronger by the unique capabilities and qualities that each person brings to our company, and we invest in our employees to inspire confidence and help everyone realize their full potential. We ultimately believe in unleashing all talent and creating a better world.
SAP is committed to the values of Equal Employment Opportunity and provides accessibility accommodations to applicants with physical and/or mental disabilities. If you are interested in applying for employment with SAP and need accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to the Recruiting Operations Team: [email protected].
For SAP employees: Only permanent roles are eligible for the SAP Employee Referral Program, according to the eligibility rules set in the SAP Referral Policy. Specific conditions may apply for roles in Vocational Training.
Qualified applicants will receive consideration for employment without regard to their age, race, religion, national origin, ethnicity, gender (including pregnancy, childbirth, et al.), sexual orientation, gender identity or expression, protected veteran status, or disability, in compliance with applicable federal, state, and local legal requirements.
Successful candidates might be required to undergo a background verification with an external vendor.
AI Usage in the Recruitment Process
For information on the responsible use of AI in our recruitment process, please refer to our Guidelines for Ethical Usage of AI in the Recruiting Process.
Please note that any violation of these guidelines may result in disqualification from the hiring process.
Requisition ID: 455502 | Work Area: Software-Design and Development | Expected Travel: 0 - 10% | Career Status: Professional | Employment Type: Regular Full Time | Additional Locations: #LI-Hybrid
