Submitting more applications increases your chances of landing a job.
Here’s how busy the average job seeker was last month:
Opportunities viewed
Applications submitted
Keep exploring and applying to maximize your chances!
Looking for employers with a proven track record of hiring women?
Click here to explore opportunities now!You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for
Would You Be Likely to Participate?
If selected, we will contact you via email with further instructions and details about your participation.
You will receive a $7 payout for answering the survey.
Bosch Global Software Technologies Private Limited is a 100% owned subsidiary of Robert Bosch GmbH, one of the world's leading global supplier of technology and services, offering end-to-end Engineering, IT and Business Solutions. With over 27,000+ associates, it’s the largest software development center of Bosch, outside Germany, indicating that it is the Technology Powerhouse of Bosch in India with a global footprint and presence in the US, Europe and the Asia Pacific region.
Experience Summary
6–8 years of Data Engineer specialized in building document and knowledge-oriented data pipelines for regulatory/compliance domains, with strong capabilities in structured transformations, knowledge graphs, and containerized platform integration.
Core Responsibilities / Focus
Build and operate data ingestion and transformation pipelines for legal/regulatory content
Normalize and transform heterogeneous source formats (e.g., XML/HTML/structured exports) using tools such as XSLT
Implement pipelines for embeddings generation, indexing, and enrichment for downstream AI/RAG systems
Design and manage RDF-based knowledge representations and SPARQL-accessible datasets
Integrate storage and processing components across containerized/cloud environments
Support event-driven or integration-heavy workflows (e.g., via Apache Camel, message brokers)
Ensure reproducibility, maintainability, and operational handover of data pipelines
Core Skills (Must-Have)
Python
Docker / Docker Compose
Kubernetes
Knowledge Graphs (RDF)
SPARQL
XSLT
/Java
Embeddings pipelines / vector preparation
Azure Storage (or equivalent cloud storage services)
Apache Camel
Git
Preferred / Nice-to-Have
Docling (or similar Document conversion)
CloudEvents
Kafka (or other message brokers)
Event-based systems / event-driven architecture
Dev Containers
GitOps
Documentation practices
Domain Advantage
Experience processing legal/regulatory source documents and preserving semantic structure / provenance
Familiarity with content domains such as EU regulation, privacy, ESG, and compliance frameworks
Educational qualification:
BE/B.Tech or Equivalent Degree
Experience :
6-8 Years
Mandatory/requires Skills :
Strong hands-on expertise in Python, Docker / Docker Compose, Kubernetes, Knowledge Graphs,Java (RDF),SPARQL,XSLT,Embeddings pipelines / vector preparation, Azure Storage (or equivalent cloud storage services),
Apache Camel,Git
Preferred Skills :
You'll no longer be considered for this role and your application will be removed from the employer's inbox.