Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/iQw6JpGUE3wdyo457
Back to the job results
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

Company Description

Bosch Global Software Technologies Private Limited is a 100% owned subsidiary of Robert Bosch GmbH, one of the world's leading global supplier of technology and services, offering end-to-end Engineering, IT and Business Solutions. With over 27,000+ associates, it’s the largest software development center of Bosch, outside Germany, indicating that it is the Technology Powerhouse of Bosch in India with a global footprint and presence in the US, Europe and the Asia Pacific region.



Job Description

Experience Summary


6–8 years of Data Engineer specialized in building document and knowledge-oriented data pipelines for regulatory/compliance domains, with strong capabilities in structured transformations, knowledge graphs, and containerized platform integration.


Core Responsibilities / Focus


  • Build and operate data ingestion and transformation pipelines for legal/regulatory content


  • Normalize and transform heterogeneous source formats (e.g., XML/HTML/structured exports) using tools such as XSLT


  • Implement pipelines for embeddings generation, indexing, and enrichment for downstream AI/RAG systems


  • Design and manage RDF-based knowledge representations and SPARQL-accessible datasets


  • Integrate storage and processing components across containerized/cloud environments


  • Support event-driven or integration-heavy workflows (e.g., via Apache Camel, message brokers)


  • Ensure reproducibility, maintainability, and operational handover of data pipelines


Core Skills (Must-Have)


  • Python


  • Docker / Docker Compose


  • Kubernetes


  • Knowledge Graphs (RDF)


  • SPARQL


  • XSLT


  • /Java


  • Embeddings pipelines / vector preparation


  • Azure Storage (or equivalent cloud storage services)


  • Apache Camel


  • Git


Preferred / Nice-to-Have


  • Docling (or similar Document conversion)


  • CloudEvents


  • Kafka (or other message brokers)


  • Event-based systems / event-driven architecture


  • Dev Containers


  • GitOps


  • Documentation practices


Domain Advantage


Experience processing legal/regulatory source documents and preserving semantic structure / provenance


Familiarity with content domains such as EU regulation, privacy, ESG, and compliance frameworks



Qualifications

Educational qualification:


BE/B.Tech or Equivalent Degree


Experience :


6-8 Years


Mandatory/requires Skills :
Strong hands-on expertise in Python, Docker / Docker Compose, Kubernetes, Knowledge Graphs,Java (RDF),SPARQL,XSLT,Embeddings pipelines / vector preparation, Azure Storage (or equivalent cloud storage services),


Apache Camel,Git


Preferred Skills :




This job post has been translated by AI and may contain minor differences or errors.
You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.