DIGIWORLD TECHNOLOGIES PTE. LTD. is hiring for a Data Engineer internship — a 12-month, on-site Engineering Hardware role based in UBI CRESCENT, Singapore. It is an unpaid internship. It is open to university students, typically in Year 2–4. Applicants with experience in SAP Implementation, data uploads, Application Performance Management, Informatica Powercenter, and Apache Spark are a strong fit.
About this role
Job Objective: Develop and deliver scalable ETL/ELT solutions across Teradata and Hadoop platforms by building robust data pipelines, ingestion frameworks, and automation capabilities. Ensure efficient data processing, performance optimization, and reliable integration of enterprise data sources, while supporting data warehouse, CDC, and MDM initiatives.

Responsibilities:
• Create EDW ETL/ELT code using Teradata SQL, Informatica, Apache Spark, QueryGrid, and Trino to perform various transformations and load data into a Teradata-based data warehouse or data marts.
• Tune the performance of highly complex applications to reduce resource usage.
• Create a framework using GCFR, shell scripts, and BTEQ to automate processes end to end. Also integrate with control.
• Build an in-house CDC framework using Java to process OLR logs from SAP (Oracle) and ingest the data into Teradata. Build a framework around this CDC ingestion to make it configurable.
• Install and build a Master Data Management (MDM) application for various user uploads, along with data validations and an approval workflow.

Requirements:
• Strong hands-on experience with ETL (Informatica PowerCenter) and Teradata-based ETL development, as well as the Hadoop ecosystem (Hive, Impala, Spark, Kafka, Iceberg, Ranger, Atlas, NiFi, Flink, etc.) and data pipeline orchestration.
• Ability to carry out the responsibilities listed above.