GECO ASIA PTE. LTD. is hiring for a Data Engineer internship — a 12-month, on-site Data Science role based in Singapore. It is an unpaid internship. It is open to university students, typically in Year 2–4. Applicants with experience in Spark SQL, Teradata, Hadoop Database, Metadata Standards, and Informatica are a strong fit.
⚡ New Data Science internships, the moment they're posted — join our Telegram
About this role
Job Summary Role: Data Engineer Start: ASAP Duration: 12 Months Location: Singapore We are seeking an experienced Data Engineer with 6+ years of expertise in enterprise data warehouse (EDW), ETL/ELT development, and big data engineering. The ideal candidate will be responsible for designing, developing, and optimizing scalable data pipelines and frameworks across Teradata-based data warehouse environments and modern Hadoop ecosystem platforms. This role requires strong hands-on experience in ETL development using Informatica PowerCenter, Teradata SQL, and Spark, along with deep exposure to data pipeline automation, performance tuning, and CDC (Change Data Capture) frameworks. The candidate will also contribute to building enterprise-grade data ingestion, transformation, and governance solutions supporting both structured and semi-structured data across complex enterprise systems. Requirements: - Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field. - Minimum 6+ years of experience in Data Engineering, ETL development, or Enterprise Data Warehouse environments. - Strong hands-on experience in Teradata SQL and EDW/ETL development. - Proven experience with Informatica PowerCenter for enterprise ETL development. - Strong experience with Hadoop ecosystem technologies (Apache Spark, Hive, Kafka, NiFi, Iceberg, Impala, etc.) - Strong expertise in performance tuning and optimization of large-scale data pipelines. - Hands-on experience with shell scripting, BTEQ, and automation frameworks. - Strong understanding of data pipeline orchestration and workflow automation. - Experience with CDC (Change Data Capture) concepts, preferably with Java-based implementation. - Experience building frameworks for data ingestion, transformation, and automation in enterprise environments. - Strong understanding of data modeling, ETL design patterns, and data warehousing concepts. - Experience working with enterprise systems such as SAP (Oracle) for data integration. - Experience with Master Data Management (MDM) solutions including data validation and approval workflows. - Strong analytical, problem-solving, and debugging skills. - Experience working in Agile or enterprise-scale delivery environments. Roles and Responsibilities: - Design, develop, and maintain ETL/ELT processes using Teradata SQL, Informatica PowerCenter, Apache Spark, QueryGrid, and Trino. - Build and optimize data pipelines to load and transform data into Teradata-based data warehouses and data marts. - Develop and implement scalable data engineering frameworks to support end-to-end automation using GCFR, BTEQ, and shell scripting. - Perform performance tuning and optimization of complex ETL jobs and data processing workflows to reduce resource consumption and improve efficiency. - Build and maintain in-house CDC (Change Data Capture) frameworks using Java to process OLR logs from SAP (Oracle) systems and ingest data into Teradata. - Design configurable and reusable CDC ingestion frameworks to support enterprise-wide data integration requirements. - Develop and support Master Data Management (MDM) solutions for user data uploads, including data validation, enrichment, and approval workflows. - Work with Hadoop ecosystem technologies including Hive, Impala, Spark, Kafka, Iceberg, NiFi, Ranger, Atlas, and Flink for data ingestion, processing, and governance. - Implement data pipeline orchestration and scheduling mechanisms to ensure reliable and timely data delivery. - Collaborate with business, analytics, and application teams to translate requirements into scalable data engineering solutions. - Ensure data quality, consistency, and governance across enterprise data platforms. - Support troubleshooting, root cause analysis, and production issue resolution for data pipelines and ETL processes. - Continuously improve data engineering standards, frameworks, and automation capabilities. Please send your application highlighting: - Your relevant experience - Current/expected salary - Availability information - A latest MS-WORD Resume **We regret that only short-listed applicants will be contacted.** GECO Asia values the data privacy rights of our customers, associates, partners and prospective applicants. We have a privacy policy that governs our collection and use of personal data in place. In conjunction with the PDPA act in Singapore, we have updated our Privacy Policy and Terms of Use to better clarify our collection and use of your personal information. The same can be found here (https://www.geco.asia/about/privacy-policy) Note: GECO Asia is an Information Technology Consulting Services provider. We provide specialist IT and Digital Transformation specialist resources on a project (SOW) and/or permanent basis. We operate under a Comprehensive License offered by Ministry of Manpower, Singapore. [GECO Asia Pte Ltd, License No. 07C4453] [2 Venture Drive, #10-18 Vision Exchange, Singapore 608526]
Also in Data Science