Design, develop, and maintain robust data pipelines to support data integration from Oracle Flexcube and other data sources into the Cloudera Data Platform.
Utilize Hadoop for efficient data storage and Hive+Impala for data querying and processing.
Ensure the reliability, scalability, and performance of data operations.
Implement best practices for data management, including data quality, data governance, and data security.
Monitor and troubleshoot data pipelines to ensure smooth data flow and timely resolution of issues.
Collaborate with cross-functional teams to understand data requirements and deliver solutions that meet business needs.
Automate data workflows and processes to enhance operational efficiency.
Develop and maintain documentation related to data engineering and operations Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
5+ and 7+ years of experience in data engineering and data operations, preferably in the banking sector.
Strong knowledge of Oracle Flexcube, APIs, JDBC/ODBC connection, SQL, Python, Spark (PySpark), Cloudera Data Platform, Hadoop, Hive, and Impala.
Cloud knowledge is a plus.
Proven experience in designing and managing data pipelines.
Excellent analytical and problem-solving skills.
Strong communication and interpersonal skills.
Ability to work independently and as part of a team.
Certification in data engineering or related fields is a plus
(ref:hirist.tech)
How strong is your resume?
Upload your resume and get feedback from our expert to help land this job
How strong is your resume?
Upload your resume and get feedback from our expert to help land this job