Officer - Data Engineer - C11 - Hybrid - CHENNAI @ Citi

Remote Full-time
This is a data engineer position - a programmer responsible for the design, development implementation and maintenance of data flow channels and data processing systems that support the collection, storage, batch and real-time processing, and analysis of information in a scalable, repeatable, and secure manner in coordination with the Data & Analytics team.The overall objective is defining optimal solutions to data collection, processing, and warehousing. Must be a Spark Java development expertise in big data processing, Python and Apache spark particularly within banking & finance domain. He/She designs, codes and tests data systems and works on implementing those into the internal infrastructure.Responsibilities: Ensuring high quality software development, with complete documentation and traceabilityDevelop and optimize scalable Spark Java-based data pipelines for processing and analyzing large scale financial dataDesign and implement distributed computing solutions for risk modeling, pricing and regulatory complianceEnsure efficient data storage and retrieval using Big DataImplement best practices for spark performance tuning including partition, caching and memory managementMaintain high code quality through testing, CI/CD pipelines and version control (Git, Jenkins)Work on batch processing frameworks for Market risk analyticsPromoting unit/functional testing and code inspection processesWork with business stakeholders and Business Analysts to understand the requirementsWork with other data scientists to understand and interpret complex datasetsQualifications:5- 8 Years of experience in working in data eco systems.4-5 years of hands-on experience in Hadoop, Scala, Java, Spark, Hive, Kafka, Impala, Unix Scripting and other Big data frameworks.3+ years of experience with relational SQL and NoSQL databases: Oracle, MongoDB, HBaseStrong proficiency in Python and Spark Java with knowledge of core spark concepts (RDDs, Dataframes, Spark Streaming, etc) and Scala and SQLData Integration, Migration & Large Scale ETL experience (Common ETL platforms such as PySpark/DataStage/AbInitio etc.) - ETL design & build, handling, reconciliation and normalizationData Modeling experience (OLAP, OLTP, Logical/Physical Modeling, Normalization, knowledge on performance…

Apply Now
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Project Administrator - Renewable Energy (Traveler)

Remote

Service Desk Analyst

Remote

**Experienced Full Stack Software Engineer – Web & Cloud Application Development at arenaflex**

Remote

Senior Staff Solutions Engineer

Remote

Dynamic Remote Live Chat Support Specialist – Immediate Start, Full‑Time & Flexible Hours

Remote

Insurance Customer Support Associate – Remote in Nevada

Remote

**Part Time Evening Remote Data Entry Specialist – Unlock a World of Opportunities at blithequark**

Remote

Urgently Hiring: Want Retail Sales Associate, Crabtree Valley

Remote

MS Access Data Analyst

Remote

Application Developer ($60,000 - $100,000 Annually DOE)

Remote
← Back