We are seeking a hands-on Data Engineer with strong experience in Spark-based processing, object storage platforms, and data modeling to build and maintain MMBL’s Medallion Architecture (Raw → Curated → Semantic) on MINIO-based Delta Lake. You will play a key role in integrating multi-system data and supporting analytics, BI, and ML downstream.
If you are passionate about data engineering and ETL processes and have experience working with large-scale data platforms, we would love to hear from you! Apply today to be part of our team.
• Develop and manage data pipelines across ingestion (Kafka, Sqoop, Trino), transformation (Spark), and enrichment (Delta Lake on MINIO).
• Design and maintain Raw, Curated, and Semantic Zones within the Data Lake following the Medallion Architecture.
• Enable scalable processing and curation using Apache Spark (batch + structured streaming).
• Implement data modelling in OLAP columnar formats for BI performance.
• Collaborate with governance teams to ensure data is discoverable, high-quality, and catalogued via Open Metadata.
• Support structured and federated queries across data zones using Trino.
• Contribute to data quality frameworks, lineage tracking, and performance tuning.
• Education: Bachelor’s/master’s degree in computer science, Information Technology, or a related field.
• Experience: 3 to 5 years of experience in data engineering, ETL development, or related fields.
• Key Skills:
• Strong experience in SQL development and database optimization (Oracle 19c, PostgreSQL, MySQL).
• Proficiency in Trino, Sqoop, Delta Lake, and MINIO for data processing and storage.
• Hands-on experience with Python and R for data manipulation and ETL automation.
• Expertise in Big Data technologies and distributed computing frameworks.
• Experience with Tableau for data visualization and reporting.
• Strong understanding of data warehousing concepts, data lakes, and cloud storage solutions.
• Knowledge of data governance, security, and compliance standards.
• Strong experience with Apache Spark (PySpark) or Trino.
• Working knowledge of object stores (MINIO, S3) and Delta-Lake architecture.
• Expertise in Medallion Data Architecture (Raw → Curated → Semantic).
• Proficiency in data modeling, data federation, and columnar OLAP stores (e.g., Parquet).
• Proficient in SQL, Trino/Presto, and batch + streaming pipelines.
Job Locations:
Head Office
About MMBL:
Mobilink Microfinance Bank Ltd. is providing banking services to over 42 million registered users including 16+ million monthly active customers across Pakistan. With a hybrid model that combines traditional microfinance with mobile/digital banking technologies, the bank now operates with over 100 branches and 200,000 branchless banking agents and provides a USSD (GSM) based digital channel offering savings, micro enterprise (MSME) loans, small housing loans, remittances, collection (utility bills and loan installments), mobile wallets, insurance, G2P, B2B & B2P payments; thus, playing a leading role in the promotion of financial inclusion.
MMBL is committed to fostering a positive and productive workplace, and our core values reflect this focus. These values include promoting innovation and entrepreneurship, encouraging teamwork and collaboration, and prioritizing a customer-centric approach in all aspects of our business.
Why Join MMBL ?
This is an opportunity for someone who is passionate about making a difference and playing a key role in driving transformative change. Our team is committed to empowering millions with the tools necessary to succeed in the digital age, and we're looking for a talented individual to join us in this endeavor.