Description:
You will be responsible for designing, developing, and maintaining scalable data pipelines to support data-driven initiatives. The role requires a strong understanding of data architecture and database management, along with the programming skills needed to handle large volumes of data.
Job Responsibilities:
- Writing, maintaining, and debugging web crawlers and scrapers to extract data at scale
- Using APIs to fetch data and storing it in databases (SQL or NoSQL)
- Parsing data extracted from various sources and performing data cleaning and transformation
- Implementing data quality and data validation processes to ensure the accuracy, consistency, and reliability of data
- Processing extracted data to derive business insights and acting on them
- Developing and maintaining robust, scalable, and high-performance data pipelines and ETL processes
- Designing, building, and optimizing data models, databases, and data warehouses for the storage and retrieval of structured and unstructured data
- Maintaining repositories with version control tools such as Git and deploying programs on servers
Job Qualifications:
- No degree required; skills are assessed through our recruitment process
- 0-2 years of experience in Python. Preference will be given to candidates with experience in data engineering technologies, database management, web automation, and web scraping
- NOTE: Recent graduates are welcome to apply but MUST showcase exceptional academic, freelance, or personal projects
- Experience with data transformation and data warehousing tools such as dbt and Snowflake
- Familiarity with different databases (such as MySQL, MongoDB, and PostgreSQL) and experience reading and writing complex SQL queries
- Experience with Python scraping libraries and frameworks (such as Selenium, BS4, and Scrapy) and the Requests module
- Knowledge of API integration to implement complex workflow automations
- Experience with CI/CD and version control tools such as Git and GitHub
- Familiarity with UNIX and shell scripting
- Bonus: Knowledge of AWS (EC2, RDS, Glue, EMR, Lambda, S3, etc.), Azure (ADF, Databricks, etc.), and GCP (BigQuery, Compute Engine, Functions, etc.)