For a hands-on learning experience to develop Agentic AI applications, join our Agentic AI Bootcamp today. Early Bird Discount

Associate Data Engineer

Remote | Pakistan

Full Time | Pacific Time

Why this role exists:

Most data engineering roles expect you to wrangle yesterday’s data with yesterday’s tools. We’re building AI-powered pipelines for clients who want tomorrow’s answers today, across healthcare, finance, and manufacturing. If you’re ready to learn fast and build smarter, this is your launchpad.

 

Data Science Dojo’s consulting team is scaling up to deliver next-gen AI and analytics solutions for industry leaders. As we take on more client-facing projects, think LLM-powered automation, real-time analytics, and robust data infrastructure, we need hands-on data engineers who can turn raw data into actionable insights. This role is critical to powering our AI automation initiatives and ensuring our clients’ data foundations are ready for what’s next.

Your mission, should you choose to accept it:

  • Configure and optimize components for big data pipelines that tackle diverse, real-world data science challenges.
  • Build and manage data pipelines for AI automation, including data preprocessing and transformation workflows.
  • Develop robust data workflows to enable analytics, insights generation, and smarter decision-making.
  • Contribute to product development using cutting-edge AI tools and cloud services (with a focus on Ejento and RAG-based solutions).
  • Independently design, build, and launch new data extraction, transformation, and loading (ETL) processes in production environments.
  • Collaborate with cross-functional teams—data scientists, product managers, and clients—to meet evolving data requirements and communicate insights.
  • Design and implement data transfer and integration solutions across a variety of systems.

The skills and expertise you bring:

  • Background in computer science, engineering, mathematics, or equivalent experience—formal degree not required.
  • Understanding of core data engineering concepts: data modeling, ETL, and data warehousing.
  • Familiarity with relational and NoSQL databases (e.g., MySQL, PostgreSQL, MongoDB).
  • Experience with SQL or similar query languages for data manipulation.
  • Awareness of cloud platforms and services for data engineering (e.g., AWS, Azure, Google Cloud).
  • Basic proficiency in Python or similar languages for data manipulation and visualization (e.g., NumPy, Pandas, Matplotlib, Seaborn).
  • Knowledge of version control systems (e.g., Git) for collaborative development.
  • Exposure to AI automation, LLMs, OCR, or document data extraction pipelines.
  • Strong problem-solving skills, analytical thinking, and attention to detail.
  • Ability to learn quickly, adapt to new technologies, and communicate technical concepts effectively.

 

Bonus Points

  • Certifications in cloud or data engineering (e.g., Azure Fundamentals AZ-900, Azure Data Fundamentals DP-900, Azure Data Engineer Associate DP-203).
  • Experience with data processing frameworks (e.g., Apache Spark, Hadoop, AWS Glue).
  • Familiarity with Microsoft Fabric services (Data Factory, Lakehouse, Warehouse, Real-Time Analytics, OneLake).
  • Exposure to data visualization tools (e.g., Tableau, Power BI) or Microsoft Power Platform (Power Automate, Power Apps).
  • Experience with workflow automation tools (e.g., Apache Airflow) or containerization (e.g., Docker, Kubernetes).
  • Knowledge of data serialization formats (e.g., JSON, Parquet, XML, Avro).

 

At Data Science Dojo, we believe the future belongs to those who can learn, adapt, and apply AI to solve meaningful problems. We look for individuals who stay curious and use AI not just as a tool, but as part of how they think and solve problems, improving the quality, speed, and impact of their work, while ensuring solutions are ethical, responsible, and compliant.

Apply Now

You can also call or email us.