Fullstack AI Engineer

HR POD - Hiring Talent Globally · Lahore, Pakistan, Pakistan

Location
Lahore
Job Type
Full-time
Posted
May 24, 2026

Job Description

Requirements:

  • 4 years of experience as a fullstack or backend engineer
  • Strong proficiency in Python and JavaScript/TypeScript
  • Experience with FastAPI / Django / Node.js and React / Next.js
  • Solid understanding of distributed systems and async architectures
  • Hands-on experience deploying LLMs such as GPT-4/4.1, Claude, LLaMA, Mistral, Mixtral
  • Experience serving models using vLLM, Triton, TGI, or similar frameworks
  • Strong understanding of transformer models and inference trade-offs
  • Experience with embeddings, vector search, and RAG architectures
  • Experience with AWS, GCP, or Azure (GPU workloads preferred)
  • Strong Docker and Kubernetes experience
  • Familiarity with CI/CD pipelines for ML systems
  • Experience with observability tools (Prometheus, Grafana, OpenTelemetry)
  • Experience with multimodal AI (audio, video, image models)
  • Experience optimizing LLM inference...

Ready to Apply?

Submit your application for Fullstack AI Engineer at HR POD - Hiring Talent Globally

Apply Now