Job Opportunities | USA | Careers | APEX-2000 Inc - A leading Global Consulting and IT services company

Requirement ID:	91144
Job Title:	Gen AI Architect
Job Type:	Contract
Duration:	6 - 9 months
Location:	Chicago, IL
Job Description:	"8+ years of total software engineering and technical architecture experience 3+ years of hands-on experience designing and deploying enterprise Gen AI or agentic workflows into production environments Experience conducting client-facing technical workshops, architecting solutions for RFPs, and translating complex business requirements into robust designs Programming: Advanced Python (Pandas, PyTorch, Hugging Face, Scikit-Learn) Frameworks: LangChain, LlamaIndex, LangGraph, CrewAI, AutoGen Cloud Platforms: AWS (Bedrock), Azure (Azure OpenAI), or GCP (Vertex AI) Databases: Vector DBs (ChromaDB, Pinecone, FAISS, Weaviate) Microservices: FastAPI, Flask, Spring Boot, REST APIs DevOps / MLOps: Docker, Kubernetes, CI/CD pipelines, MLflow, model monitoring tools " "Key Responsibilities 1. Architecture & Solution Design System Design: Define end-to-end architectures for complex Gen AI solutions, including Retrieval-Augmented Generation (RAG) pipelines, multimodal models, and autonomous multi-agent systems Reusable Patterns: Establish reusable architectural patterns, blueprint standards, and engineering templates to accelerate AI adoption and eliminate redundant engineering efforts Integration Strategy: Design secure, cloud-native microservices and APIs (e.g., FastAPI, REST) to seamlessly connect AI orchestrators with enterprise data backends and legacy systems 2. Model & Agent Strategy Orchestration & Agentic Systems: Architect and build multi-agent, goal-driven, autonomous AI systems utilizing frameworks such as LangGraph, CrewAI, AutoGen, or LlamaIndex Model Lifecycle & Strategy: Lead the evaluation, selection, and fine-tuning strategy for proprietary APIs (e.g., OpenAI, Anthropic, Gemini) and open-source models (e.g., Llama, Mistral) based on performance, latency, and cost Prompt Engineering: Standardize advanced prompt engineering frameworks (e.g., chain-of-thought, few-shot prompting) to ensure deterministic and scalable model behavior 3. LLMOps, Infrastructure, & Data Infrastructure Sizing: Define cloud infrastructure scaling models, GPU/TPU sizing strategies, and orchestration layouts (utilizing Docker, Kubernetes/OpenShift) Vector Databases: Architect robust information retrieval utilizing vector search databases (e.g., Pinecone, Milvus, Weaviate, FAISS) Operational Pipelines: Implement end-to-end GenAIOps/LLMOps pipelines for automated model deployment, performance monitoring, drift detection, and observability 4. Governance, Security, & Compliance Responsible AI: Embed governance, security, and compliance guardrails by design into all AI architectures, ensuring protection against prompt injection, data leakage, and PII exposure Quality Assurance & Testing: Implement evaluation frameworks to systematically test LLM accuracy, reduce hallucinations, and validate solution architectures " Role Descriptions: Essential Skills: Gen AI Architect Desirable Skills: Keyword: Skills: AI & Gen AI - Products & Tools Experience Required: 6-8
	Apply Now

Apply Now