Description: We are running a YOLO-based object detection system deployed on a Google Cloud Platform (GCP) Kubernetes setup with a worker-handler architecture. Each node is currently assigned two CPU cores. During normal operations, when 50 users are active, CPU usage peaks around 1200 millicores (mc), raising concerns about scalability. We aim to support up to 15,000 concurrent users, with the goal of keeping CPU usage under 500mc per instance, without compromising the accuracy and reliability of detections. Project Goals: We are seeking an experienced AI/Machine Learning/DevOps engineer to analyze our current system and implement optimizations. The goal is to reduce CPU load while maintaining accurate object detection and classifications. Key Objectives: Analyze Current Architecture:...
Keyword: Machine Learning
Delivery Time: 5 days left days
Price: $216.0
AI (Artificial Intelligence) HW/SW Infrastructure Architecture Machine Learning (ML)