We have developed a Python-based data pipeline for scraping and processing audio files. The pipeline downloads .wav files via multiple API calls, but due to API rate limits and long processing times, we need to scale the pipeline efficiently without managing servers. Project Goal: Deploy the Python scraping pipeline using AWS Fargate to parallelize execution across multiple serverless containers, efficiently process data, and upload results to Amazon S3—all while eliminating the need for direct EC2 instance management. Key Responsibilities: AWS Fargate Setup & Scaling: Deploy containerized scraping tasks with Fargate, allowing for dynamic scaling. Containerization (Docker): Package the Python data pipeline into a lightweight Docker container for deployment. Task Orchestration (ECS or Batch): Configure AWS ECS (Elastic Container Service) or AWS Batch to efficiently distribute and manage scraping jobs. Storage & Data Management: Optimize .wav file uploads to Amazon S3 and manage task execution logs. Security & Networking: Ensure containers have proper IAM roles, security groups, and VPC configurations for API access. Queue-Based Task Distribution: (Optional) Integrate AWS SQS or EventBridge to queue and trigger scraping tasks efficiently.
Keyword: Event Manager
Price: $60.0
Docker Amazon Web Services Amazon EC2 Amazon S3 AWS Fargate Python AWS Lambda DevOps
We are a U.S.-based energy startup actively building natural gas and solar power plants, as well as establishing domestic manufacturing facilities for solar panels and gas turbines in the USA. Our power generation projects began in 2024, but we have not yet focused on p...
View JobWe are seeking a creative freelance social media content creator to attend an onsite educational event and capture engaging iPhone videos for less than one hour. The ideal candidate will have experience in video creation for social media and a keen eye for detail. You w...
View JobHydrologic modeling for temporary bypass of the A-5 Canal simulating surface water flow to determine how diverted canal flows will move through the bypass channel under various storm events. This includes hydrologic modeling to estimate runoff volumes using rainfall and...
View Job