Need 1500+ HTML Files Parsed for Content and XML Generated


$50.00
Hourly: $50.00 - $82.00

I need someone to take a set of HTML pages that were built with Microsoft FrontPage and extract three fields from each one:

1. Title
2. Content
3. Date

The pages all share the same basic HTML format, and there are 1500+ files. The job is to process all of the files, clean them up (remove the HTML tags), and deliver the title, content, and date for each page. Example files: http://www.tombutt.com/forum/2003/031230.htm and http://tombutt.com/forum/2013/131230.htm
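To make the scope concrete, here is a minimal Python sketch of the extraction described above, using only the standard library. Several things in it are assumptions, not facts from this posting: that the title lives in each page's `<title>` tag, that file names such as `031230.htm` encode the date as YYMMDD, and that the output should use the hypothetical element names `pages`, `page`, `title`, `date`, and `content`. The real pages would need to be inspected before relying on any of this.

```python
import re
import xml.etree.ElementTree as ET
from datetime import datetime
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the <title> text and the remaining visible text, dropping all tags."""

    SKIP = {"script", "style"}  # contents of these elements are not page text

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title_parts = []
        self.body_parts = []
        self._in_title = False
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._in_title:
            self.title_parts.append(data)
        elif not self._skip_depth:
            self.body_parts.append(data)


def date_from_filename(name):
    """Assumption: names such as 031230.htm encode the date as YYMMDD."""
    match = re.fullmatch(r"(\d{6})\.html?", name)
    if not match:
        return ""
    try:
        return datetime.strptime(match.group(1), "%y%m%d").date().isoformat()
    except ValueError:
        return ""  # six digits that do not form a valid date


def collapse(parts):
    """Join text fragments and squeeze runs of whitespace into single spaces."""
    return " ".join(" ".join(parts).split())


def page_to_record(name, html):
    """Turn one page's file name and HTML source into a title/content/date dict."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return {
        "title": collapse(parser.title_parts),
        "content": collapse(parser.body_parts),
        "date": date_from_filename(name),
    }


def records_to_xml(records):
    """Serialize records to XML; the element names are hypothetical."""
    root = ET.Element("pages")
    for record in records:
        page = ET.SubElement(root, "page")
        for field in ("title", "date", "content"):
            ET.SubElement(page, field).text = record[field]
    return ET.tostring(root, encoding="unicode")
```

Running it over the full set would mean reading each file, calling `page_to_record(path.name, text)`, and passing the collected records to `records_to_xml`. Text fragments are joined with spaces, so a word split by an inline tag would come out as two words; whether that matters depends on how the pages are actually marked up.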

Keyword: Data Cleaning

Price: $50.00

HTML XML

 
