TIME SENSITIVE PROJECT
Description: I have approximately 200 reports in Excel and PDF format. These reports contain tables or structured/semi-structured data, but the formatting, field names, and file naming conventions vary significantly across files. I'm looking for a skilled data analyst or Python developer who can help me compare these reports and identify which ones are at least 60% similar in content. This will require fuzzy matching techniques and possibly data normalization.
Responsibilities:
- Extract data from PDF and Excel reports (some may require OCR or table parsing).
- Clean and normalize the data across all files.
- Compare the reports and determine which are ≥60% similar based on data content.
- Deliver a summary of matched report pairs or groups with similarity scores....
Keyword: Data Processing
Delivery Time: 2 days left
Price: $481.0
Data Mining, Data Processing, Excel, Python, Software Architecture
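The comparison step in the report-matching listing above could be sketched roughly as follows. This is only an illustration under stated assumptions: it assumes each report has already been extracted to plain text (the PDF/OCR and Excel parsing are not shown), and it swaps in token-set Jaccard similarity as one simple stand-in for "fuzzy matching". The function names `normalize`, `jaccard`, and `similar_pairs` are hypothetical, not part of the listing.

```python
import re
from itertools import combinations


def normalize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similar_pairs(reports, threshold=0.6):
    """Return (name_a, name_b, score) for every pair at or above threshold.

    `reports` maps a report name to its already-extracted text
    (an assumption of this sketch).
    """
    tokens = {name: normalize(text) for name, text in reports.items()}
    pairs = []
    for x, y in combinations(sorted(tokens), 2):
        score = jaccard(tokens[x], tokens[y])
        if score >= threshold:
            pairs.append((x, y, round(score, 2)))
    return pairs
```

With about 200 reports this compares roughly 19,900 pairs, which is small enough for a plain nested comparison. A real solution would likely need field-name mapping and numeric normalization before this step.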
I am looking for an experienced marketing expert to help me build my brand on social media. Requirements: - Engaging content strategy - Attractive graphic design - Persuasive copywriting skills - Experience with paid advertising on ...
I need a Power BI report to monitor training across multiple depots. The report should track: - Training completion rates - License expiry dates - Training checklist compliance. Visualization of training completion rates should be in bar charts. The data will be sourc...