Task – Data Extraction, data enrichment, accurately merge and make it presentable
Client Requirements:
The client provided three Franchise Disclosure Documents (FDDs) from well-known fast-food chains. Additionally, they shared an incomplete Excel sheet containing 3,392 franchisees with names and contact details. My task was to:
✅ Extract franchisee details from the PDFs (total: 7,126 franchisees).
✅ Identify and add the missing 3,734 franchisees to complete the dataset.
Challenges & Solutions:
🔹 Inconsistent Formats – The tables in the PDFs had varying structures, requiring careful extraction and formatting.
🔹 Missing Addresses – Some records lacked complete addresses, so I cross-referenced available data.
🔹 Data Matching – Used VLOOKUP and Excel formulas to accurately merge the extracted data with the client’s existing sheet.
Final Delivery:
📌 A fully completed Excel database with all 7,126 franchisees, ensuring accuracy and consistency.
📌 Cleaned and formatted data, making it easy for the client to use.