Task – Data Extraction, data enrichment, accurately merge and make it presentable
Client Requirements:
The client provided three Franchise Disclosure Documents (FDDs) from well-known fast-food chains. Additionally, they shared an incomplete Excel sheet containing 3,392 franchisees with names and contact details. My task was to:
β
Extract franchisee details from the PDFs (total: 7,126 franchisees).
β
Identify and add the missing 3,734 franchisees to complete the dataset.
Challenges & Solutions:
πΉ Inconsistent Formats β The tables in the PDFs had varying structures, requiring careful extraction and formatting.
πΉ Missing Addresses β Some records lacked complete addresses, so I cross-referenced available data.
πΉ Data Matching β Used VLOOKUP and Excel formulas to accurately merge the extracted data with the clientβs existing sheet.
Final Delivery:
π A fully completed Excel database with all 7,126 franchisees, ensuring accuracy and consistency.
π Cleaned and formatted data, making it easy for the client to use.