
Data Matching Index for Business Classification System
- or -
Post a project like this1544
£479(approx. $626)
- Posted:
- Proposals: 21
- Remote
- #3339411
- OPPORTUNITY
- PRE-FUNDED
- Awarded
Cloud (GCP/Azure) | ETL | Spark (PySpark/Scala) | Python | Data Science | Machine Learning
TOP PRO - Excel, Word, PowerPoint, VBA, Google Sheet, Outlook, Access, Database, Scripting

Trusted CRM Consultant and Data Automation Expert. Automating your business, designing your future!

Researcher, Data base building, Lead generation, E Mail database, Data Entry, Research base writing, Market Research,

Lead Generation Expert I Expert Virtual Assistant I Web Researcher I Contact List Building Expert

Helping ambitious, self-motivated business owners achieve time and cost savings by automating spreadsheets to allow for pro-active data-led decision making.
Software Engineer | Java Developer | PHP Developer | Data Scraping | Web Apps. | Desktop Apps | BI Reports and Dashboards
31817471264463238543626525472642406516811183780413682172454441123738316416492371180
Description
Experience Level: Expert
I have a master table of 2800 unique business classifications and approximately 250,000 unique business classifications on another table. I would like back a consolidated table of the 250,000 mapped to the 2800 classifications. Basically we have around 2 million businesses in our database and each has been invited to clarify their main line of business activity (what they do). End result is that we have around 250,000 different business classifications. We've come up with a user friendly list of 2800 classifications and now want to align the 250,000 top to the most appropriate classification within the 2800 list.
Suggest: As the two tables are very different its going to take fuzzy string matching to match and then select the most appropriate and strongest confidence. Stage(1) would obvious be looking for an exact match between the two classification systems, then (2) a string search for the full classification match, then (3) Looking for all the words in file 2 within file (1) in the same order, then (4) all of the words in any order (5) several words in correct order (6) several words in any order (7) less words in correct order, (8) less words in any order (9) even less words in correct order (10) even less words in any order (11) single word
Thanks
Milton
Suggest: As the two tables are very different its going to take fuzzy string matching to match and then select the most appropriate and strongest confidence. Stage(1) would obvious be looking for an exact match between the two classification systems, then (2) a string search for the full classification match, then (3) Looking for all the words in file 2 within file (1) in the same order, then (4) all of the words in any order (5) several words in correct order (6) several words in any order (7) less words in correct order, (8) less words in any order (9) even less words in correct order (10) even less words in any order (11) single word
Thanks
Milton
Milton E.
99% (79)Projects Completed
35
Freelancers worked with
46
Projects awarded
57%
Last project
7 Feb 2024
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies