XCEL Merge/consolidate data and remove duplicates from 2500 rows
- or -
Post a project like this2170
$$
- Posted:
- Proposals: 13
- Remote
- #1979036
- Awarded
Web Developer | Web Design | Admin Support | Virtual Assistance | WordPress | Magento | eBay
Aurangabad
Virtual Assistant, Excel programmer, Customer Support Agent, EN-GR/GR-EN Translator
Kornos
1394015715225745411115326612246901228050126714015365471675070211564021597962167377
Description
Experience Level: Intermediate
Estimated project duration: 1 day or less
I need someone with fairly advanced EXCEL skills.
I have a spreadsheet which has a list of venues - with around 2500 rows.
These rows have usually 2 or 3 entries referring to the same venue.
1. IDENTIFY DUPLICATES
Certain columns (e.g. venue name, venue number) will be the same, and this is how you will most accurately be able to identify when a row is a duplicate. This is your Key Data for identifying duplicates.
Some of the duplicates have a different venue number, but the same name.
Others may have the same venue number, but a slightly different name.
There may be a minority of duplicates where both the venue name and number are different, and the only way to identify that they are in fact duplicates will be to check the other data in the row (usually postcode).
2. PROPAGATE DATA TO MAKE DUPLICATES MATCH
Some rows will have e.g. columns A B and C filled. Others will have e.g. A, D and E.
I want each entry to have A, B, C, D and E filled out. (note, there are more than 5 columns!)
whenever data exists for ANY column, that data should be propagated for EVERY duplicate row, within that column.
3. IDENTIFY CONFLICTS
Highlight in orange any venues numbers where they have the same name but a different venue number.
Highlight in orange any data which cannot be propagated amongst duplicates because there are duplicate entries with different data in these columns. (mostly this will not be the case).
4. DELETE ANY DUPLICATES WHERE POSSIBLE
Duplicates will not have any conflicts to be resolved.
CONDITIONS:
This is not a manual entry job. It must use reliable formulae to ensure consistency.
All people wishing to bid for the job must include the word biscuit in the first line.
I would hope this job would take no more than an hour or two for someone with the right skills.
Please refer to the attached images, one showing the before, and one showing the after (i.e. before duplicates have been deleted)
I'm not an excel expert (more of a dabbler), but I understand roughly how functions work, so if you want to explain roughly how you intend to go about it, that would be helpful.
I have a spreadsheet which has a list of venues - with around 2500 rows.
These rows have usually 2 or 3 entries referring to the same venue.
1. IDENTIFY DUPLICATES
Certain columns (e.g. venue name, venue number) will be the same, and this is how you will most accurately be able to identify when a row is a duplicate. This is your Key Data for identifying duplicates.
Some of the duplicates have a different venue number, but the same name.
Others may have the same venue number, but a slightly different name.
There may be a minority of duplicates where both the venue name and number are different, and the only way to identify that they are in fact duplicates will be to check the other data in the row (usually postcode).
2. PROPAGATE DATA TO MAKE DUPLICATES MATCH
Some rows will have e.g. columns A B and C filled. Others will have e.g. A, D and E.
I want each entry to have A, B, C, D and E filled out. (note, there are more than 5 columns!)
whenever data exists for ANY column, that data should be propagated for EVERY duplicate row, within that column.
3. IDENTIFY CONFLICTS
Highlight in orange any venues numbers where they have the same name but a different venue number.
Highlight in orange any data which cannot be propagated amongst duplicates because there are duplicate entries with different data in these columns. (mostly this will not be the case).
4. DELETE ANY DUPLICATES WHERE POSSIBLE
Duplicates will not have any conflicts to be resolved.
CONDITIONS:
This is not a manual entry job. It must use reliable formulae to ensure consistency.
All people wishing to bid for the job must include the word biscuit in the first line.
I would hope this job would take no more than an hour or two for someone with the right skills.
Please refer to the attached images, one showing the before, and one showing the after (i.e. before duplicates have been deleted)
I'm not an excel expert (more of a dabbler), but I understand roughly how functions work, so if you want to explain roughly how you intend to go about it, that would be helpful.
Greg D.
100% (15)Projects Completed
25
Freelancers worked with
22
Projects awarded
20%
Last project
10 Aug 2022
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies