Online Data Merge Tool
- or -
Post a project like this5114
£750(approx. $943)
- Posted:
- Proposals: 4
- Remote
- #23353
- Awarded
Description
Experience Level: Expert
I am looking for a web based system to be hosted on 123-reg (please see their server specifications for details) to be an online data merging tool. The chosen developer will be given beanstalk SVN Access to our development server.
The aim is that I upload a CSV of customer records. The CSV will contain multiple instances of unique customer records. The system will match duplicated records (based on email address) and merge the two records together.
The system will then output a CSV for download with no duplicate records.
Specifications/user flow as follows ...
1. Upload CSV to a page.
2. System merges multiple rows into unique rows matching by email address
3. On successful merge user is offered a link to the new CSV
I expect the CSVs to be about 100,000 rows in length.
I have attached example CSVs
Input = the file to be uploaded
Output = the file I would like the system to output.
I want the unique identifier/index field to be email.
After the merge their should be one row per email address.
These rules apply to all fields EXCEPT Attributes.
If a field is populated the row with the most recent date populates that field, overwriting the contents.
If a field is populated and the row with the most recent date is blank, the field shall remain as is ...
... i.e. a populated field should not be overwritten by a blank field, even when the blank field is newer.
If a field is blank the populated row with the most recent date populates that field.
If a field is blank and there is no data to populate that field it remains blank.
These rules apply to the Attributes field only
Wherever two rows are merged the Attributes field should become merged from both fields.
E.g. ||Hotel|| + ||Golf Member|| = ||Hotel||||Golf Member||
The aim is that I upload a CSV of customer records. The CSV will contain multiple instances of unique customer records. The system will match duplicated records (based on email address) and merge the two records together.
The system will then output a CSV for download with no duplicate records.
Specifications/user flow as follows ...
1. Upload CSV to a page.
2. System merges multiple rows into unique rows matching by email address
3. On successful merge user is offered a link to the new CSV
I expect the CSVs to be about 100,000 rows in length.
I have attached example CSVs
Input = the file to be uploaded
Output = the file I would like the system to output.
I want the unique identifier/index field to be email.
After the merge their should be one row per email address.
These rules apply to all fields EXCEPT Attributes.
If a field is populated the row with the most recent date populates that field, overwriting the contents.
If a field is populated and the row with the most recent date is blank, the field shall remain as is ...
... i.e. a populated field should not be overwritten by a blank field, even when the blank field is newer.
If a field is blank the populated row with the most recent date populates that field.
If a field is blank and there is no data to populate that field it remains blank.
These rules apply to the Attributes field only
Wherever two rows are merged the Attributes field should become merged from both fields.
E.g. ||Hotel|| + ||Golf Member|| = ||Hotel||||Golf Member||
Lawrence A.
0% (0)Projects Completed
6
Freelancers worked with
3
Projects awarded
64%
Last project
4 Nov 2010
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies