
Excel formatted data extraction from HTML document
- or -
Post a project like this3971
$$
- Posted:
- Proposals: 5
- Remote
- #683419
- Awarded
Description
Experience Level: Intermediate
General information for the business: Accountants and Business Advisors
Database management system (DBMS): Microsoft SQL Server
Description of requirements/functionality: Error messages to be highlightd separately
Extra notes: Requirement:
To extract data from an html document into a pre-defined Excel format to include the following data: Date, Name, Address and postcode (address and postcode to be in standard format of address line 1, address line 2, address line 3, address line 4 (city), address line 5 (post code)
Source data:
The source data has a number of variables and is which we think makes it difficult to extract and format the address detail directly from it.
A suggested solution for the address problem is to identify the post code from the source data, cross reference it against a separate post code address data base (that displays in Excel format for address line 1, address line 2 address line 3 etc.) to provide a list of addresses, then identify the house number from the source data and then cross reference this against the list of address to enable selection of the correct pre-formatted address.
The date and name also has to be extracted from the source data.
Source data example:
A typical example of the source data is shown below with the specific data required highlighted:
County Court Judgements
The Register of Judgments, Orders and Fines Regulations 2005
Notice of County Court Judgements
John Smith
A County Court Judgement has been recorded against John Smith, 46 Loch View Gardens, Glenrothes, Fife, KY6 2NP and previously residing at 22 Glen Place, Glenrothes, KY3 5TG, on 18 March 2012, conveying the requirements of section 22(2A) in The Register of Judgments, Orders and Fines Regulations 2005 his details are hereby recovered by the Official Registrar Margaret Drummond of the Scottish Court Service, Saughton House, Broomhouse Drive, Edinburgh, EH11 3XD
Points to note:
There is sometimes, but not always, ‘a previously residing at’ or ‘previously living at’ address which is usually detailed after the current address. We only need the details from the current address. Please also note there is address of the Registrar which can change but which we don’t need.
Database management system (DBMS): Microsoft SQL Server
Description of requirements/functionality: Error messages to be highlightd separately
Extra notes: Requirement:
To extract data from an html document into a pre-defined Excel format to include the following data: Date, Name, Address and postcode (address and postcode to be in standard format of address line 1, address line 2, address line 3, address line 4 (city), address line 5 (post code)
Source data:
The source data has a number of variables and is which we think makes it difficult to extract and format the address detail directly from it.
A suggested solution for the address problem is to identify the post code from the source data, cross reference it against a separate post code address data base (that displays in Excel format for address line 1, address line 2 address line 3 etc.) to provide a list of addresses, then identify the house number from the source data and then cross reference this against the list of address to enable selection of the correct pre-formatted address.
The date and name also has to be extracted from the source data.
Source data example:
A typical example of the source data is shown below with the specific data required highlighted:
County Court Judgements
The Register of Judgments, Orders and Fines Regulations 2005
Notice of County Court Judgements
John Smith
A County Court Judgement has been recorded against John Smith, 46 Loch View Gardens, Glenrothes, Fife, KY6 2NP and previously residing at 22 Glen Place, Glenrothes, KY3 5TG, on 18 March 2012, conveying the requirements of section 22(2A) in The Register of Judgments, Orders and Fines Regulations 2005 his details are hereby recovered by the Official Registrar Margaret Drummond of the Scottish Court Service, Saughton House, Broomhouse Drive, Edinburgh, EH11 3XD
Points to note:
There is sometimes, but not always, ‘a previously residing at’ or ‘previously living at’ address which is usually detailed after the current address. We only need the details from the current address. Please also note there is address of the Registrar which can change but which we don’t need.
John C.
100% (3)Projects Completed
2
Freelancers worked with
2
Projects awarded
40%
Last project
7 Aug 2015
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies