
A script to scrape page data and transform it to the required format
€50 (approx. $59)
- Posted:
- Proposals: 4
- Remote
- #4149282
- PRE-FUNDED
- Awarded
Description
Experience Level: Entry
Hi,
Here is the page to scrape data from: https://docs.sumsub.com/docs/global-supported-documents
You need to write a script that scrapes the data and transforms it to follow the structure of the example file, which I'll share via DM.
For example:
{
  countryName: 'Afghanistan',
  docs: [
    {
      name: 'پاسپورټ (Passport) – full spread',
      localName: 'p',
      innerDocs: false,
    },
    {
      name: 'دیپلوماتیک پاسپورت (Diplomatic passport) – full spread',
      localName: 'p',
      innerDocs: false,
    },
    {
      name: 'پېژندپاڼه (ID card) – back side required',
      localName: 'p',
      innerDocs: false,
    },
    {
      name: 'دچلولواجازت پاڼه (Driver license) – back side are required',
      localName: 'p',
      innerDocs: false,
    },
  ],
},
Or, if there are multiple levels of data, structure it as follows:
{
  countryName: 'Belarus',
  docs: [
    {
      name: 'Passport – full spread',
      localName: 'p',
      innerDocs: false,
    },
    {
      name: 'ID Card (ІДЭНТЫФІКАЦЫЙНАЯ КАРТКА) – back side required',
      innerDocs: ['Army / Military ID Card – front side'],
    },
    {
      name: 'Driving License – front side',
      localName: 'p',
      innerDocs: false,
    },
    {
      name: 'Residence Permit – back side required',
      localName: 'p',
      innerDocs: false,
    },
  ],
},
Note that the object "ID Card (ІДЭНТЫФІКАЦЫЙНАЯ КАРТКА) – back side required" contains an "innerDocs" array instead of `innerDocs: false`.
The expected result is a working script that I can run myself.
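The transformation step described above could be sketched roughly as follows. This is only an illustration under stated assumptions: the input shape (`country`, `items`, `text`, `children`) and the function names `toDoc` and `toCountryEntry` are hypothetical, since the brief does not show the page's markup or the example file. Fetching and parsing the page are deliberately left out. Omitting `localName` from nested entries is inferred from the Belarus example and may not match the real example file.

```javascript
// Hypothetical intermediate shape, produced by whatever HTML parser is used:
//   { country: string, items: [{ text: string, children: string[] }] }
// The real page structure is not given in the brief, so parsing is not shown.

/** Convert one scraped list item into an entry of the `docs` array. */
function toDoc(item) {
  // Trim sub-items and drop empty strings left over from whitespace nodes.
  const children = (item.children || []).map((c) => c.trim()).filter(Boolean);
  if (children.length > 0) {
    // Nested case: mirrors the Belarus example, which carries no `localName`.
    return { name: item.text.trim(), innerDocs: children };
  }
  // Flat case: mirrors the Afghanistan example.
  return { name: item.text.trim(), localName: 'p', innerDocs: false };
}

/** Convert one scraped country block into the target structure. */
function toCountryEntry(block) {
  return {
    countryName: block.country.trim(),
    docs: block.items.map(toDoc),
  };
}

// Sample input built from the Belarus example in the brief.
const sample = {
  country: 'Belarus',
  items: [
    { text: 'Passport – full spread', children: [] },
    {
      text: 'ID Card (ІДЭНТЫФІКАЦЫЙНАЯ КАРТКА) – back side required',
      children: ['Army / Military ID Card – front side'],
    },
  ],
};

const result = toCountryEntry(sample);
console.log(JSON.stringify(result, null, 2));
```

`JSON.stringify` emits double-quoted JSON rather than the single-quoted object-literal style shown in the brief, so a final formatting pass may be needed if the output has to match the example file exactly.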

Artem K.
- 100% (32)
- Projects Completed: 12
- Freelancers worked with: 8
- Projects awarded: 100%
- Last project: 28 Jul 2025
- United Kingdom