Python Keras/Tensorflow multiclass string classification
- or -
Post a project like this603
£100(approx. $125)
- Posted:
- Proposals: 7
- Remote
- #3655420
- Awarded
PPH #1 Service Provider in Development & IT : Wordpress|Magento|React Native|Mobile App Development|Angular|Node.js
Mohali
Experienced Graphics designer |Experienced WordPress Full Stack|SharePoint Expert|Data Entry Team |flyer/business card/Brochure||2DAnimation||Photoshop work|
Islamabad
WordPress Expert✮Shopify Expert✮Graphic Designer✮AutoCAD 2D & 3D✮CV Writer & Designer✮Fullstack developer
Rawalpindi
365356212834212878002359825249900634098673636573
Description
Experience Level: Entry
Hi,
We want to classify a 35 million string collection.
Some string cannot be classified using a RegEx, so we want to use ML to classify the values. The buckets are:
- Name -- Human first name or Full name. It must support different languages (English, Spanish, French...) i.e: Christina Charleston OR Javi Garcia OR Chloé Monet
- UserName -- Used internnaly in companies -- i.e: kaquinn, jamiesona, delaneyj
- Copyright -- RegEx. -- i.e: Copyright 1999 Adobe Systems Incorporated, (and variants)
- Email -- RegEx.
- IDs -- RegEx (passport Numbers, Driving licenses, ID...) Fixed formats available on the web.
- Unclassified -- Classification confidence is lower than a fixed threshold.
Interface: It will be hosted on a API service.
Input: String
Output: Classification ID
If you have any other question, please feel free to ask :)
We want to classify a 35 million string collection.
Some string cannot be classified using a RegEx, so we want to use ML to classify the values. The buckets are:
- Name -- Human first name or Full name. It must support different languages (English, Spanish, French...) i.e: Christina Charleston OR Javi Garcia OR Chloé Monet
- UserName -- Used internnaly in companies -- i.e: kaquinn, jamiesona, delaneyj
- Copyright -- RegEx. -- i.e: Copyright 1999 Adobe Systems Incorporated, (and variants)
- Email -- RegEx.
- IDs -- RegEx (passport Numbers, Driving licenses, ID...) Fixed formats available on the web.
- Unclassified -- Classification confidence is lower than a fixed threshold.
Interface: It will be hosted on a API service.
Input: String
Output: Classification ID
If you have any other question, please feel free to ask :)
Javier M.
100% (9)Projects Completed
9
Freelancers worked with
8
Projects awarded
61%
Last project
24 Feb 2024
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies