Regex Projects
Looking for freelance Regex jobs and project work? PeoplePerHour has you covered.
Past "Regex" Projects
NGINX Developer Required
I need someone who can write complex regex logic for redirects using NGINX. I am also looking for this person to implement NGINX load balancing. Please get in touch with me for more information and provide your credentials. You will not be able to make changes on the live environment - you will need to provide the files to me for testing and approval. Budget to be determined based on requirements being discussed.
Sign up form validation
I have a complicated signup page that has a lot of input validation. The validation uses both regex patterns and calls to a backend DB via fetch. I am looking for someone to do just the HTML/JS on a single page.
Language processor (regex, grammar and DFAs and NFAs )
I need help answering some questions. More details will be shared.
ChordPro Conversion from PDF
Hi All, I have around 100 PDF files that I have converted (roughly) to text files. The first job would be to tidy them up and remove obvious conversion errors. The second job would be to insert the primary ChordPro commands to convert from text to ChordPro. Someone used to working with a multi-file text editor (e.g. EditPad) and familiar with RegEx would probably find the ChordPro conversion easy, as all files could be done at once.
EG:
DON’T LOOK BACK IN ANGER [C] Oasis Key C 82bpm Starting note [E] Intro [C] [F] [C] [F] [C] Slip inside the [G] eye of your [Am] mind,
Needs to become:
{title:Don't Look Back In Anger}
{subtitle:Oasis}
#Compiled by Bob Harding
#Modified 14/01/2017
{comment:Starting note E}
{tempo:82}
{Key:C}
{comment:}
{duration:4:00}
{time:4/4}
{metronome:82}
{tag:Book 4}
{book:Book 4}
{comment:Intro}
&gray:[C] [F] [C] [F]
{comment:START}
[C] Slip inside the [G] eye of your [Am] mind
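As an illustration of how regex could pull the metadata out of the header in the example above, here is a minimal Python sketch. The function name `header_directives` is hypothetical, and the patterns assume the header always writes the key as "Key X", the tempo as "NNbpm" and the starting note as "Starting note [X]", as in the one sample given; real files may vary.

```python
import re

def header_directives(header: str) -> list[str]:
    """Build ChordPro directives from metadata found in a song header.

    Assumes the layout of the single sample in the brief:
    "... Key C 82bpm Starting note [E]".
    """
    key = re.search(r"\bKey\s+([A-G][#b]?m?)(?!\w)", header)
    tempo = re.search(r"\b(\d{2,3})\s*bpm\b", header, flags=re.IGNORECASE)
    start = re.search(r"Starting note\s+\[([A-G][#b]?)\]", header)

    directives = []
    if start:
        directives.append(f"{{comment:Starting note {start.group(1)}}}")
    if tempo:
        directives.append(f"{{tempo:{tempo.group(1)}}}")
    if key:
        directives.append(f"{{Key:{key.group(1)}}}")
    return directives
```

Fields that do not appear in the source text (duration, time signature, book) would still have to be supplied by hand or from a lookup table.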
Make a small scraper in vb.net (mandatory)
I am a VB.NET coder with 16 years' experience, and I want you to build a VB.NET program because I lack the time. I am looking for someone I can hire repeatedly for years to come and who can code at an OK price, so this is not a one-time project: if you can do it, I will hire you again in the future. As a test, I want a program for myself that scrapes prices and other details from a website into an Excel file. This is the site (you can right-click and translate from Romanian to your language in your browser): https://www.imoradar24.ro/apartamente-de-vanzare/iasi-iasi It must be done in VB.NET; I am not interested in Python, C# or others. It is important that you know regex to extract the details. These are the things that have to be scraped: https://www.screencast.com/t/0vDT0uBJhaWW plus the link to each ad. Then follow each link and scrape more details (https://www.screencast.com/t/dG1lzDnj) while downloading the pictures of the ad. Use regex to get the details from the pages. Get all the details on both the search page and the ad page (with the price history and map if you can; if not, it's OK without them). After putting the details in Excel, one ad per row, the software should take a screenshot of the whole ad page if possible (if not, it's OK). The details under "istoric pret" (price history) are also optional. The software should be able to run in the background, minimized, without using the keyboard and mouse. It should put the whole source code of each page in the last column of the Excel file. I must be able to schedule scraping every x days, so that it starts by itself, scrapes the prices again for each ad link, and puts the price in a new column if it differs from the price x days ago (also add the date when the new price was scraped). Prices from other sites that appear in the column on the right of each ad should be put in columns along with the link to the external site.
Please bid the lowest amount for which you could do this project.
Implement SIP/VoIP client to detect incoming calls
Your task is to implement a Java-based method/function that detects incoming SIP/VoIP calls and opens a browser with a URL. The URL contains some query parameters, such as the caller's number. It will be a simple UI-less (command line only) desktop app that listens for calls and triggers a browser-based URL to show.
The listener shall be configurable, meaning:
- what URL to call
- which phone numbers to ignore
- which phone numbers to track (regEx)
Please recommend whether to use JTAPI or SIP-RI to detect phone calls. You will get a SIP/VoIP account to implement and test incoming calls (if you do not have your own).
Your task is to implement Java classes that give access to the details of incoming calls as a listener. Implement JUnit test cases that show how the implemented listener has to be configured (regex, ignore numbers, URL).
The Q-question is a formula, place result into the bid.
A good starting point for SIP starters may be: https://aebb.es/gn2wn https://aebb.es/ne-ug
You need to implement for:
- linux & windows (a basic java best practice and requirement, ensure system independent development)
requirements: only call from JUnit (no UI)
What is NOT needed:
- a UI (not required, implement a JUnit test to call your functions)
- a service architecture (like spring or JEE)
- any persistence
Milestone MS1:
- implement a main()
- register a sip listener
- trace all incoming calls to system.out
MS2:
- implement an app (no UI), which runs in the background and system tray
- which can be configured
- which uses regex to "match" only on specific numbers
What are our requirements?
- your code passes checkstyle, pmd and spotbugs (we will share a git repo with eclipse settings with you)
- JDK17
- maven
- 24/8 formula
- create a model class representing the input of your function
- create a service class implementing the logic
- create a unit test, which tests the service class
- we do NOT need a UI, we only need the model + service method to access the logic via JUnit
- if you need libs, selenium or apache commons are fine. Other libs NEED prior clearance
- the runtime is JRE (no JavaEE nor Spring-container)
- delivery in our git
Outlook
- we want to have a prototype/POC to fiddle around with the possibilities in our environment
- after the prototype phase, we will have more tasks to implement towards a full app, so preferably you/your team will also get these tasks
- if you do a good job supporting our team, we are open to integrating you into regular work, and we will share more tasks about SIP implementations with you
What is our budget? We do not disclose our budget or planned hourly rate. Offer us your best bid.
Your bid? Place your best hourly bid. We do not want to negotiate with you anymore after getting in touch with you, so place your best bid to save time here.
Hiring? We hire only open book bidders. No open book estimate results in no hiring. You get a task, you estimate cleanly, you provide the estimate, we review and maybe ask questions to clarify our understanding, you get a confirmation of the budget, we agree on milestones for it (at least 2: 1x implementation, 1x successful test), and you deliver within the agreed budget.
Communication: Do not wait for our availability here. Just answer, just ask or just reply.
Outlook: We want to implement a bigger communication application after the prototype is implemented. The candidate for the prototype will also be our preferred candidate for the bigger app.
Python Keras/Tensorflow multiclass string classification
Hi, We want to classify a collection of 35 million strings. Some strings cannot be classified using a RegEx, so we want to use ML to classify the values. The buckets are:
- Name -- Human first name or full name. It must support different languages (English, Spanish, French...), e.g. Christina Charleston OR Javi Garcia OR Chloé Monet
- UserName -- Used internally in companies, e.g. kaquinn, jamiesona, delaneyj
- Copyright -- RegEx, e.g. Copyright 1999 Adobe Systems Incorporated (and variants)
- Email -- RegEx.
- IDs -- RegEx (passport numbers, driving licenses, ID...). Fixed formats available on the web.
- Unclassified -- Classification confidence is lower than a fixed threshold.
Interface: It will be hosted on an API service. Input: String. Output: Classification ID.
If you have any other questions, please feel free to ask :)
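The split described above (regex for the fixed-format buckets, ML for the rest, a confidence threshold for "Unclassified") could be sketched roughly as follows. Everything here is an assumption for illustration: the two patterns are deliberately loose, `predict` stands in for whatever Keras/TensorFlow model is eventually trained, and the 0.8 threshold is an arbitrary placeholder.

```python
import re

# Loose illustrative patterns, not production-grade validators.
EMAIL = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")
COPYRIGHT = re.compile(r"^(?:copyright\b|\(c\)|©)", re.IGNORECASE)

def rule_bucket(value: str):
    """Return a bucket name if a regex rule applies, else None."""
    text = value.strip()
    if EMAIL.match(text):
        return "Email"
    if COPYRIGHT.match(text):
        return "Copyright"
    return None

def classify(value: str, predict, threshold: float = 0.8) -> str:
    """Try the regex rules first, then fall back to the model.

    `predict` is a hypothetical callable returning (label, confidence).
    """
    bucket = rule_bucket(value)
    if bucket is not None:
        return bucket
    label, confidence = predict(value)
    return label if confidence >= threshold else "Unclassified"
```

Running the cheap rules first would keep the model for the Name and UserName buckets, where no fixed format exists.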
I need a C# expert for collaboration on a small project(urgent)
I need to solve a very small issue in C#. It involves Regex and it is pretty urgent. I need help with one method, a few lines of code.
opportunity
Phone Checker and scraper
This will be our 6th time making this tool, and we are trying to perfect the process. The tool will check the website's HOMEPAGE/MAIN page, look for a phone number, and extract it. The tool will check for the “tel:” of a website on the homepage/main page. Sample: 212-828-8436. Sample websites that use “tel:”: vstocktransfer.com cerulli.com payscout.com submort.com foreside.com newpennfinancial.com Next, if tel doesn’t exist, it should consider the Sample +49 341 58 61 67 0 Siamar.de We are open to any suggestion if it beats our accuracy. We run around 100k websites, and we want the tool to get past errors/bugs and continue. We also want the tool to resume where it left off in case of interruption, so that the output file is saved and we know where we left off. Speed is also important. The tool should also be bug- and error-proof: if the program encounters an error, it should automatically skip it and resume with the next URL (or line). The ideal output would be like this: Url, suitability, phone, logic. Url - the input website. Suitability - YES/NO, whether there is a phone number. Phone - the extracted phone number. Logic - whether the source is tel, class number or regex phone format (US and UK only). We also have existing Python code for this; if you can find a way to make it auto-resume where it left off (skipping the error line), that is also OK. This is not a one-time scrape: we are asking for a TOOL, and it must be Windows based.
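The tel-first, regex-fallback, skip-on-error flow described above can be sketched in Python. This is only an outline under stated assumptions: `fetch` is a hypothetical callable that returns a page's HTML, `done` is the set of URLs already present in a previous output file, and the fallback pattern covers common US layouts only (the "class number" source and UK formats are left out).

```python
import re

# href="tel:..." links in the page source.
TEL_HREF = re.compile(r'href\s*=\s*["\']tel:([^"\']+)["\']', re.IGNORECASE)
# Fallback: common US layouts such as 212-828-8436 or (212) 828-8436.
US_PHONE = re.compile(
    r"(?<!\d)(?:\+?1[\s.-]?)?\(?\d{3}\)?[\s.-]?\d{3}[\s.-]?\d{4}(?!\d)"
)

def find_phone(html: str) -> tuple[str, str, str]:
    """Return (suitability, phone, logic) for one page of HTML."""
    match = TEL_HREF.search(html)
    if match:
        return ("YES", match.group(1).strip(), "tel")
    match = US_PHONE.search(html)
    if match:
        return ("YES", match.group(0), "regex")
    return ("NO", "", "")

def process(urls, fetch, done=()):
    """Yield one output row per URL.

    URLs in `done` are skipped so an interrupted run can resume, and any
    exception is recorded in the row instead of stopping the run.
    """
    for url in urls:
        if url in done:
            continue
        try:
            yield (url, *find_phone(fetch(url)))
        except Exception as exc:
            yield (url, "NO", "", f"error: {type(exc).__name__}")
```

Writing each yielded row to the output file as soon as it is produced is what would make the resume step work: on restart, the URLs already in the file become `done`.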
opportunity
Text extraction from invoices
Work to do.
1. Description
We have bboxes (bounding boxes) that have been added to the invoices. For the ones below the table in the invoice, we must take into account that they move in the Y-direction, since the table with prices expands and retracts. We have made a model for the table, so it detects the table and the values. This can probably be enhanced. In the attached image I show an example of the problem. The same supplier can send invoices with different numbers of lines in the table. This means the text below the table (the subtotal, VAT and total) can be on different pages. As you can also see on the invoice, the lines themselves can grow and shrink in height. We can do bounding boxes on a very large (many pages) invoice, but the next invoice can be even bigger or very small (= 1 page for example). The areas we need to extract therefore move up and down and can be on different pages, but we still need to detect the bbox, as we must be able to extract the data from those areas. So we must detect the text itself and utilise the bounding box. Another problem is strings that exceed their bboxes; see the next two images. These are the areas we must solve ASAP -> weekend work.
2. Skills
Python, MySQL, 5+ years, vision text extraction from bounding boxes, OpenCV, NLP, spaCy, regex, tesseract, OCR, PDFXML, TableNet, DeepDeSRT, graph neural networks, GANs and genetic algorithms. But it is allowed to do it more simply too. You must have done something similar previously, and you must know what regex and tesseract are and have used them several times. You have worked with vision, ML, DL or NN.
Price is a placeholder.
opportunity
Python: Regex, ML, vision, Text extraction for template creation
1. Description
We need to create a template-based document flow that automates the templates. The process is:
1. read the file and extract all text (you)
2. find values in table and compare with text extracted (you)
3. if not found, send to template creation, setting a value to 0 instead of 1 (you)
5. manual labeling stores coordinates for every label
6. based on the coordinates, you extract text strings inside the boxes (with regex, for example) and store the values to the template table (you)
7. read the document again, extract based on coordinates, compare with the media_template table and store the results in the media table (you)
Second and later times a document arrives:
1. upload document (already done)
2. detect if there is a template already. Extract text strings with regex by using coordinates, and store the strings in the media table.
Your work is to quickly do the above work.
2. Skills
Python, MySQL, 5+ years. You must have done something similar previously, and you must know what regex and tesseract are and have used them several times. You have worked with vision, ML, DL or NN.
3. Time
3-4 days is what this script takes to do.
Price is a placeholder.
Acrobat JavaScript Regular Expression
I need a regular expression to use in an Acrobat form that will test if the text field ends with five or more spaces and more text. Then, delete everything after the end of the last character preceding the five spaces. So... Text, more text [space] [space] [space] [space] [space] more text, more text. ...will become this- Text, more text What I'm seeking is something like this- if(txtField1.value == regex test text string here) { txtField1.value = [replaced text here]; }
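One way to express the rule is a pattern that matches a run of five or more spaces, followed by at least one non-space character, through to the end of the value. The sketch below shows it in Python rather than Acrobat JavaScript; the function name is hypothetical. The same idea should carry over to a JavaScript `replace` call with a pattern along the lines of `/ {5,}\S[\s\S]*$/`, though that has not been tested inside Acrobat.

```python
import re

# Five or more spaces, then some non-space text, up to the end of the value.
GAP_AND_REST = re.compile(r" {5,}\S.*\Z", re.DOTALL)

def trim_after_gap(value: str) -> str:
    """Drop the first run of 5+ spaces and everything after it.

    Values without such a run (or with only trailing spaces after it)
    are returned unchanged.
    """
    return GAP_AND_REST.sub("", value, count=1)
```

For the example in the brief, `trim_after_gap("Text, more text     more text, more text.")` returns `"Text, more text"`.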
opportunity
Automation Penetration Testing in IoT
The project should be built mainly in Python 3, with classes and modules and using regular expressions, to demonstrate automated penetration testing on IoT. I need to see the main part of the program working by 14th March (at least the networking scan complete and at least the firmware analysis done); if the first part is done successfully, I can give you until 17th March to complete the rest. There should be a main menu asking the user which option to select (Networking Scan, Firmware Analysis, Exit). After an option is selected, a submenu should offer the user further options. For networking, it should use the nmap tool to create the network map and let the user select which target to use (IP address or website). Other tools such as Nessus, Nikto, Netcat, Webscan, and others of your preference should be used to describe at least two of the following processes:
The current OWASP Top 10 Web Application Security Risks:
• Injection (e.g. SQL Injection)
• Broken Authentication
• Sensitive Data Exposure
• XML External Entities (XXE)
• Broken Access Control
• Security Misconfigurations
• Cross-Site Scripting XSS
• Insecure Deserialization
• Using Components with Known Vulnerabilities
• Insufficient Logging and Monitoring
Remove hCaptcha Response from Email
I was implementing hCaptcha in a serverless form system. However, the email I receive from the form includes the response token from hCaptcha. The objective here is to remove the response token from the email, maybe using the indexOf method or a regex. You will be given all the code, and it should be easy for someone with a little experience in JavaScript.
urgent
Shell Programming
I need some help making a shell program in Linux that involves using regex, awk, sed, etc. I have the details of the assignment in a document as well as the two files needed to complete it. Please let me know ASAP if you're available. Thanks!
Need Regex Help
I need to extract values from the following string using regex. These aren't the only values that will come through. I need the value following HC Location, MarketoID, and Email. In the example below, I need three regular expressions, one to pull each value (Testing123, 1345891, jcrumb@gmail.com) "{\"{\\\"HCLocation\\\":Testing123,\\\"MarketoID\\\":1345891,\\\"Email\\\":jcrumb@gmail.com}\":\"\"}"
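A possible approach is one parameterised pattern: the key name, then any mix of backslashes and quotes, then a colon, then everything up to the next comma, brace, backslash or quote. The Python sketch below assumes the values themselves never contain those four characters and that the key is spelled `HCLocation` as in the sample string; the helper name `extract` is hypothetical.

```python
import re

# The sample from the brief, kept byte-for-byte via a raw string.
SAMPLE = r'''"{\"{\\\"HCLocation\\\":Testing123,\\\"MarketoID\\\":1345891,\\\"Email\\\":jcrumb@gmail.com}\":\"\"}"'''

def extract(key: str, text: str):
    """Return the value that follows `key`, or None if the key is absent."""
    # Key, then any run of backslashes/quotes, a colon, then the value.
    pattern = re.escape(key) + r'[\\"]*:([^,}\\"]+)'
    match = re.search(pattern, text)
    return match.group(1) if match else None
```

Calling `extract("HCLocation", SAMPLE)`, `extract("MarketoID", SAMPLE)` and `extract("Email", SAMPLE)` gives `Testing123`, `1345891` and `jcrumb@gmail.com` respectively, and other keys that come through can be pulled the same way.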
opportunity
Public Shopify Application, complex eBay order import
We need a private Shopify application to be developed into a public, subscription based application listed on the Shopify app store.
CRITICAL FUNCTIONS OF THE APPLICATION ARE AS FOLLOWS:
1. Add unlimited eBay accounts, view incoming orders in eBay.
2. Automatic order import options, for example, import when
o Order placed
o Order is paid for (this is default)
o Order contains line items with SKU matching … (text string, begins with, ends with, regex)
o Option to unimport / delete order from Shopify
o Option to Create Test Order in application, with custom line item SKUs, and shipping profile names, and test behaviour when “pushing” to Shopify order list
3. Add SKU mapping functions, such that, for example:
o Option: if rule not found, import anyway, or hold order for review?
o An order placed on eBay with SKU EBAYSKU1 would map to Shopify inventory items: 1 x SHOPIFYITEM2, 2 x SHOPIFYITEM3
Additionally, more advanced SKU mapping rules are required. One such hardcoded rule in our existing private application is as follows:
• Ebay SKU is in format: %SKU%_X%Q%
• %SKU% is equivalent to a valid Shopify SKU
• %Q% is equivalent to a multiplier for the quantity, for example:
o eBay customer purchases 3 x SKUBIT_X2
o This order is imported into Shopify as 6 x SKUBIT
4. Shipping Method mapping functions are also a requirement, for example:
o Option: if rule not found, import anyway, or hold order for review?
o Ebay shipping method displays as RoyalMail_FirstClassStandard
o This will be optionally mapped to ANY text when the order is imported into Shopify
5. Must maintain a LIVE connection with eBay, such that if orders are CANCELLED, this information is automatically fed through into Shopify and the matching order is refunded; if orders are DISPATCHED, the item is marked as dispatched on eBay
6. Must be able to process refunds and cancellations via Shopify
7. Must include stock management capacity, for example:
o When orders are placed in eBay, equivalent stock items must be adjusted (reduced) depending on the quantity purchased.
o On eBay, automatically replenish stock up to: available Shopify inventory, or a set limit number, i.e. 5, 10 (to comply with eBay listing regulations for new accounts)
(the above options are configurable globally or by listing)
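The %SKU%_X%Q% rule from item 3 lends itself to a single regex with two groups. The Python sketch below is illustrative only: the function name is hypothetical, and passing an unmatched SKU through unchanged is just one of the two fallback behaviours the brief leaves open (the other being to hold the order for review).

```python
import re

# %SKU%_X%Q%: a Shopify SKU, the literal "_X", then a quantity multiplier.
SKU_RULE = re.compile(r"^(?P<sku>.+)_X(?P<q>[1-9]\d*)$")

def map_line_item(ebay_sku: str, quantity: int) -> tuple[str, int]:
    """Translate an eBay line item into (Shopify SKU, Shopify quantity)."""
    match = SKU_RULE.match(ebay_sku)
    if not match:
        # Assumed fallback: import the line item as it is.
        return ebay_sku, quantity
    return match.group("sku"), quantity * int(match.group("q"))
```

With the example from the brief, `map_line_item("SKUBIT_X2", 3)` returns `("SKUBIT", 6)`.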
PHP functions to decode non-standard or malformed JSON strings
I am attempting to write a set of PHP functions capable of decoding a JSON object encoded to a string and written to a file, but am consistently running into some problems. The source file in question is a .lumi file used to encode data for the Lumise product customiser application. .lumi files are url-encoded text strings with multiple embedded data URLs which can be decoded to JSON objects, but the complexity of the file is causing me problems. The objective is to take the input .lumi file, parse out the embedded data URLs and save these to either images or text files (depending on the data URL type), and then parse the rest of the file into a format that can be easily read by PHP to extract specific sections of data. I am having some difficulty reliably extracting the data URLs: using preg_replace to match the data URLs via regex appears not to work, and various hacky alternatives, such as splitting by various characters or strings such as “data:image” and looping through the pieces, seem to result in malformed JSON, or possibly just JSON which is problematic for PHP’s json_decode function to parse correctly. In cases where I manage to extract and save the embedded data URLs, replacing them with strings such as [DATAURL_1], [TEXTURL_1] etc., I then run into problems correctly parsing an embedded string, which can be decoded to a child object. I will provide a selection of .lumi files to experiment with, as well as the PHP code I have produced so far for example purposes. I will need to test the resulting code on a variety of different .lumi files generated by the Lumise application, as well as ensure I can extract all data needed. Once this is achieved I will also need a function to reverse the process, converting from a modified JSON data object which is more human readable, back into a .lumi file of the equivalent encoding. 
I imagine this is not an exceedingly complex task – the issue is simply handling the variable and possibly malformed or non-standard encodings – but I have wasted too much time on it myself already, so am hopeful that someone else will be able to do a better job. This project will be in 2 parts - the first will be to achieve reliable decoding of any example input .lumi - the second part, once I have tested that part 1 has been achieved, will be to reverse the process, converting an input human-readable .JSON file and associated images into a .lumi file readable by the Lumise application - with some specific adjustments such as scaling the input images to a given maximum proportion.
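The extract-and-replace step from part 1 can be illustrated with a callback-based substitution. The sketch is in Python rather than PHP and rests on assumptions that would need checking against real files: that the .lumi content is plain URL-encoding, that every embedded data URL is base64-encoded, and that image types map to [DATAURL_n] while everything else maps to [TEXTURL_n]. The function name is hypothetical.

```python
import re
from urllib.parse import unquote

# data:<mime>;base64,<payload> (assumes base64-encoded data URLs only).
DATA_URL = re.compile(r"data:([\w.+-]+/[\w.+-]+);base64,([A-Za-z0-9+/]+=*)")

def extract_data_urls(encoded: str):
    """URL-decode the text and swap each data URL for a numbered placeholder.

    Returns (text_with_placeholders, found), where `found` maps the
    placeholder kind to a list of (mime_type, base64_payload) tuples.
    """
    text = unquote(encoded)
    found = {"DATAURL": [], "TEXTURL": []}

    def swap(match):
        mime, payload = match.group(1), match.group(2)
        kind = "DATAURL" if mime.startswith("image/") else "TEXTURL"
        found[kind].append((mime, payload))
        return f"[{kind}_{len(found[kind])}]"

    return DATA_URL.sub(swap, text), found
```

In PHP the equivalent tool would be `preg_replace_callback`. If the regex calls return NULL on large files, PCRE's backtracking limit may be worth checking as one possible cause, since base64 image payloads can be very long.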
Flat, monochrome icons following font-awesome
We need icons for our website / application. The icons should have a flat, monochrome design and should follow the "light style - pro" icons of font-awesome in design, so that they can complement that library. We need the following icons: • label • text input • number input • regex input • image • checkbox • checkbox-group (vertical) • button • button-group • photo capture • signature • help • placeholder • hidden • audio record • map • section / area • video capture • audio file • date input • drawing • qr code • url input • color input Icons should be in *.png format plus vector. The background must be transparent. Requirements: We get all rights to the icons.
Javascript work needed - about 5 lines of code
I'm using this email parsing script in JavaScript: https://gist.github.com/prasanthmj/ec804de3b6355c3ca26984a892ad550d The description is here: https://gist.github.com/prasanthmj/ec804de3b6355c3ca26984a892ad550d Note that the code works very well apart from the regex parsing, which I struggle with. I want the script to extract all data from my emails and put it into a table like the image below, with the date in the first column, then the subject, then the data (emails will be short, so I only need a maximum of 12 fields). This should be a 5 minute job for anyone who understands regex in JavaScript. The deliverable is the revised, working code sent to me. Only apply if you can complete the work within 24 hours.