
Search tool for compiling and organizing data
- or -
Post a project like this$150
- Posted:
- Proposals: 3
- Remote
- #2136914
- PRE-FUNDED
- Expired
Description
Experience Level: Intermediate
Overview:
I need a search tool that can go through all of my e-textbooks and pull out information based on specific keywords. The program should then compile the search results from multiple books into a single text file so that the entries can be sorted and evaluated. Since the input files are textbooks and reference material I will want the program to be able to copy images and tables in case the information isn't simply in a block of text. It will be important to be able to see where each entry in the text file came from in the original book so that I can check back if the entry is incomplete. Ideally the program would also be able to handle a queue of search terms so that I don't have to begin a new search manually every time one finishes. I'm open to any other optimizations or features you think might be necessary or useful.
EDIT: this should be compatible with Windows 10
Specific function:
1. Search a file for a keyword
2. Copy keyword and surrounding text to a separate text file (including reference file name and page number)
2a. amount of surrounding text to pull should be defined by user between a paragraph to a full page
2b. data selection should also pull tables and image files from the page when copying
3. Search the next file in the folder for the same keyword and copy that data to the same text file, repeating until the end of the folder is reached
4. Each search term has its own text file where the search data is copied (all the results for "spam" from all files goes to "spam.txt" while all the search results for "eggs" get dumped into "eggs.txt")
5. An editable queue of search terms so that the program begins on the next search term once it finishes working on the previous term
File types to handle:
Input: ebook files (.epub, .mobi, .pdf, more if possible)
Output: text files (.doc, .txt, .rtf)
Additional features (optional):
1. set to search multiple specified folders when beginning a search
2. search history tagging files with previously used search terms to prevent duplication in future searches
3. time estimates for completion of search cycle (x min., y sec. to complete search for term)
I need a search tool that can go through all of my e-textbooks and pull out information based on specific keywords. The program should then compile the search results from multiple books into a single text file so that the entries can be sorted and evaluated. Since the input files are textbooks and reference material I will want the program to be able to copy images and tables in case the information isn't simply in a block of text. It will be important to be able to see where each entry in the text file came from in the original book so that I can check back if the entry is incomplete. Ideally the program would also be able to handle a queue of search terms so that I don't have to begin a new search manually every time one finishes. I'm open to any other optimizations or features you think might be necessary or useful.
EDIT: this should be compatible with Windows 10
Specific function:
1. Search a file for a keyword
2. Copy keyword and surrounding text to a separate text file (including reference file name and page number)
2a. amount of surrounding text to pull should be defined by user between a paragraph to a full page
2b. data selection should also pull tables and image files from the page when copying
3. Search the next file in the folder for the same keyword and copy that data to the same text file, repeating until the end of the folder is reached
4. Each search term has its own text file where the search data is copied (all the results for "spam" from all files goes to "spam.txt" while all the search results for "eggs" get dumped into "eggs.txt")
5. An editable queue of search terms so that the program begins on the next search term once it finishes working on the previous term
File types to handle:
Input: ebook files (.epub, .mobi, .pdf, more if possible)
Output: text files (.doc, .txt, .rtf)
Additional features (optional):
1. set to search multiple specified folders when beginning a search
2. search history tagging files with previously used search terms to prevent duplication in future searches
3. time estimates for completion of search cycle (x min., y sec. to complete search for term)

Taylor E.
100% (1)Projects Completed
1
Freelancers worked with
1
Projects awarded
0%
Last project
22 Oct 2018
United States
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
Hi Taylor,
Could you share some E-Files to look more on this job ?
Thanks
Sumit
SaS TechnologiesTaylor E.11 Sep 2018Hi Sumit,
I'm having a bit of trouble understanding your question. Are you asking me to post some e-book files? The program I am looking for should be able to process most if not all of the common ebook file types regardless of the content. I can post a few if it is necessary but the file types in question are very easy to find online.
Thanks,
TaylorSUMIT A.12 Sep 2018Hi Taylor,
Thanks for your response.
Yes, I want to see few of the e-files, so that i can check here that i am able to read these files through a program or not.
Thanks
Sumit
SaS Technologies
708343
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies