
Python program to process files
£502 (approx. $684)
- Proposals: 15
- Remote
- #3855382
- OPPORTUNITY
- Awarded
Description
Experience Level: Expert
I need a piece of software in Python 3.
It resides on a home server on Linux OS (Debian).
It is a routine that runs periodically.
The period is read from a flat conf file.
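A minimal sketch of reading the period from a flat conf file. The `key = value` layout, the `#` comment character, the key name `period_seconds`, and the default of 300 seconds are all assumptions for illustration; the real file format would come from the detailed job spec.

```python
def read_period_seconds(conf_path, key="period_seconds", default=300):
    """Return the run period in seconds from a flat 'key = value' file.

    The key name and default are hypothetical placeholders.
    """
    with open(conf_path, encoding="utf-8") as handle:
        for raw in handle:
            # Drop anything after '#' and surrounding whitespace.
            line = raw.split("#", 1)[0].strip()
            if "=" not in line:
                continue
            name, value = (part.strip() for part in line.split("=", 1))
            if name == key:
                period = int(value)
                if period <= 0:
                    raise ValueError("period must be a positive integer")
                return period
    # Key not present: fall back to the default.
    return default
```

Raising on a non-positive or non-numeric value makes a broken conf file fail loudly at start-up rather than producing a tight loop.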
It reads a MariaDB table of file paths on the same server and compares it against the file system (fs).
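The comparison could be sketched as a set difference between the paths already recorded in the database and the paths found on disk. Here `known_paths` stands in for the rows read from the MariaDB table; how they are fetched, and the function name `find_new_files`, are assumptions for illustration.

```python
import os


def find_new_files(root, known_paths):
    """Return sorted absolute paths under root that are not in known_paths."""
    known = {os.path.abspath(path) for path in known_paths}
    new_files = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            full = os.path.abspath(os.path.join(dirpath, name))
            if full not in known:
                new_files.append(full)
    return sorted(new_files)
```

Normalising both sides with `os.path.abspath` avoids false "new file" hits caused by relative paths or redundant separators.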
It processes new files
The processes include:
- documents
  - indexing text documents in various formats
  - metadata from documents that have it
  - inverse indexing is fairly rudimentary, but robust; details to follow
- images
  - face detection
  - object detection and recognition
  - EXIF data extraction
- audio
  - speech to text (STT) transcription of voice notes with DeepSpeech model on the server
  - index the transcription as for documents
  - STT must be local, not using a remote API; this is critical, and the model is in place already
- video
  - transcoding of certain video files to a new format
  - face detection
  - object recognition
  - EXIF data extraction if present
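As a rough illustration of the list above, the sketch below sorts a path into one of the four categories and builds a rudimentary inverted index from text (a document body or an STT transcription). The extension set, the tokenisation rule, and all function names are assumptions; the actual indexing details are said to follow in the job spec, and the face, object, EXIF, and transcoding steps are not shown.

```python
import mimetypes
import re
from collections import defaultdict

# Hypothetical extra document extensions; extend to match the real spec.
DOCUMENT_EXTENSIONS = (".pdf", ".odt", ".docx", ".md")

# Assumed tokenisation: runs of ASCII letters and digits, lower-cased.
TOKEN = re.compile(r"[a-z0-9]+")


def classify(path):
    """Return 'document', 'image', 'audio', 'video', or 'other'."""
    lower = path.lower()
    if lower.endswith(DOCUMENT_EXTENSIONS):
        return "document"
    mime, _encoding = mimetypes.guess_type(lower)
    if mime is None:
        return "other"
    major = mime.split("/", 1)[0]
    if major == "text":
        return "document"
    if major in ("image", "audio", "video"):
        return major
    return "other"


def new_index():
    """Create an empty index: token -> doc_id -> list of positions."""
    return defaultdict(lambda: defaultdict(list))


def add_to_index(index, doc_id, text):
    """Record every token position of text under doc_id."""
    for position, token in enumerate(TOKEN.findall(text.lower())):
        index[token][doc_id].append(position)
```

Because transcriptions are plain text, the same `add_to_index` call could serve both documents and voice notes, which matches the "index the transcription as for documents" item.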
The program may either run as a watcher, monitoring user directories for new files, or run periodically.
It will run as a daemon, started and stopped with systemd.
It will not run as a cron job.
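One possible shape for the periodic daemon loop is sketched below. systemd sends SIGTERM by default when a service is stopped, so the loop waits on an event that the signal handler sets, which lets it exit promptly instead of sleeping through the stop request. The names `install_signal_handlers`, `run_loop`, and `process_once` are hypothetical.

```python
import signal
import threading


def install_signal_handlers(stop_event):
    """Set stop_event on SIGTERM or SIGINT. Call from the main thread."""
    def _handle_stop(signum, frame):
        stop_event.set()

    signal.signal(signal.SIGTERM, _handle_stop)
    signal.signal(signal.SIGINT, _handle_stop)


def run_loop(process_once, period_seconds, stop_event):
    """Call process_once repeatedly until stop_event is set."""
    while not stop_event.is_set():
        process_once()
        # Sleeps for the period but wakes immediately if a stop arrives.
        stop_event.wait(period_seconds)
```

A typical entry point would create a `threading.Event`, pass it to `install_signal_handlers`, and then call `run_loop` with the period read from the conf file.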
The program updates a MariaDB database for the output of the processing.
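Writing a result back could look like the sketch below. It is written against Python's generic DB-API so it runs without a server: the standard-library `sqlite3` module stands in for MariaDB here, which is a substitution made purely for illustration. The table name `processing_results` and its columns are hypothetical, and the placeholder style (`?`) should be checked against whichever MariaDB driver is actually used.

```python
import sqlite3  # stand-in for a MariaDB driver in this sketch


def record_result(conn, filepath, category, payload):
    """Insert one processing result using a parameterised query."""
    cursor = conn.cursor()
    cursor.execute(
        "INSERT INTO processing_results (filepath, category, payload) "
        "VALUES (?, ?, ?)",
        (filepath, category, payload),
    )
    conn.commit()


def open_demo_database():
    """Create an in-memory database with the hypothetical table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE processing_results ("
        "filepath TEXT NOT NULL, category TEXT NOT NULL, payload TEXT)"
    )
    return conn
```

Passing values as parameters rather than formatting them into the SQL string keeps unusual file names from breaking or altering the query.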
It is part of a project which will be open-source in the future; I'm paying for some aspects myself for the time being.
It must run under Python 3, not Python 2, unless the code is compatible with both.
Expected timeframe is 3 weeks. 10 days to first testing, 10 days to test and debug.
Expected hours of work within that timeframe: 25 hours.
There is a detailed job spec, which can be provided once we get into more detail.
Rupert N.
- 100% (10)
- Projects completed: 11
- Freelancers worked with: 7
- Projects awarded: 59%
- Last project: 3 Apr 2024
- United Kingdom