
Dual-Branch Scene Text Detection Model Development
- or -
Post a project like this£150(approx. $203)
- Posted:
- Proposals: 13
- Remote
- #4415754
- Expired
Digital Marketing Strategist | Website Designer & Developer | Email Marketing Expert

1197277811871174213508311193042120978431973711992783114636414463254116740111074983011892228
Description
Experience Level: Expert
Current Status:
I already have a fully runnable MMDetection/YOLOv3-based framework with training and evaluation scripts, core source code, and custom modules for the Tampered_IC13 dataset.
The dataset is prepared and partially configured for training.
The framework can already run standard YOLOv3 object detection training and evaluation.
Requirements:
Implement a dual-branch architecture with RGB and frequency-domain branches.
Develop a fusion module to effectively combine features from both branches.
Train and evaluate the model on the Tampered_IC13 dataset (and potentially other datasets) to achieve a target F1 score ≥ 0.85, with balanced Precision and Recall.
Deliverables include:
Trained model weights (.pth)
Complete training and evaluation logs (including hyperparameters and loss curves)
Detailed evaluation metrics (Precision, Recall, F1)
A reproducibility guide (instructions to run and replicate the results in my environment)
Optimize the model’s performance to exceed baseline results.
Required Skills:
Proficiency in PyTorch and MMDetection/YOLOv3
Experience with frequency-domain image processing (e.g., FFT, DCT)
Strong background in object detection model design and optimization
I already have a fully runnable MMDetection/YOLOv3-based framework with training and evaluation scripts, core source code, and custom modules for the Tampered_IC13 dataset.
The dataset is prepared and partially configured for training.
The framework can already run standard YOLOv3 object detection training and evaluation.
Requirements:
Implement a dual-branch architecture with RGB and frequency-domain branches.
Develop a fusion module to effectively combine features from both branches.
Train and evaluate the model on the Tampered_IC13 dataset (and potentially other datasets) to achieve a target F1 score ≥ 0.85, with balanced Precision and Recall.
Deliverables include:
Trained model weights (.pth)
Complete training and evaluation logs (including hyperparameters and loss curves)
Detailed evaluation metrics (Precision, Recall, F1)
A reproducibility guide (instructions to run and replicate the results in my environment)
Optimize the model’s performance to exceed baseline results.
Required Skills:
Proficiency in PyTorch and MMDetection/YOLOv3
Experience with frequency-domain image processing (e.g., FFT, DCT)
Strong background in object detection model design and optimization
KIm S.
0% (0)Projects Completed
-
Freelancers worked with
-
Projects awarded
0%
Last project
24 Jan 2026
United Kingdom
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
-
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies