
Phased AI Chatbot Development: Self-Hosted RAG System
- or -
Post a project like this16
                    $1.1k
        - Posted:
- Proposals: 45
- Remote
- #4439426
- OPPORTUNITY
- Open for Proposals
            Full-Stack Web & Mobile App Developer With AI Integration & Automation Expertise        
                
            Expert Web Developer - N8N, Wordpress, Shopify, Opencart, Laravel, Vue, React, PHP        
        
            PPH's TOP Notch Website & Mobile App Developer & Designer(9+ yrs) ✔ Wordpress ✔ Shopify ✔ OpenCart ✔ Laravel ✔ PHP ✔ React Native ✔ Android ✔ iOS ✔ HTML/CSS✔Javascript/jQuery✔Responsive Design✔ASP.net        
        



            ♛ Most Trusted #1 Team |19+ years of expertise in Website, Mobile Apps, Desktop & Console Games. Wordpress, ReactJS, Shopify, Laravel, Python, React Native, Flutter, Unity, Unreal Engine and AR/VR        
        



56231489231751290382012122667123023048579517559836124045053801195128342305328010749830




 + 40 others have already sent a proposal.
 + 40 others have already sent a proposal. Description
                Experience Level: Expert            
                
            Project: We seek an experienced AI/ML developer to build a self-hosted AI chatbot system utilizing Retrieval-Augmented Generation (RAG) for our health related course Website.  We have a wiki, over 1,200 articles, a 70k Youtube channel, podcast, lots of content.  End goal is to have everything relevant RAG-ed for LLM use.  
We plan for users to have access to two tiers of LLM to answer their question:
Free Tier: Intelligent FAQ using public content, guiding users to resources.
Paid Tier: Expert-level assistant with secure access to proprietary support databases and premium materials.
**** This project is structured in paid, progressive milestones, beginning with a full prototype. We're looking for a long-term development partner. ******
Milestone 1: End-to-End RAG Chatbot Prototype (Pilot Phase)
Objective: Deliver a complete, functional RAG chatbot prototype. This milestone will serve as an immediate proof-of-concept, demonstrating core RAG pipeline functionality, response quality, and technical execution in a self-hosted environment.
Scope of Work:
Ingest and process a small sample of public website content provided.
Implement text chunking and generate embeddings using a local embedding model.
Store data in a local vector database (e.g., ChromaDB, FAISS).
Set up local LLM serving (e.g., Ollama) with open-source models (e.g., Mistral, Gemma).
Develop Python script: Query -> Retrieve Context -> Prompt LLM -> Generate Response.
Implement a simple Command-Line Interface (CLI) for interaction.
Deliverables for Milestone 1:
A fully functional Python script demonstrating the RAG process.
The populated local vector database files.
Clear setup instructions for local LLM serving and running the script.
A brief README explaining models and prototype functionality.
A demonstration of the chatbot responding to example queries.
Milestone 1 Budget: We have allocated $800 - $1800 USD for this pilot milestone. Please quote your price for this specific scope.
Evaluation Criteria: Functionality, AI response quality/relevance, code clarity & documentation, technical competence in local LLM/DB setup, communication.
Full Project Vision & Long-Term Income Potential:
Successful Milestone 1 completion leads to an invitation for subsequent phases, building the complete system. The estimated total development budget for the entire project is $7,000 - $20,000+ USD. Future milestones will involve: scaling data ingestion (public & proprietary), building a web UI, implementing secure paid-tier logic, and production deployment. This offers a significant opportunity for a stable, long-term engagement.
Preferred Technology Approach:
Python, LangChain/LlamaIndex, Ollama for local LLM serving, and local vector databases like ChromaDB/FAISS. Experience with FastAPI/Flask for backend APIs and Streamlit for UIs is a plus. We value documented expertise in open-source LLMs and RAG principles. We are open to well-reasoned alternative technologies that align with our self-hosted, cost-effective, performant goals.
How to Apply:
Please submit your proposal including:
Your quote for Milestone 1.
A brief outline (max 500 words) of your strategy for the full project's future milestones.
Your relevant experience with RAG systems, self-hosted LLMs, Python, and related frameworks.
Links to your portfolio or examples of similar AI development projects.
    We plan for users to have access to two tiers of LLM to answer their question:
Free Tier: Intelligent FAQ using public content, guiding users to resources.
Paid Tier: Expert-level assistant with secure access to proprietary support databases and premium materials.
**** This project is structured in paid, progressive milestones, beginning with a full prototype. We're looking for a long-term development partner. ******
Milestone 1: End-to-End RAG Chatbot Prototype (Pilot Phase)
Objective: Deliver a complete, functional RAG chatbot prototype. This milestone will serve as an immediate proof-of-concept, demonstrating core RAG pipeline functionality, response quality, and technical execution in a self-hosted environment.
Scope of Work:
Ingest and process a small sample of public website content provided.
Implement text chunking and generate embeddings using a local embedding model.
Store data in a local vector database (e.g., ChromaDB, FAISS).
Set up local LLM serving (e.g., Ollama) with open-source models (e.g., Mistral, Gemma).
Develop Python script: Query -> Retrieve Context -> Prompt LLM -> Generate Response.
Implement a simple Command-Line Interface (CLI) for interaction.
Deliverables for Milestone 1:
A fully functional Python script demonstrating the RAG process.
The populated local vector database files.
Clear setup instructions for local LLM serving and running the script.
A brief README explaining models and prototype functionality.
A demonstration of the chatbot responding to example queries.
Milestone 1 Budget: We have allocated $800 - $1800 USD for this pilot milestone. Please quote your price for this specific scope.
Evaluation Criteria: Functionality, AI response quality/relevance, code clarity & documentation, technical competence in local LLM/DB setup, communication.
Full Project Vision & Long-Term Income Potential:
Successful Milestone 1 completion leads to an invitation for subsequent phases, building the complete system. The estimated total development budget for the entire project is $7,000 - $20,000+ USD. Future milestones will involve: scaling data ingestion (public & proprietary), building a web UI, implementing secure paid-tier logic, and production deployment. This offers a significant opportunity for a stable, long-term engagement.
Preferred Technology Approach:
Python, LangChain/LlamaIndex, Ollama for local LLM serving, and local vector databases like ChromaDB/FAISS. Experience with FastAPI/Flask for backend APIs and Streamlit for UIs is a plus. We value documented expertise in open-source LLMs and RAG principles. We are open to well-reasoned alternative technologies that align with our self-hosted, cost-effective, performant goals.
How to Apply:
Please submit your proposal including:
Your quote for Milestone 1.
A brief outline (max 500 words) of your strategy for the full project's future milestones.
Your relevant experience with RAG systems, self-hosted LLMs, Python, and related frameworks.
Links to your portfolio or examples of similar AI development projects.

Jake S.
100% (24)Projects Completed
25
Freelancers worked with
17
Projects awarded
27%
Last project
8 Jul 2025
Singapore
New Proposal
Login to your account and send a proposal now to get this project.
Log inClarification Board Ask a Question
- 
There are no clarification messages.
We collect cookies to enable the proper functioning and security of our website, and to enhance your experience. By clicking on 'Accept All Cookies', you consent to the use of these cookies. You can change your 'Cookies Settings' at any time. For more information, please read ourCookie Policy
Cookie Settings
Accept All Cookies










