
Monitoring And Evaluation Projects
Looking for freelance Monitoring And Evaluation jobs and project work? PeoplePerHour has you covered.
Build Research-Grade Coding Tasks for AI Evaluation
I need help building realistic, terminal-based STEM research tasks used to evaluate frontier AI models (GPT, Gemini, etc.).
What you'll build: A self-contained coding task that looks like real research work (analyzing datasets, running simulations, validating hypotheses, comparing methods). Not a textbook problem.
Each submission must include:
- instruction.md (workflow, inputs, outputs, success criteria)
- Reproducible Docker environment with data
- Oracle solution (solve.sh) that fully solves the task
- Deterministic tests for verification
- task.toml metadata
- All packaged into one zip
Quality bar:
- Multi-step, research-grade workflow
- Hard enough that frontier models fail more than 80% of the time
- Oracle passes local tests 3 out of 3 times
- Objectively verifiable outputs
- No LLM-generated content allowed
Who's a fit: STEM background (biology, chemistry, physics, ML, data science, etc.) with strong Python and Docker skills.
Payout: $100 per accepted submission.
Please share your research background and a code sample when applying.
11 days ago | 15 proposals | Remote
Website Designer Required Urgently
I am enhancing my company website and have attached the website enhancement project map for your attention. Let me know the following:
- How much it will cost
- How long it will take
- What software you would like to use (we have a Lovable account if you want to use that; WordPress will also be fine)
- What you need from us
I am receiving lots of bids, so please send yours ASAP for evaluation.
4 days ago | 148 proposals | Remote
I'm looking for someone to create and run Meta Ads
I'm looking for an individual who can create, monitor and scale our Meta Ads. Our product is a motorcycle tracking solution, Realtime Moto (https://realtime-moto.com/). The product is a combination of a physical GPS tracker and a mobile app, allowing users to monitor the current location of their motorcycle and see their ride history and stats. The project has been running for the past 4 years and we currently do not have the capacity to run our own Meta Ads, hence this post. The Pixel is set up and working, and we have run a few Meta Ads in the past few months to see what we can do. We reached a ROAS of around 0.8 while spending roughly 1000€. Attached are the creatives we were running. In general, we are looking for someone to work with through PeoplePerHour on a continuous basis.
4 days ago | 51 proposals | Remote
AI Speech Expert Needed
Seeking an AI speech expert to analyze and optimize speech recognition and synthesis systems. Responsibilities include evaluating current models, improving transcription accuracy, reducing latency, refining language models for diverse accents and noisy environments, and recommending architecture or data augmentation strategies. Candidate should possess deep experience with ASR, TTS, neural architectures, transfer learning, and evaluation metrics. Deliverables: technical audit, actionable roadmap, and performance benchmarks to guide implementation.
12 days ago | 17 proposals | Remote
Health check software
I'm looking for a cost-effective solution to the following. I operate in the health and social care sector, and we now have the equipment to do things like oxygen monitoring, blood pressure monitoring, heart rate monitoring, body temperature etc. What I'd like is a very basic system that doesn't necessarily need to store any data, as I will store it in HubSpot (launching in Q3 this year). The system just needs to allow my field-based managers to input various health metrics, hydration and fluid metrics, as well as the person's mobility, risk of falls, mental capacity, hearing, eyesight et cetera. It then needs to generate a personalised and highly professional PDF report that can be given to the client, giving them their health readings and recommendations on hydration, nutrition and how to reduce their risk, linked to the aforementioned risk of falls, hearing, eyesight et cetera. The recommendations would be somewhat generic: for example, if the client is at high risk of falls, the report would encourage them to use mobility aids, and if they don't have mobility aids, it would suggest they go to their GP for a referral to get a mobility aid or go to a local provider and buy one privately. The PDF report (which needs creating as part of this project) can be 90-95% pre-created through this project; we just need the ability to auto-populate the personalised results, such as the client's details and recommendations. This project is good to go ahead in the next 30 to 60 days. I just need individuals to give me their price and time frames, and to let me know what else they would need from me in order to go ahead. Budget is based on the PPH recommendation; simply give me your realistic price when offering.
2 days ago | 48 proposals | Remote
Seeking a Seasoned Low Latency HFT Engineer
• Connects directly to Interactive Brokers via API.
• Monitors authorized signal providers or trading strategies.
• Uses artificial intelligence to analyze financial news and market sentiment.
• Executes futures trades automatically.
• Provides advanced risk management and real-time monitoring.
• Supports scaling across multiple trading accounts.
Core System Requirements
Interactive Brokers Integration
• Full integration with IBKR API.
• Real-time market data access.
• Automated order execution.
• Support for futures, equities, ETFs, and options.
• Position monitoring and account synchronization.
• Trade audit logs.
Signal Replication Engine
• Monitor approved signal feeds.
• Detect position openings and closures.
• Replicate positions automatically.
• Adjustable trade sizing.
• Multi-account support.
• Real-time synchronization.
AI Market Intelligence Engine
• Financial news aggregation.
• NLP sentiment analysis.
• Economic calendar monitoring.
• Market-impact scoring.
• Trend analysis.
• Event-driven trading filters.
Automated Execution System
• Market orders.
• Limit orders.
• Stop orders.
• Bracket orders.
• Position scaling.
• Dynamic order routing.
Risk Management Framework
• Maximum daily loss limits.
• Maximum drawdown controls.
• Position-size limits.
• Exposure limits.
• Volatility filters.
• Automated trading shutdown protocols.
• Trade validation engine.
Analytics and Dashboard
• Real-time account monitoring.
• Performance metrics.
• P&L tracking.
• Trade history.
• Risk analytics.
• Strategy performance reporting.
• Mobile-friendly dashboard.
Preferred Technical Skills
Required:
• Python
• Interactive Brokers API (IBKR)
• Quantitative Finance
• Algorithmic Trading Systems
• REST APIs
• Database Development
Preferred:
• Machine Learning
• Artificial Intelligence
• Natural Language Processing
• Financial Market Data Systems
• Cloud Infrastructure
• Docker
• Kubernetes
• Linux Server Administration
Deliverables
Phase 1
• System architecture
• Technical specifications
• Development roadmap
Phase 2
• IBKR integration
• Automated execution engine
• Risk management framework
Phase 3
• AI news and sentiment analysis engine
• Signal replication module
• Monitoring infrastructure
Phase 4
• Dashboard development
• Testing and optimization
• Deployment and documentation
Performance Expectations
• Reliable automated execution.
• Stable operation during market hours.
• Low-latency order routing through Interactive Brokers.
• Comprehensive risk controls.
• Scalable architecture.
• Secure and maintainable codebase.
Important Notes
The platform must comply with all Interactive Brokers policies, exchange rules, and applicable financial regulations. The system should be designed with a primary focus on reliability, risk management, transparency, and long-term scalability.
5 days ago | 16 proposals | Remote
Terminal Bench Expert
Role: Terminal Bench Expert
Employment Type: Remote, full-time (40 hours per week with an overlap of 4 hours with PST)
Experience: 3–10 years
What does day-to-day look like:
• Design high-quality Terminal-Bench task ideas and specifications.
• Develop complex tasks requiring reasoning, investigation, and debugging.
• Write clear task descriptions, solution approaches, and verification logic.
• Define deterministic, outcome-based evaluation criteria.
• Identify realistic failure modes, edge cases, and operational constraints.
• Create tasks that challenge AI systems while remaining solvable by experts.
• Collaborate with reviewers to refine task quality and difficulty.
• Contribute expertise across one or more specialized domains.
Required Skills:
• 3–10 years of experience in software engineering or relevant domains.
• Strong debugging, reasoning, and analytical skills.
• Good understanding of system design, workflows, and dependencies.
• Ability to analyze complex systems across multiple layers.
• Experience with production systems, pipelines, or large-scale workflows.
• Strong technical writing and documentation skills.
• Exposure to LLMs, agentic systems, or AI evaluation frameworks.
• Experience reviewing technical specifications or designing validation logic.
Domains (Any of the following):
• Software Engineering & Code Operations
• Debugging & Codebase Navigation
• System Administration & Shell Workflows
• File & Text Processing Pipelines
• Data Engineering (ETL & Data Pipelines)
• Database & SQL Operations
• Machine Learning Pipelines & MLOps
• Post-training & Model Finetuning Workflows
• AI Evaluation & Benchmarking Systems
• Retrieval, Search & Ranking Systems
• GPU / Systems Performance Optimization
• Distributed Systems & Infrastructure
• Cloud & Platform Engineering
• DevOps & CI/CD Systems
• Build & Dependency Management
• Scientific & Numerical Computing
• Simulation & Optimization Systems
• Formal Methods & Theorem Proving
• Document & Structured Data Processing (PDFs, Excel, etc.)
• Media Processing (Video, Audio, Images via CLI tools)
• Programmatic Graphics & Design (SVG, layout, rendering)
• Data Visualization & Reporting Workflows
• Geospatial & Spatial Data Processing
• Time-series & Forecasting Systems
• Security, Forensics & Reverse Engineering
• Cybersecurity & Vulnerability Analysis
• Networking & API Integration Workflows
• Automation & Multi-step Toolchain Orchestration
• CLI Tooling & Developer Tool Workflows
• Version Control & Git Workflows
• Observability, Logging & Monitoring
• Storage Systems & File Systems
• Finance & Accounting Workflows
• Quantitative Finance & Risk Modeling
• Legal & Compliance Workflows
• Healthcare & Clinical Data Processing
• Supply Chain & Logistics Operations
• Marketing & Growth Analytics
• CRM & Sales Operations
• HR & Recruiting Analytics
• Consulting & Strategy Modeling
• Investment Workflows
• Operations Research & Decision Optimization
• Benchmark Infrastructure, Adapters & Harness
18 hours ago | 12 proposals | Remote
Shopify beta-testers + reviews
Seeking Shopify brands and ecommerce agencies to beta-test a new Shopify app and provide reviews. Ideal participants are Shopify store owners, ecommerce agencies, and multi-channel merchants actively managing Shopify and Amazon workflows. Candidates must execute at least ten product transfers over a three-week beta period. We prioritize committed, professional operators who will rigorously evaluate features, report issues, and offer constructive feedback to refine the app for production release.
2 days ago | 29 proposals | Remote
WordPress site maintenance
I need someone to maintain my e-shop in WordPress. I need someone who can handle:
- Regular updates (WordPress core, themes, plugins)
- Security monitoring
- Backups
- Minor fixes
- Basic performance checks
- Fixing the website if it crashes or goes down for any reason
The price in the offer is per year...
11 days ago | 103 proposals | Remote
AI-Generated Instagram Lifestyle Images Using My Photos
I am looking for an experienced AI image creator/editor to generate realistic, high-quality lifestyle images of me participating in fun, interesting, and Instagram-worthy activities. I will provide multiple reference photos of myself. Using these images, I would like you to create realistic AI-generated photos that maintain my likeness while placing me in a variety of engaging settings and activities.
Examples of activities may include:
- Traveling to scenic destinations
- Relaxing at luxury resorts
- Hiking or outdoor adventures
- Dining at stylish restaurants
- Attending events or festivals
- Beach and waterfront activities
- Sports and recreational activities
- Unique or creative lifestyle scenes suitable for Instagram
Requirements:
- Images should look realistic and natural.
- My facial features and appearance should remain consistent across all images.
- Final images should be suitable for Instagram posting.
- Include examples of similar AI image projects you have completed, if available.
Sample Work Request: To help evaluate candidates, please provide examples of your previous AI image work. Ideally, I would also like applicants to create one sample AI-generated image using my reference photos. A watermark is completely acceptable if you wish to protect your work. The sample will only be used to evaluate your skills and determine whether you are a good fit for the project.
Budget: Please let me know:
- How many completed images you can provide within my stated project budget.
- Your expected turnaround time.
- Whether revisions are included.
When applying, please briefly explain:
- Your workflow and process.
- The AI tools you use.
- The number of images included for the budget.
- Whether you can provide a watermarked sample image for evaluation.
Thank you, and I look forward to reviewing your proposals.
a day ago | 57 proposals | Remote
Writer in Google Sheet
Seeking a diligent freelancer to monitor job postings daily and record new listings into a Google Sheet. Responsibilities include locating recently published opportunities, extracting key details (title, company, location, posting date, application link), and entering accurate, consistently formatted entries. Ideal candidate is detail-oriented, reliable, and experienced with Google Sheets and data entry. Timely daily updates and attention to accuracy are essential for successful collaboration.
3 days ago | 35 proposals | Remote
Linkedin messaging
I seek an experienced LinkedIn messaging specialist to refine and deploy targeted outreach via LinkedIn Premium. You will polish my existing message drafts, optimize tone and structure for engagement, and implement a sequence to maximize visibility and responses. Tasks include audience selection, message personalization strategies, scheduling, and performance monitoring with concise reporting. Aim: increase open and reply rates while preserving professional authenticity and compliance with platform policies.
4 days ago | 41 proposals | Remote
LLM QA Reviewer / AI Output Validator (RAG Systems)
Description
We are building a small network of specialists focused on AI reliability and LLM validation. At ProoflineAI, we work with companies deploying AI assistants and RAG-based knowledge systems. Our goal is to ensure these systems are accurate, grounded, and production-ready. This role is focused on evaluating AI outputs, not building models.
What You’ll Do
You will help assess and improve AI system reliability by:
• Reviewing LLM responses for factual accuracy
• Detecting hallucinations and fabricated references
• Verifying grounding against source documents
• Identifying retrieval vs generation failures in RAG systems
• Scoring responses using structured evaluation criteria
• Creating test prompts and edge cases
• Documenting failure patterns clearly
Ideal Profile
You may be a strong fit if you have experience in:
• AI QA / AI annotation
• NLP or LLM evaluation
• QA testing / data quality
• ML Ops / AI operations
• Reviewing AI-generated content critically
Strong written English and attention to detail are essential.
Location (Preferred)
We are primarily looking to work with candidates based in: Poland, Romania, Portugal, Estonia, Latvia, Lithuania
Compensation
Depending on workload:
• €800 – €1,200/month (part-time)
• €1,200 – €1,800/month (full-time)
Long-term collaboration possible based on performance.
17 days ago | 22 proposals | Remote
COBOL Developer - Legacy System Migration Assessment (1 week)
We're evaluating a legacy COBOL system for a migration project and need someone who can review the existing codebase, document the business logic, and assess complexity. The system handles financial transaction processing.
Looking for:
- Production COBOL experience (mainframe or distributed)
- Familiar with financial/banking/insurance systems
- Can document business logic from existing code
- Available this week
This is a short assessment engagement.
14 days ago | 24 proposals | Remote
AI agent for accommodation business
I'm looking for someone who can create an AI agent that will automatically monitor the Booking.com inbox and an Airbnb inbox, automatically reply to guests, and send check-in details to guests. For instance, our properties run with lock boxes and codes, so the agent would have to be able to send lock box combinations to individual bookings. It should also monitor and reply to WhatsApp chat messages and arrange direct bookings.
25 days ago | 63 proposals | Remote
Facebook/Meta Expert Required for eCommerce Project
We have a new eCommerce project that will launch shortly. We're looking for someone who is experienced in creating Meta and Facebook campaigns to a high level and has the ability to scale campaigns quickly and profitably. We plan to launch new projects every few months, so there is lots of potential for future work as well. Looking for someone to start ASAP. You'll also need to have experience in creating static and UGC-style videos.
Responsibilities:
- Plan, launch, and optimize Meta ad campaigns
- Monitor performance and improve ROAS
- Create and test audiences, creatives, and funnels
- Provide clear performance reports and insights
Requirements:
- Proven experience with Meta Ads Manager
- Strong understanding of targeting, creatives, and scaling
- Data-driven mindset with attention to detail
14 hours ago | 38 proposals | Remote
Contract Customer Service Analyst
I am seeking a skilled Contract Customer Service Analyst to evaluate service performance and deliver data-driven insights to improve service quality and the overall client experience. The ideal candidate will be organised, detail-oriented, and capable of transforming support data into actionable process improvements.
Responsibilities
* Respond to client inquiries and concerns in a timely and professional manner
* Analyse and monitor service issues to identify patterns and trends that facilitate process improvements
* Ensure a consistently high standard of service is maintained at all times
* Track and document support questions and resolutions for internal records
* Serve as a subject matter expert on internal policies, procedures, products, and services
* Develop training materials on products and services for internal teams
Please highlight your service analysis and trend documentation experience in your proposal. We seek an independent professional to maintain our high standards and look forward to your application!
a month ago | 14 proposals | Remote
I need a MERN Stack developer for upgrading my Social Platform.
Seeking a senior MERN/Next.js developer to elevate Vibio MVP into Vibio 2.0, a premium, mobile-first social platform. Deliver a sleek responsive UI/UX with dark/light modes, engaging feed and story views, real-time messaging with typing indicators and read receipts, push and email notifications, gamification (achievements, badges, levels), personalized recommendations, referral system, premium memberships, creator tools, and admin dashboards. Build modular, scalable backend (Node/NestJS, PostgreSQL, Redis), AI-ready hooks, CI/CD, automated tests, monitoring, Figma prototypes, documentation, and a roadmap.
a day ago | 57 proposals | Remote
Need an experienced SEO expert for ongoing projects
Seeking an experienced SEO specialist for ongoing projects. The ideal candidate possesses strong expertise in On-Page, Off-Page, and Technical SEO, delivering measurable improvements in organic visibility and rankings. Familiarity with GEO SEO strategies or AI-driven SEO techniques is highly preferred. Responsibilities include comprehensive audits, keyword research, content optimization, technical fixes, backlink development, performance monitoring, and reporting. Reliability, strategic thinking, and proven results are essential. Competitive, long-term collaboration for the right professional.
7 days ago | 54 proposals | Remote
Contract Customer Service Analyst
I am seeking a skilled Contract Customer Service Analyst to evaluate service performance and deliver data-driven insights to improve service quality and the overall client experience. The ideal candidate will be organised, detail-oriented, and capable of transforming support data into actionable process improvements.
Responsibilities
* Respond to client inquiries and concerns in a timely and professional manner
* Analyse and monitor service issues to identify patterns and trends that facilitate process improvements
* Ensure a consistently high standard of service is maintained at all times
* Track and document support questions and resolutions for internal records
* Serve as a subject matter expert on internal policies, procedures, products, and services
* Develop training materials on products and services for internal teams
Please highlight your service analysis and trend documentation experience in your proposal. We seek an independent professional to maintain our high standards and look forward to your application!
a month ago | 17 proposals | Remote