Computer Vision is the use of algorithms to extract high-level understanding from digital images or videos. It is a subfield of Artificial Intelligence. Computer vision has been applied to practical problems including facial recognition, medical image analysis, vehicle detection, and automatic victim detection in disaster scenes. By leveraging Convolutional Neural Networks (CNNs), computer vision can improve the accuracy and precision of many tasks that used to require human labor.

A Computer Vision Expert is a specialist in computer vision algorithms, machine learning, neural networks, and deep learning. A Computer Vision Expert can build projects from scratch or customize existing models for problems such as image classification and segmentation, object detection and tracking, video analysis, and image restoration and enhancement. They can also apply current techniques such as deep learning to improve accuracy and reduce processing time.

Here are some projects that our Computer Vision Experts have made real:

  • Classifying images into categories with pre-trained models
  • Adding angle calculation to an iOS project
  • Developing eye-blink detection project with deep learning models
  • Controlling recording device with stereo visuals from microscope
  • Generating image attributes for identification documents with deep learning
  • Automating post-processing of real estate photos with computational photography algorithms

Computer Vision Experts have done an impressive job on the projects mentioned above, showcasing their willingness to take on all kinds of challenges. We invite you to post a new project on Freelancer.com and hire a Computer Vision Expert to make your vision project a reality.

From 26,263 reviews, clients rate our Computer Vision Experts 4.9 out of 5 stars.
Hire Computer Vision Experts

    23 jobs found

    I’m assembling a practical textbook that teaches drone technologies through hands-on programming with Python. The book should walk readers from fundamental concepts straight into real-time applications, letting them write, test, and fly code as they learn. Core coverage has to include basic Python syntax for newcomers, real-time data processing techniques, and solid drone control algorithms that work on popular flight controllers (PX4, ArduPilot, DJI SDKs, etc.). Example projects—think live telemetry dashboards, autonomous waypoint missions, and computer-vision-assisted flight—should be coded step-by-step so readers can replicate results on their own hardware. Because it’s a learning resource, clear explanations, commented code, troubleshooting tips, and small rev...

    $206 Average bid
    6 bids

    I need a rock-solid, real-time player tracking module for football matches that guarantees the ID assigned to each athlete at kick-off never changes until the final whistle. Right now, our OpenCV–TensorFlow–YOLO pipeline sometimes swaps or loses IDs when athletes overlap, leave the frame briefly, or the camera angle shifts, and that ruins every speed, distance, position, and heat-map metric we generate. Key requirements • Sport: football. • Camera setup: five or more synchronized feeds. • Existing stack: OpenCV, TensorFlow, YOLO – your solution must plug into this environment. What I expect 1. A multi-object tracker with integrated re-identification that preserves the same unique ID through occlusion, crossings, short disappearances, or camera changes...

    $1989 Average bid
    108 bids

    Looking for an experienced Android Java developer to build a complete app for an AR glasses device. The device connects to an Android phone via USB-C. The app must: — Open a live USB UVC camera feed fullscreen on the glasses display — Apply real-time zoom from 1x to 10x with sharpness maintained — Apply real-time contrast enhancement — Apply real-time brightness adjustment — Apply edge detection outline mode — Apply colour filter modes — Read visible text aloud using OCR and text to speech — Simple on-screen menu to switch between features — Maintain minimum 30 FPS at all times SDKs provided: — — USB camera, zoom, image effects (Java API over C/C++ native libraries) — — glasses communication and display cont...

    $1957 Average bid
    6 bids

    I have a set of GelSight images and need an end-to-end workflow that turns them into accurate, high-resolution 3D meshes of everyday objects. The goal is to capture subtle surface variations—scratches, fine textures, and edge curvature—so the final model can be inspected, measured, and exported for downstream use in CAD or visualization tools. You may choose whichever environment you are most comfortable with (Python + OpenCV/PyTorch, MATLAB, or C++). What matters is that the pipeline: • takes raw GelSight data as input (I will supply sample files), • performs all necessary photometric calibration and image alignment, • reconstructs a dense height map, then converts it to a watertight mesh, • outputs OBJ or STL alongside a brief report on reconstruct...

    $536 Average bid
    21 bids

    I need a clear, evidence-based report that compares the performance of today’s most widely cited object-detection algorithms. The focus is strictly on Computer Vision, zeroing in on Object Detection, and the core goal is to evaluate how the main approaches stack up against each other in terms of accuracy, speed, computational cost, and real-world suitability. Scope • Analyse at least three state-of-the-art methods—think Faster R-CNN, SSD, YOLO (v7/8), DETR or similar. • Draw all claims from peer-reviewed journals, top-tier conference papers, or authoritative benchmark leaderboards (e.g., COCO, PASCAL VOC). • Present metrics consistently (mAP, FPS, FLOPs, params, latency) so direct comparison is effortless. • Highlight strengths, weaknesses, and trad...

    $18 / hr Average bid
    41 bids
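    The accuracy metrics this posting names (mAP on COCO or PASCAL VOC) are built on intersection over union (IoU) between a predicted box and a ground-truth box. As a minimal sketch, assuming axis-aligned boxes given as (x1, y1, x2, y2) corner tuples (the function name and box layout are illustrative choices, not taken from the posting), IoU can be computed like this:

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes.

    Each box is an (x1, y1, x2, y2) tuple with x1 <= x2 and y1 <= y2.
    This layout is an assumption made for the example.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap region, clamped at zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter

    # Guard against degenerate zero-area boxes.
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    print(iou((0, 0, 2, 2), (1, 1, 3, 3)))
```

    A detection is usually counted as correct when its IoU with a ground-truth box exceeds a chosen threshold, so a report comparing detectors should state which threshold each quoted mAP figure uses.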

    I need a clear, evidence-based report that compares the performance of today’s most widely cited object-detection algorithms. The focus is strictly on Computer Vision, zeroing in on Object Detection, and the core goal is to evaluate how the main approaches stack up against each other in terms of accuracy, speed, computational cost, and real-world suitability. Scope • Analyse at least three state-of-the-art methods—think Faster R-CNN, SSD, YOLO (v7/8), DETR or similar. • Draw all claims from peer-reviewed journals, top-tier conference papers, or authoritative benchmark leaderboards (e.g., COCO, PASCAL VOC). • Present metrics consistently (mAP, FPS, FLOPs, params, latency) so direct comparison is effortless. • Highlight strengths, weaknesses, and trad...

    $5 / hr Average bid
    12 bids

    **Project Title:** Precision Extraction of Black Overprints from High-Resolution Stamp Sheets (Research Project) **Project Description:** I am conducting a specialized philatelic research study on 1931 Zeppelin overprints. I have two full stamp sheets (50 stamps each, high-resolution scans provided as PDF files), and I need the overprint isolated from each individual stamp with maximum fidelity. This is NOT a graphic design or basic background removal job. This is a precision image extraction task for research purposes requiring strict adherence to instructions and preservation of original detail. The source files are high-resolution PDF scans. You must extract images at native resolution (e.g., 1200 dpi). Screenshots, resampling, or any quality loss are not acceptable. If needed, conver...

    $135 Average bid
    43 bids

    I have a short video—under sixty seconds—in which I want to convincingly replace the original face with another. This is strictly for personal use, so artistic flair and attention to detail matter more to me than commercial polish. The swap must look natural in motion: skin tones matched, lighting consistent, no obvious warping around expressions, and lips staying in sync. I will supply: • The source clip (MP4, 1080p, steady lighting). • High-resolution reference images of the face to insert. Your tasks: • Run a high-quality model (DeepFaceLab, FaceSwap, or similar) to train on my provided images. • Composite the new face onto the entire clip, maintaining frame-by-frame consistency. • Deliver a final render in the same resolution and a lossless ...

    $27 Average bid
    10 bids

    I have just 700 indoor photos that must be labelled for my upcoming object-detection model. Every visible person and every seat that is currently occupied needs its own bounding box, and because the network will train on oriented-bounding-box (OBB) data, the rectangles have to follow the exact rotation of the body or seat, especially when someone is leaning. The data comes from an infrared camera in a cinema (check the samples). Please note there can be many people in one image, and I want each one marked properly. You will work inside the simple browser-based annotation tool I built for this purpose, or any other annotation tool you like. Drag the box corners, spin the angle handle where needed, hit save, and move to the next frame; the app tracks progress automatically so nothing is missed....

    $15 / hr Average bid
    6 bids
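    An oriented bounding box of the kind this posting asks for is commonly stored as a center, a width, a height, and a rotation angle. The sketch below is a hedged illustration of turning that representation into four corner points; the (cx, cy, w, h, angle) parameterization and the function name are assumptions for the example, and the client's annotation tool may store boxes differently.

```python
import math


def obb_corners(cx, cy, w, h, angle_deg):
    """Return the four corners of an oriented bounding box.

    Assumed convention: (cx, cy) is the box center, w and h are the side
    lengths before rotation, and angle_deg rotates the box about its center.
    In image coordinates (y pointing down) a positive angle appears clockwise.
    """
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)

    # Corner offsets from the center before rotation.
    offsets = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]

    # Apply a 2D rotation to each offset, then translate to the center.
    return [
        (cx + dx * cos_t - dy * sin_t, cy + dx * sin_t + dy * cos_t)
        for dx, dy in offsets
    ]


if __name__ == "__main__":
    for corner in obb_corners(100, 50, 40, 20, 30):
        print(corner)
```

    Exporting corners rather than the raw angle can sidestep ambiguity between tools that measure rotation in different directions or ranges.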

    AI Tool: Convert Architectural Casework Elevations (PDF) into Editable AutoCAD DWG Files with Self-Training Interface Project Description I need a custom AI tool that automatically reads architectural casework elevation drawings (PDFs) and generates complete, editable AutoCAD DWG files using my existing multiple block libraries. The tool must detect cabinets, sinks, fixtures, shelving, countertops, and miscellaneous equipment, then place the correct blocks with accurate dimensions, countertop outlines, and section views. It must also include a natural language self-training interface so a non-technical user can teach and refine the AI using plain English rules or by correcting the output drawing. This is Phase 1 (Proof of Concept) of a larger initiative. Future phases will expand to multip...

    $7333 Average bid
    104 bids

    Hybrid Artificial Intelligence System for Autonomous Drone Detection Using Computer Vision and RF Signal Analysis. The idea is to build an AI detection system based on a hybrid method (computer vision and RF signal analysis) and present the results as Accuracy, Precision, Recall, and False Alarm Rate. Algorithms: • Computer vision: YOLOv8 (object detection) • RF analysis: CNN for spectrogram analysis • Merge: rule-based fusion or neural-network fusion. Datasets: • Vision: Anti-UAV Dataset, VisDrone • RF: DeepSig RadioML Dataset, custom SDR capture (HackRF / RTL-SDR). Budget: $150

    $132 Average bid
    70 bids
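    The four result metrics this posting requests can all be derived from a confusion matrix of detection outcomes. The following is a minimal sketch, assuming binary drone / no-drone decisions and assuming that "False Alarm Rate" means the false positive rate FP / (FP + TN); the posting does not define the term, and some authors use a different denominator. The function name and example counts are hypothetical.

```python
def detection_metrics(tp, fp, tn, fn):
    """Compute accuracy, precision, recall, and false alarm rate.

    tp, fp, tn, fn are counts of true positives, false positives,
    true negatives, and false negatives. False alarm rate is taken here
    as FP / (FP + TN), which is an assumption about the intended definition.
    """
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
        "false_alarm_rate": fp / (fp + tn) if (fp + tn) else 0.0,
    }


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    for name, value in detection_metrics(tp=80, fp=10, tn=100, fn=10).items():
        print(f"{name}: {value:.3f}")
```

    Reporting the same four numbers for the vision-only branch, the RF-only branch, and the fused system would make the benefit of fusion directly comparable.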

    We are looking to hire experienced freelancers to help build a next-generation AI-powered camera management system for a camera business that is moving from traditional operations to a more advanced, centralized, and intelligent platform. The objective is to develop a scalable solution that can support multi-camera monitoring, AI-based body scanning, behavior tracking, vehicle number plate recognition, drone integrations, real-time alerts, and secure admin control. The platform should be designed for future growth and long-term operational use. This is not a basic prototype — we are looking for professionals who have worked on relevant, production-level systems. Project Scope The system will include, but is not limited to, the following capabilities: 1. Live camera streaming and re...

    $33306 Average bid
    80 bids

    We are looking for a talented frontend developer to bring our Figma designs to life by building a visually stunning, high-performance website using HTML, CSS, and JavaScript. This project involves translating Japanese content, so proficiency in Japanese is required. The ideal candidate will have a strong eye for design and be able to accurately replicate layouts while enhancing them with smooth animations and professional styling. Responsibilities: Convert Figma designs into clean, responsive HTML/CSS/JavaScript code Ensure pixel-perfect implementation aligned with the original design Translate Japanese content accurately into the website Implement modern UI animations and interactions Optimize for performance and cross-browser compatibility Requirements: Proven experience in frontend deve...

    $164 Average bid
    39 bids

    I have just 700 indoor photos that must be labelled for my upcoming object-detection model. Every visible person and every seat that is currently occupied needs its own bounding box, and because the network will train on oriented-bounding-box (OBB) data, the rectangles have to follow the exact rotation of the body or seat, especially when someone is leaning. The data comes from an infrared camera in a cinema (check the samples). Please note there can be many people in one image, and I want each one marked properly. You will work inside the simple browser-based annotation tool I built for this purpose, or any other annotation tool you like. Drag the box corners, spin the angle handle where needed, hit save, and move to the next frame; the app tracks progress automatically so nothing is missed....

    $49 Average bid
    7 bids

    I'm looking for a skilled developer to create an AI live video call feature for entertainment. The platform should incorporate real-time effects and recognize body movements, facial expressions, and hand gestures. Key Requirements: - AI integration for real-time effects - Body movements tracking - Facial expression recognition - Hand gesture detection Ideal Skills: - Experience in AI and video call technology - Proficiency in real-time image processing - Strong background in UI/UX for engaging entertainment platforms Please share your relevant experience and approach.

    $3822 Average bid
    67 bids

    Contest: Real-Time Lipsync Avatar from a Single Photo — POC / Skills Test Important: This is a Paid Proof of Concept This contest is a skills assessment. We are looking for a talented engineer to join a multi-week (potentially multi-month) project to build a full real-time avatar platform — similar in quality and capability to HeyGen, LiveAvatar, Replika, Candy AI, and D-ID. The winner of this contest will be offered a long-term contract to build the full pipeline with us. Do NOT apply if you can only deliver pre-rendered video. We need real-time. What We Need (POC Deliverable) Build a working prototype that does the following in real time: Take a single static photo (portrait/face) as input Take a live audio stream (microphone or audio chunks) as input Output a video stream ...

    $295 Average bid
    31 entries
    ViewSeek AI
    2 days left

    I’m building ViewSeek AI as a full-featured web application that puts advanced artificial intelligence directly in the browser. The core of the product is AI itself, so every major workflow must be driven by machine-learning models and thoughtful UX. Here is what the first release needs to deliver: • Image recognition that can tag, classify and return metadata in near real-time. • Natural language processing so users can type questions, receive context-aware answers and carry on multi-turn chats (“Asking AI”). • Predictive analytics modules that surface trends or recommendations based on uploaded data sets. • Generative tools capable of creating both images and short video clips from text prompts, with download options in common formats. All ...

    $14 / hr Average bid
    18 bids

    Project Title: Computer Vision Expert for Eye Motion Detection (Mobile App - Closed-Eyelid Tracking) Project Overview We are looking for a highly skilled Computer Vision / AI Developer to build a specialized motion detection module for a mobile application. The core objective is to detect and quantify eyeball movements by analyzing video footage of a subject’s closed eyelids. Technical Challenge: "Bulge Tracking" Unlike standard pupil tracking, this project requires analyzing the movement of the eyeball "bulge" underneath the closed eyelid. The solution must be hardware-agnostic, working with standard smartphone RGB cameras under normal lighting conditions, without the need for specialized IR sensors. Key Deliverables & Requirements 1. Motion Detection: Identi...

    $1321 Average bid
    174 bids

    Expert Mobile Developer: Infrared Eye-Tracking & Nystagmus Detection (Computer Vision). Project Overview: We are seeking a high-level Mobile Developer/Computer Vision Engineer to build a specialized module for a vertigo diagnostic support application for smartphones. The core task is to capture and analyze Nystagmus using the device's internal infrared sensors. Scientific Context: Nystagmus is an involuntary, rhythmic oscillation of the eyes caused by vestibular (inner ear) dysfunction. While vertigo is the subjective sensation of spinning, nystagmus is the objective clinical evidence. Capturing this in total darkness is essential to prevent "visual fixation" which can suppress the symptoms. Key Responsibilities: 1. Mobile IR Capture: Access internal IR sensors (e.g., i...

    $1537 Average bid
    55 bids

    The project will be crafted from scratch and must feel as refined as any top-tier SaaS product. A clean Material Design aesthetic guides every screen, so visual polish goes hand-in-hand with smooth interactions. Core functionality is driven by three AI capabilities working together: • Natural Language Processing for conversational and content-generation tasks • Predictive Analysis that surfaces insights from user data in real time • Computer Vision for image understanding and automated tagging I plan to run these through OpenAI, Anthropic, or a custom LLM stack—whichever you wire up most effectively—so experience orchestrating multiple providers is important. The back end lives in Python, ideally Django or Flask, with well-structured APIs that the fron...

    $355 Average bid
    15 bids

    I’m training a machine-learning model that must recognise consumer electronics, so I need a fresh, well-organised image dataset focused exclusively on product shots. The goal is to gather a wide visual range of smartphones, laptops, tablets, headphones, wearables, home-office gear and similar items photographed on plain backgrounds as well as in real-world settings. Here’s the scope in a nutshell: • Source or capture several thousand high-resolution JPEG or PNG images (at least 1024 px on the shortest side). • Keep each file clearly named by product type and brand, then group the images in logical folders for easy ingestion into my training pipeline. • Supply a simple CSV or JSON annotation file that pairs every filename with its product label; bounding bo...

    $434 Average bid
    19 bids

    I need expertise in identifying and counting roundworm eggs from microscope images. The images will be in TIFF format. Ideal skills and experience: - Proficiency in microscopy and parasitology - Experience with TIFF image analysis - Attention to detail for accurate counting Please provide samples of similar work if available. The software will be used separately from the microscope. Images will be opened in the software, which should identify eggs such as roundworm, coccidia, and tapeworm eggs, count the number of each type in the image, and add the counts to a file or spreadsheet. These images typically come from a McMaster slide, with the eggs lying between blue gridlines. The software needs to identify eggs and ignore air bubbles, dirt, etc.

    $2352 Average bid
    20 bids

    We are hiring remote contributors to create photo-based language data using everyday materials found around you. This project focuses on collecting natural, real-life text captured through a phone camera. What You’ll Do - Photograph common objects that contain written text (printed or handwritten). - Provide three unique shots per item, changing position, distance, or lighting. - Ensure content is original and varied. - Most of the visible text (minimum 75%) must be in your local language. Eligibility - Fluent in the target language (native or near-native). - Physically located in a country where the language is used. - Own a smartphone capable of taking clear photos. How It Works - Upload images through a Google Form. - Submissions are reviewed individually. - Only valid, clear, ...

    $6 / hr Average bid
    279 bids
