Frequently Asked Questions

Find answers to common questions about CVAT installation, usage, troubleshooting, and features.

General Questions

What is CVAT?

CVAT (Computer Vision Annotation Tool) is an open-source, interactive tool for annotating images and videos for computer vision applications. It’s used by tens of thousands of users worldwide to create high-quality training datasets for machine learning models.

Is CVAT free?

Yes! CVAT is open-source and released under the MIT License. You can:

Use cvat.ai for free with limitations (10 tasks, 500MB storage)
Self-host CVAT for unlimited usage
Subscribe to CVAT Cloud for additional features
Get Enterprise support for self-hosted deployments

What annotation types does CVAT support?

CVAT supports:

Bounding boxes (rectangles)
Rotated bounding boxes
Polygons
Polylines
Points
Ellipses
Cuboids (3D boxes)
Skeletons (keypoints)
Masks (segmentation)
Tags (image-level labels)

What file formats does CVAT support?

CVAT supports 30+ annotation formats including:

CVAT (XML)
COCO (JSON)
YOLO / Ultralytics YOLO
Pascal VOC
KITTI
Cityscapes
LabelMe
Datumaro
TFRecord
And many more!

See the full format list.

Installation

What are the system requirements?

Minimum requirements:

RAM: 8GB (16GB recommended)
CPU: 4 cores (8 cores recommended)
Storage: 20GB+ depending on data size
Docker: 20.10.0+
Docker Compose: 1.29.0+

For production:

RAM: 32GB+
CPU: 16+ cores
Storage: SSD with sufficient space for your datasets

How do I install CVAT?

For local installation:

git clone https://github.com/cvat-ai/cvat
cd cvat
docker compose up -d

Access at http://localhost:8080 See the installation guide for detailed instructions.

Can I use CVAT without Docker?

While it’s technically possible, Docker is strongly recommended because:

CVAT has many dependencies (PostgreSQL, Redis, ClickHouse, etc.)
Docker ensures consistent environments
Manual setup is complex and error-prone

For cloud deployment, consider using Kubernetes/Helm.

How do I update CVAT?

For Docker Compose installations:

cd cvat
git pull origin develop
docker compose down
docker compose pull
docker compose up -d

Important: Always backup your data before updating!

Can I install CVAT on Windows?

Yes, using:

WSL2 (Windows Subsystem for Linux) - Recommended
Docker Desktop for Windows

Follow the Windows installation guide for details.

Usage

How do I create my first annotation task?

Go to Tasks → Create new task
Enter a task name
Add labels (e.g., “car”, “person”, “dog”)
Upload images or videos
Click Submit
Wait for processing to complete
Click Open to start annotating

What’s the difference between a Task, Job, and Project?

Project: Container for related tasks with shared labels
Task: A set of media files to annotate
Job: A subset of a task’s frames, assigned to an annotator

Example: Project “Autonomous Driving” → Task “Highway Videos” → Job “Frames 1-1000”

How do I import existing annotations?

Open your task
Go to Actions → Upload annotations
Select format (COCO, YOLO, etc.)
Upload your annotation file
Click Submit

CVAT will import and validate your annotations.

How do I export annotations?

Open your task
Go to Actions → Export task dataset
Select export format
Choose whether to save images
Click Export
Download when ready

Can I use CVAT for video annotation?

Yes! CVAT has excellent video support:

Track objects across frames
Interpolation between keyframes
Video chapter navigation
Frame-by-frame control
High-performance frame caching

How does tracking work?

Create a Track annotation (not a shape)
Draw the object on the first frame
Navigate to a later frame
Adjust the annotation position
CVAT automatically interpolates between frames

You can also use AI-powered tracking with serverless functions.

Can multiple people annotate the same task?

Yes! Split your task into jobs:

When creating a task, set Job overlap and Segment size
CVAT creates multiple jobs automatically
Assign different jobs to different annotators
Each annotator works on their assigned frames

What are serverless functions?

Serverless functions enable:

Auto-annotation: Automatic annotation using AI models
Interactive annotation: AI-assisted annotation tools (e.g., Segment Anything)
Tracking: Object tracking across frames

CVAT supports models like:

Segment Anything (SAM)
YOLOv7
Mask R-CNN
Custom models via Nuclio

See the serverless functions guide.

Troubleshooting

CVAT is slow or unresponsive

Common causes:

Insufficient RAM: Increase Docker memory limit
Large images: Use smaller images or enable compression
Too many objects: Split task into smaller jobs
Slow storage: Use SSD instead of HDD

Optimizations:

Enable frame caching
Use compressed chunks
Reduce chunk size
Close unused browser tabs

I can’t upload large videos

For large files:

Increase Traefik’s upload size limit
Use cloud storage integration (AWS S3, Azure, Google Cloud)
Split video into smaller segments
Use mounted file shares

Edit docker-compose.yml:

traefik:
  command:
    - --entrypoints.web.http.middlewares=cvat-request-body-limit

Annotations disappeared or weren’t saved

Check:

Click Save (Ctrl+S) regularly
Check browser console for errors
Verify permissions (can you write to the job?)
Check server logs: docker logs cvat_server

Recover:

Check Actions → Export job dataset to see if data exists
Contact admin if data is critical

Docker containers fail to start

Common issues: Port conflicts:

docker ps -a  # Check running containers
sudo lsof -i :8080  # Check port usage

Insufficient memory:

Increase Docker memory limit (Docker Desktop → Settings → Resources)
Kill unnecessary processes

Permission issues:

sudo usermod -aG docker $USER  # Add user to docker group
# Log out and back in

How do I check logs?

# All services
docker compose logs

# Specific service
docker logs cvat_server
docker logs cvat_worker_import

# Follow logs
docker logs -f cvat_server

Database is corrupted

To reset (WARNING: This deletes all data!):

docker compose down -v  # Remove volumes
docker compose up -d    # Recreate

To backup before resetting:

docker exec cvat_db pg_dump -U root cvat > backup.sql

CVAT shows “502 Bad Gateway”

This usually means backend isn’t ready:

docker logs cvat_server  # Check for errors
docker compose restart cvat_server  # Restart backend

If persists, check:

Database is running: docker ps | grep db
Redis is running: docker ps | grep redis
Migrations completed: docker exec cvat_server python manage.py showmigrations

Features

Can I use CVAT offline?

Yes, if self-hosted! Once installed:

No internet connection required for annotation
Internet needed for:
- Auto-annotation (if using cloud models)
- Cloud storage integration
- Updates and new models

Does CVAT support 3D annotation?

Yes! CVAT supports:

Point cloud annotation (.pcd files)
3D cuboids with rotation
Multiple camera views
Related images (synchronized 2D images)

Can I customize CVAT?

Yes! CVAT is highly customizable:

UI plugins: Add custom UI components
Server plugins: Extend backend functionality
Custom annotation formats: Add import/export formats
Serverless functions: Deploy custom AI models
Webhooks: Integrate with external systems

Does CVAT have an API?

Yes! CVAT provides:

REST API: Full-featured HTTP API
Python SDK: pip install cvat-sdk
CLI: pip install cvat-cli
OpenAPI schema: https://app.cvat.ai/api/docs

See the API documentation.

Can I integrate CVAT with my ML pipeline?

Yes! Common integrations:

FiftyOne: Dataset management and visualization
Roboflow: Model training and deployment
HuggingFace: Model integration
Custom scripts: Using SDK/CLI
Webhooks: Real-time notifications

Example workflow:

Train model
Use model for auto-annotation in CVAT
Manually correct predictions
Export improved dataset
Retrain model
Repeat

What about annotation quality control?

CVAT includes:

Quality reports: Compare annotations between annotators
Consensus: Merge annotations from multiple annotators
Honeypot frames: Ground truth frames for validation
Review mode: Accept/reject annotations with comments
Inter-annotator agreement: Calculate agreement metrics

Does CVAT support SSO/LDAP?

Yes, in Enterprise version:

LDAP/Active Directory
SAML 2.0
OAuth 2.0

Contact [email protected] for enterprise features.

Can I run CVAT in production?

Yes! For production:

Use Kubernetes/Helm for orchestration
Set up SSL/TLS (HTTPS)
Configure backups (database and media)
Use cloud storage (S3, Azure, GCS)
Set up monitoring (Prometheus, Grafana)
Consider Enterprise support

See the administration guide.

Getting Help

Where can I get help?

Documentation: https://docs.cvat.ai/
GitHub Issues: https://github.com/cvat-ai/cvat/issues
Gitter Chat: https://gitter.im/opencv-cvat/public
Discord: https://discord.gg/fNR3eXfk6C
Stack Overflow: Tag questions with #cvat
Email: [email protected]

How do I report a bug?

Check if already reported: GitHub Issues
Create new issue with:
- CVAT version
- Steps to reproduce
- Expected vs actual behavior
- Screenshots/videos if applicable
- Browser console errors
- Server logs if relevant

How do I request a feature?

Check existing requests
Create new issue:
- Describe the feature
- Explain use case
- Provide examples if possible
Discuss with maintainers
Consider contributing!

Can I contribute to CVAT?

Absolutely! Contributions welcome:

Code: Fix bugs, add features
Documentation: Improve guides and docs
Testing: Report bugs, test new features
Community: Answer questions, help others

See the contributing guide.

More Resources

Website: https://www.cvat.ai/
Documentation: https://docs.cvat.ai/
GitHub: https://github.com/cvat-ai/cvat
YouTube: https://www.youtube.com/@cvat-ai
LinkedIn: https://www.linkedin.com/company/cvat-ai/
Blog: https://www.cvat.ai/post

Still Have Questions?

If your question isn’t answered here:

Check the full documentation
Ask on Gitter or Discord
Open a GitHub Issue
Contact us at [email protected]

Development

Resources

​General Questions

​What is CVAT?

​Is CVAT free?

​What annotation types does CVAT support?

​What file formats does CVAT support?

​Installation

​What are the system requirements?

​How do I install CVAT?

​Can I use CVAT without Docker?

​How do I update CVAT?

​Can I install CVAT on Windows?

​Usage

​How do I create my first annotation task?

​What’s the difference between a Task, Job, and Project?

​How do I import existing annotations?

​How do I export annotations?

​Can I use CVAT for video annotation?

​How does tracking work?

​Can multiple people annotate the same task?

​What are serverless functions?

​Troubleshooting

​CVAT is slow or unresponsive

​I can’t upload large videos

​Annotations disappeared or weren’t saved

​Docker containers fail to start

​How do I check logs?

​Database is corrupted

​CVAT shows “502 Bad Gateway”

​Features

​Can I use CVAT offline?

​Does CVAT support 3D annotation?

​Can I customize CVAT?

​Does CVAT have an API?

​Can I integrate CVAT with my ML pipeline?

​What about annotation quality control?

​Does CVAT support SSO/LDAP?

​Can I run CVAT in production?

​Getting Help

​Where can I get help?

​How do I report a bug?

​How do I request a feature?

​Can I contribute to CVAT?

​More Resources

​Still Have Questions?

Build docs developers (and LLMs) love