doccano is an open-source data labeling tool for machine learning practitioners. It supports different types of labeling tasks across many data formats. You can try doccano on the demo page.
You can also integrate doccano with your own scripts because it exposes its features as REST APIs. Through the APIs, you can label your data with a machine learning model. See the API documentation for details.
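As a rough illustration of model-assisted labeling, the sketch below turns model predictions into labeled records and serializes them as JSON Lines. It is not doccano's own code: `prelabel`, `to_jsonl`, and `keyword_model` are hypothetical helpers, and the `"text"` and `"label"` field names are an assumption that should be checked against the format your project type expects.

```python
import json
from typing import Callable, Iterable, List


def prelabel(texts: Iterable[str], predict: Callable[[str], str]) -> List[dict]:
    """Attach one predicted label to each text."""
    # Assumption: records use "text" and "label" keys; confirm the field
    # names your project type expects before importing or uploading.
    return [{"text": text, "label": [predict(text)]} for text in texts]


def to_jsonl(records: Iterable[dict]) -> str:
    """Serialize records as JSON Lines: one JSON object per line."""
    return "\n".join(json.dumps(record, ensure_ascii=False) for record in records)


def keyword_model(text: str) -> str:
    """Hypothetical stand-in for a real model: a trivial keyword rule."""
    return "positive" if "good" in text.lower() else "negative"


records = prelabel(["Good service.", "Terrible food."], keyword_model)
print(to_jsonl(records))
```

In practice `keyword_model` would be replaced by a call to a trained model, and a human annotator would then review and correct the suggested labels in doccano.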
Start and finish a labeling project with doccano by following these steps:
pip install doccano
doccano init
doccano createuser
doccano webserver
# In another terminal, run the following command:
doccano task
- Log in with the username and password you set with `doccano createuser`.
- Select `Create` to create a project and start labeling data.
- Select `Import dataset` on the dataset page and import the dataset you want to use.
- Select `Start annotation` and label the data.
- Select `Export dataset` on the dataset page and export the labeled dataset.

You can customize doccano to suit your needs. The architecture of doccano consists of two parts: backend and frontend.

Module | Technology | Description |
---|---|---|
doccano backend | Python, Django, and Django REST framework | Performs data labeling via REST APIs. |
doccano frontend | JavaScript web app using Vue.js and Nuxt.js | Performs data labeling in a user interface. |
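Because the backend does its work through REST APIs, a script can talk to it much as the frontend does. The sketch below only builds HTTP requests with Python's standard library and does not send them. The base URL, the `/v1/auth/login/` and `/v1/projects` paths, the `Token` authorization scheme, and the sample credentials are all assumptions for illustration; check the API documentation of your doccano version for the actual endpoints and authentication method.

```python
import json
import urllib.request
from typing import Optional

# Assumption: doccano is served locally; adjust host and port to your setup.
BASE_URL = "http://127.0.0.1:8000"


def build_request(
    base_url: str,
    path: str,
    payload: Optional[dict] = None,
    token: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a JSON request: POST with a payload, GET without."""
    url = base_url.rstrip("/") + "/" + path.lstrip("/")
    headers = {"Accept": "application/json"}
    data = None
    if payload is not None:
        data = json.dumps(payload).encode("utf-8")
        headers["Content-Type"] = "application/json"
    if token is not None:
        # Assumption: the server accepts a token-style Authorization header.
        headers["Authorization"] = f"Token {token}"
    method = "POST" if data is not None else "GET"
    return urllib.request.Request(url, data=data, headers=headers, method=method)


# Hypothetical endpoint paths and credentials, for illustration only.
login = build_request(BASE_URL, "/v1/auth/login/", payload={"username": "admin", "password": "pass"})
projects = build_request(BASE_URL, "/v1/projects", token="example-token")

print(login.get_method(), login.full_url)
print(projects.get_method(), projects.full_url)
```

Passing one of these objects to `urllib.request.urlopen` would send it to a running server. Keeping request construction separate from sending makes the script easy to inspect and test without a live doccano instance.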
For help and feedback, please feel free to contact the author.