We propose DALTag, an on-demand deep active learning-based text annotation platform that leverages few-shot learning (FSL).DALTag uses different active learning acquisition techniques that rank the unlabeled data points according to the model's confidence in their automatic annotations. It identifies the top data points with initial annotations. Users can decide whether to keep the annotations or update them. The updated annotations are used to fine-tune the pre-trained models using FSL. Users can easily configure DALTag based on one of its built-in models and techniques. Moreover, DALTag allows users to add their own acquisition techniques and datasets, enabling efficient handling of new datasets and text classification tasks. During the demonstration, the attendees will witness the ability of DALTag to annotate data samples using active learning and a few human-annotated samples. We evaluated DALTag using different text classification tasks, e.g. sentiment analysis and named entity recognition (NER).DALTag achieves results equivalent to training using the whole training data by using 45% of training data in the Question Classification task and 30% of training data in the Named Entity Recognition task, making it a flexible and accurate solution for annotating text data. Our DALTag demo video is available online: https://rebrand.ly/daltag
pip install -r requirements.txt
cd backend
python manage.py migrate
python manage.py create_roles
python manage.py createsuperuser
python manage.py runserver
cd backend
celery --app=config worker --loglevel=INFO --concurrency=1
cd frontend
yarn install (use node version 16.0.0 && yarn version 1.22.19)
yarn dev