label_studio_annotation
example offers a detailed guide for each cloud provider on how to set it up.label_studio
flavor stack component as part of your stack, you'll need to have registered a cloud artifact store and a secrets manager to handle authentication with Label Studio as well as any secrets required for the Artifact Store. (See the docs on how to register and setup a cloud artifact store as well as a secrets manager.)--authentication_secret
as described in this guide, for example in the case of AWS.Ctrl-c
out of the Label Studio server that is running on the terminal.<>
with whatever you choose:zenml annotator dataset list
this should work without any errors. You're ready to use your annotator in your ML workflow!zenml annotator...
CLI command.zenml annotator dataset list
. To work on annotation for a particular dataset, you can run zenml annotator dataset annotate <dataset_name>
.BaseAnnotator
class. There are some methods that are core methods that must be defined, like being able to register or get a dataset. Most annotators handle things like the storage of state and have their own custom features, so there are quite a few extra methods specific to Label Studio.LabelStudioDatasetRegistrationConfig
- a step config object to be used when registering a dataset with Label studio using the get_or_create_dataset
stepLabelStudioDatasetSyncConfig
- a step config object to be used when registering a dataset with Label studio using the sync_new_data_to_label_studio
step. Note that this requires a secret schema to have been pre-registered with your artifact store as being the one that holds authentication secrets specific to your particular cloud provider. (Label Studio provides some documentation on what permissions these secrets require here.)get_or_create_dataset
step - This takes a LabelStudioDatasetRegistrationConfig
config object which includes the name of the dataset. If it exists, this step will return the name, but if it doesn't exist then ZenML will register the dataset along with the appropriate label config with Label Studio.get_labeled_data
step - This step will get all labeled data available for a particular dataset. Note that these are output in a Label Studio annotation format, which will subsequently converted into a format appropriate for your specific use case.sync_new_data_to_label_studio
step - This step is for ensuring that ZenML is handling the annotations and the files being used are stored and synced with the ZenML cloud artifact store. This is an important step as part of a continuous annotation workflow since you want all the subsequent steps of your workflow to remain in sync with whatever new annotations are being made or have been created.integrations.label_studio.label_config_generators
module for those two functions.