Here’s my situation:
- I create a list of assets containing Text data rows (let’s say n data rows).
- I upload these assets to Catalog as a batch called B1.
- I annotate all data rows in B1.
- I export the annotated data rows into my code on my machine and do some operations based on the labels, plus some operations that can’t be done on Labelbox (so we’re not talking about simple filtering). Let’s say that, out of the n initial data rows, I decide to leave out m data rows.
- Now I want to do a new round of annotation, with new annotators and a new schema, on those n-m data rows (let’s call this new batch B2). But I don’t want to upload new assets to the Catalog. Instead, I want to create a new batch that is a sample of B1, containing only the data rows I kept after my internal operations. (The reason: I don’t want to be charged new LBUs; I want to reuse data rows that are already uploaded and labeled, and for which I have already been charged.)
- The question is: how can I achieve this? As far as I know, if I upload new data rows that are a subset of an existing batch, I will get a global key error. And if I create new global keys, I will get charged again.
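To make the setup concrete, here is a rough sketch of the local filtering step in plain Python. Everything in it is a stand-in: the `exported_rows` structure, the example global keys, and the `keep_row` rule are hypothetical placeholders for my real export and my real internal operations, and no Labelbox call is shown.

```python
# Hypothetical stand-in for the rows exported from B1 (n = 4 here).
# The real export has a different, richer structure.
exported_rows = [
    {"global_key": "doc-001", "label": "relevant"},
    {"global_key": "doc-002", "label": "irrelevant"},
    {"global_key": "doc-003", "label": "relevant"},
    {"global_key": "doc-004", "label": "irrelevant"},
]


def keep_row(row):
    """Placeholder for my internal operations (not simple filtering in reality)."""
    return row["label"] == "relevant"


# The n-m rows I want in B2, identified by their existing global keys.
kept_global_keys = [r["global_key"] for r in exported_rows if keep_row(r)]

# The m rows I am leaving out.
dropped_global_keys = [r["global_key"] for r in exported_rows if not keep_row(r)]
```

What I am after is a way to build B2 from `kept_global_keys` (or the matching data row IDs) so that it points at the data rows already in Catalog, without re-uploading them.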