AWS re:Invent 2021 پر لانچ کیا گیا، ایمیزون سیج میکر گراؤنڈ ٹروتھ پلس helps you create high-quality training datasets by removing the undifferentiated heavy lifting associated with building data labeling applications and managing the labeling workforce. All you do is share data along with labeling requirements, and Ground Truth Plus sets up and manages your data labeling workflow based on these requirements. From there, an expert workforce that is trained on a variety of machine learning (ML) tasks performs data labeling. You don’t even need deep ML expertise or knowledge of workflow design and quality management to use Ground Truth Plus.
Building a high-quality training dataset for your ML algorithm is an iterative process. ML practitioners often build custom systems to inspect data labels because accurately labeled data is critical to ML model quality. To ensure you get high-quality training data, Ground Truth Plus provides you with a built-in user interface (Review UI) to inspect the quality of data labels and provide feedback on data labels until you’re satisfied that the labels accurately represent the ground truth, or what is directly observable in the real world.
This post walks you through steps to create a project team and use several new built-in features of the Review UI tool to efficiently complete your inspection of a labeled dataset. The walkthrough assumes that you have an active Ground Truth Plus labeling project. For more information, see ایمیزون سیج میکر گراؤنڈ ٹروتھ پلس - کوڈ یا اندرون ملک وسائل کے بغیر ٹریننگ ڈیٹا سیٹس بنائیں.
ایک پروجیکٹ ٹیم قائم کریں۔
A project team provides access to the members from your organization to inspect data labels using the Review UI tool. To set up a project team, complete the following steps:
- On the Ground Truth Plus دلاسامنتخب کریں پروجیکٹ ٹیم بنائیں.
- منتخب کریں Create a new Amazon Cognito user group . If you already have an existing ایمیزون کاگنیٹو user group, select the Import members آپشن.
- کے لئے Amazon Cognito user group name, enter a name. This name can’t be changed.
- کے لئے ای میل ایڈریس, enter the email addresses of up to 50 team members, separated by commas.
- میں سے انتخاب کریں پروجیکٹ ٹیم بنائیں.
Your team members will receive an email inviting them to join the Ground Truth Plus project team. From there, they can log in to the Ground Truth Plus project portal to review the data labels.
Inspect labeled dataset quality
Now let’s dive into a video object tracking example using CBCL StreetScenes ڈیٹاسیٹ
After the data in your batch has been labeled, the batch is marked as جائزہ لینے کے لیے تیار ہیں۔.
Select the batch and choose Review batch. You’re redirected to the Review UI. You have the flexibility to choose a different sampling rate for each batch you review. For instance, in our example batch, we have a total of five videos. You can specify if you want to review only a subset of these five videos or all of them.
Now let’s look at the different functionalities within the Review UI that will help you in inspecting the quality of the labeled dataset at a faster pace, and providing feedback on the quality:
- Filter the labels based on label category – Within the Review UI, in the right-hand pane, you can filter the labels based on their label category. This feature comes in handy when there are multiple label categories (for example,
Vehicles
,Pedestrians
، اورPoles
) in a dense dataset object, and you want to view labels for one label category at a time. For example, let’s focus on theCar
label category. Enter theCar
label category in the right pane to filter for all annotations of only typeCar
. The following screenshots show the Review UI view before and after applying the filter.
- Overlay associated annotated attribute values – Each label can be assigned attributes to be annotated. For example, for the label category
Car
, say you want to ask the workers to also annotate theColor
اورOcclusion
attributes for each label instance. When you load the Review UI, you will see the corresponding attributes under each label instance on the right pane. But what if you want to see these attribute annotations directly on the image instead? You select the labelCar:1
, and to overlay the attribute annotations forCar:1
, you press Ctrl + A
Now you will see the annotationDark Blue
کے لئےColor
attribute and annotationNone
کے لئےOcclusion
attribute directly displayed on the image next to theCar:1
bounding box. Now you can easily verify thatCar:1
was marked asDark Blue
, with no occlusion just from looking at the image instead of having to locateCar:1
on the right pane to see the attribute annotations.
- Leave feedback at the label level – For each label, you can leave feedback at the label level in that label’s Label feedback free string attribute. For example, in this image,
Car:1
looks more black than dark blue. You can relay this discrepancy as feedback forCar:1
کا استعمال کرتے ہوئے Label feedback field to track the comment to that label on that frame. Our internal quality control team will review this feedback and introduce changes to the annotation process and label policies, and train the annotators as required.
- Leave feedback at the frame level – Similarly, for each frame, you can leave feedback at the frame level under that frame’s Frame feedback free string attribute. In this case, the annotations for
Car
اورPedestrian
classes look correct and well implemented in this frame. You can relay this positive feedback using the تاثرات فراہم کریں۔ field, and your comment is linked to this frame.
- Copy the annotation feedback to other frames – You can copy both label-level and frame-level feedback to other frames if you right-click that attribute. This feature is useful when you want to duplicate the same feedback across frames for that label, or apply the same frame-level feedback to several frames. This feature allows you to quickly complete the inspection of data labels.
- Approve or reject each dataset object – For each dataset object you review, you have the option to either choose منظور if you’re satisfied with the annotations or choose رد کرو if you’re not satisfied and want those annotations reworked. When you choose جمع کرائیں, you’re presented with the option to approve or reject the video you just reviewed. In either case, you can provide additional commentary:
- اگر آپ کا انتخاب ہے منظور, the commentary is optional.
- اگر آپ کا انتخاب ہے رد کرو, commentary is required and we suggest providing detailed feedback. Your feedback will be reviewed by a dedicated Ground Truth Plus quality control team, who will take corrective actions to avoid similar mistakes in subsequent videos.
- اگر آپ کا انتخاب ہے منظور, the commentary is optional.
After you submit the video with your feedback, you’re redirected back to the project detail page in the project portal, where you can view the number of rejected objects under the Rejected objects column and the error rate, which is calculated as the number of accepted objects out of reviewed objects under the قبولیت کی شرح column for each batch in your project. For example, for batch 1 in the following screenshot, the acceptance rate is 80% because four objects were accepted out of the five reviewed objects.
نتیجہ
A high-quality training dataset is critical for achieving your ML initiatives. With Ground Truth Plus, you now have an enhanced built-in Review UI tool that removes the undifferentiated heavy lifting associated with building custom tools to review the quality of the labeled dataset. This post walked you through how to set up a project team and use the new built-in features of the Review UI tool. Visit the گراؤنڈ ٹروتھ پلس کنسول شروع کرنے کے لئے.
ہمیشہ کی طرح، AWS تاثرات کا خیرمقدم کرتا ہے۔ براہ کرم کوئی تبصرہ یا سوالات بھیجیں۔
مصنف کے بارے میں
منیش گوئل Amazon SageMaker Ground Truth Plus کے پروڈکٹ مینیجر ہیں۔ اس کی توجہ ایسی مصنوعات بنانے پر ہے جو صارفین کے لیے مشین لرننگ کو اپنانا آسان بناتی ہیں۔ اپنے فارغ وقت میں، وہ سڑک کے سفر اور کتابیں پڑھنے سے لطف اندوز ہوتے ہیں۔
Revekka Kostoeva is a Software Developer Engineer at Amazon AWS where she works on customer facing and internal solutions to expand the breadth and scalability of Sagemaker Ground Truth services. As a researcher, she is driven to improve the tools of the trade to drive innovation forward.
- سکے سمارٹ۔ یورپ کا بہترین بٹ کوائن اور کرپٹو ایکسچینج۔
- پلیٹو بلاک چین۔ Web3 Metaverse انٹیلی جنس۔ علم میں اضافہ۔ مفت رسائی۔
- کرپٹو ہاک۔ Altcoin ریڈار. مفت جانچ.
- Source: https://aws.amazon.com/blogs/machine-learning/inspect-your-data-labels-with-a-visual-no-code-tool-to-create-high-quality-training-datasets-with-amazon-sagemaker-ground-truth-plus/
- "
- 100
- 2021
- a
- تک رسائی حاصل
- کے پار
- اعمال
- فعال
- ایڈیشنل
- پتے
- یلگورتم
- تمام
- کی اجازت دیتا ہے
- پہلے ہی
- ہمیشہ
- ایمیزون
- ایپلی کیشنز
- کا اطلاق کریں
- درخواست دینا
- منظور
- تفویض
- منسلک
- اوصاف
- AWS
- کیونکہ
- اس سے پہلے
- سیاہ
- جرات مندانہ
- کتب
- باکس
- تعمیر
- عمارت
- تعمیر میں
- حساب
- کیس
- قسم
- میں سے انتخاب کریں
- کلاس
- کوڈ
- تبصروں
- مکمل
- کنسول
- کنٹرول
- اسی کے مطابق
- تخلیق
- اہم
- اپنی مرضی کے
- گاہک
- گاہکوں
- گہرا
- اعداد و شمار
- وقف
- گہری
- ڈیزائن
- تفصیل
- تفصیلی
- ڈیولپر
- مختلف
- براہ راست
- ڈرائیو
- کارفرما
- ہر ایک
- آسانی سے
- مؤثر طریقے سے
- ای میل
- انجینئر
- درج
- مثال کے طور پر
- توسیع
- ماہر
- مہارت
- سامنا کرنا پڑا
- تیز تر
- نمایاں کریں
- خصوصیات
- آراء
- لچک
- توجہ مرکوز
- توجہ مرکوز
- کے بعد
- آگے
- فریم
- مفت
- سے
- گروپ
- ہونے
- مدد
- مدد کرتا ہے
- اعلی معیار کی
- کس طرح
- کیسے
- HTTPS
- تصویر
- عملدرآمد
- کو بہتر بنانے کے
- معلومات
- اقدامات
- جدت طرازی
- مثال کے طور پر
- انٹرفیس
- IT
- میں شامل
- علم
- لیبل
- لیبل
- لیبل
- سیکھنے
- چھوڑ دو
- سطح
- اٹھانے
- لوڈ
- دیکھو
- تلاش
- مشین
- مشین لرننگ
- بنا
- انتظام
- مینیجر
- مینیجنگ
- اراکین
- غلطیوں
- ایم ائی ٹی
- ML
- ماڈل
- زیادہ
- ایک سے زیادہ
- اگلے
- تعداد
- اختیار
- تنظیم
- دیگر
- مہربانی کرکے
- پالیسیاں
- پورٹل
- مثبت
- عمل
- مصنوعات
- حاصل
- منصوبے
- فراہم
- فراہم کرتا ہے
- فراہم کرنے
- معیار
- جلدی سے
- RE
- پڑھنا
- حقیقی دنیا
- وصول
- کو ہٹانے کے
- کی نمائندگی
- ضرورت
- ضروریات
- کا جائزہ لینے کے
- دایاں کلک کریں
- سڑک
- اسی
- اسکیل ایبلٹی
- سروسز
- مقرر
- کئی
- سیکنڈ اور
- دکھائیں
- اسی طرح
- اسی طرح
- سافٹ ویئر کی
- حل
- شروع
- سسٹمز
- کاموں
- ٹیم
- ۔
- کے ذریعے
- وقت
- کے آلے
- اوزار
- ٹریک
- ٹریکنگ
- تجارت
- ٹرین
- ٹریننگ
- ui
- کے تحت
- استعمال کی شرائط
- مختلف اقسام کے
- اس بات کی تصدیق
- ویڈیو
- ویڈیوز
- لنک
- کیا
- کیا ہے
- ڈبلیو
- کے اندر
- بغیر
- کارکنوں
- افرادی قوت۔
- کام کرتا ہے
- دنیا
- اور