标注整体以JSON格式保存,每张图片对应一个字典,字典的键为nuScenes数据集中图片的名称,值则为车轮标注,格式如下 ```json { "n015-2018-07-24-11-22-45+0800__CAM_FRONT__1532402935662460": { "scores": [ 0.7676898241043091, 0.6984323263168335, ... ], "boxes": [ [ 162.60946655273438, 519.4454956054688, 201.25282287597656, 583.3193969726562 ], [ 422.3385314941406, 580.2855834960938, 465.64898681640625, 604.3241577148438 ], ... ], "masks": [ [ [ 520, 170 ], [ 520, 171 ], ... ], ... ], "mask_scores": [ 0.88671875, 0.93359375, ... ], "wheel_num": 3, "wheel_tokens": [ "9695de5e535e45a7af2be6bbdd8ab6f1", "011e076fae7a4a4ebba7ebbdab60df49", ... ], "assoc_box_tokens": [ "b9dcffffb5b441178705f39d7582e34c", "2fc559f2d63349f89df453b7831feacf", ... ] }, ... } ``` 具体地: - "**n015-2018-07-24-11-22-45+0800__CAM_FRONT__1532402935662460**" 为nuScenes数据集中的图片名称 - "**scores**" 是车轮的2D检测得分,每个得分对应一个车轮检测 is the detection scores of wheel 2D bounding boxes. - "**boxes**" 是车轮的2D检测框,每个框对应一个车轮,框的坐标为(umin, vmin, umax, vmax),umin, vmin为左上角坐标,umax, vmax为右下角坐标 - "**masks**" 是车轮的像素级分割结果,每个点对应一个像素,点的坐标为(v, u),v为行坐标,u为列坐标 - "**mask_scores**" 是轮子的像素级分割得分,每个得分对应一个mask - "**wheel_num**" 是图片中的车轮数量 - "**wheel_tokens**" 是图片中的车轮的唯一标识符,用一个字符串表示 - "**assoc_box_tokens**" 是该车轮所关联的nuScenes中车辆的标识符,表示关联信息