Why Random Baseline? 
Given a standard dataset with a fixed set of models, it is easier to compare the performance of different models. But what if we are working on a new model which has performance far from the best set of models but as a first step, we simply want to check if the model is learning anything at all. In such cases, it is useful to compare the performance of the model with a random baseline.
 
Proposed Idea 
To formalize the problem, let’s say for an arbitrary image, model predicts \(k\)  bounding boxes with sizes \((h_1, w_1), (h_2, w_2), \ldots, (h_k, w_k)\) . 
A simple random baseline would be to generate \(k\)  random bounding boxes for that image with sizes \((h_1, w_1), (h_2, w_2), \ldots, (h_k, w_k)\) . In other words, we can simply move the predicted bounding boxes to random locations ensuring that the bounding boxes are within the image. 
 
 
Imports 
import  os'CUDA_VISIBLE_DEVICES' ] =  '1' import  numpy as  npfrom  tqdm.notebook import  tqdmimport  supervision as  svfrom  roboflow import  Roboflowfrom  dotenv import  load_dotenvfrom  ultralytics import  YOLOfrom  copy import  deepcopyfrom  PIL import  Imagefrom  IPython.display import  clear_output 
 
Dataset 
=  "/tmp/poker-cards-fmjio" =  Roboflow(api_key= os.getenv("ROBOFLOW_API_KEY" ))=  rf.workspace("roboflow-jvuqo" ).project("poker-cards-fmjio" )=  project.version(4 )=  version.download("yolov8" , location= data_location)
loading Roboflow workspace...
loading Roboflow project... 
 
 
Train Model 
=  YOLO("yolo11m" )= f" { data_location} /data.yaml" , epochs= 1 , project= "/tmp/poker-cards-fmjio" , exist_ok= True ) 
 
Evaluate Model 
=  sv.DetectionDataset.from_yolo(f" { data_location} /test/images" , f" { data_location} /test/labels" , f" { data_location} /data.yaml" )len (test_dataset) 
=  []=  []for  _, img, annotations in  tqdm(test_dataset):=  model.predict(img, verbose= False )[0 ]=  sv.Detections.from_ultralytics(results)
 
=  sv.metrics.MeanAveragePrecision().update(detections_list, annotations_list).compute() 
 
Random Baseline 
As per our assumption, we would simply need to randomly move the existing bounding boxes keeping their sizes constant with the following constraints:
The bounding box should be within the image boundaries. 
 
=  0 =  model.args['imgsz' ]=  []for  random_seed in  tqdm(range (100 )):=  []for  detections in  detections_list:=  deepcopy(detections)=  np.random.rand(len (detections))=  -  detections.xyxy.min (axis= 1 ) +  1e-6 =  max_size -  detections.xyxy.max (axis= 1 ) -  1e-6 =  lower_limit +  shift *  (upper_limit -  lower_limit)=  random_detections.xyxy +  transformed_shift.reshape(- 1 , 1 )=  sv.metrics.MeanAveragePrecision().update(random_detections_list, annotations_list).compute()print (f"mAP50:  { mAP. map50:.2f}  +/-  { np. std(mAPs):.2f} " )
 
We can also modify the confidence values.
=  0 =  model.args['imgsz' ]=  []for  random_seed in  tqdm(range (100 )):=  []for  detections in  detections_list:=  deepcopy(detections)=  np.random.rand(len (detections))=  -  detections.xyxy.min (axis= 1 ) +  1e-6 =  max_size -  detections.xyxy.max (axis= 1 ) -  1e-6 =  lower_limit +  shift *  (upper_limit -  lower_limit)=  random_detections.xyxy +  transformed_shift.reshape(- 1 , 1 )=  np.random.rand(len (detections))=  sv.metrics.MeanAveragePrecision().update(random_detections_list, annotations_list).compute()print (f"mAP50:  { mAP. map50:.2f}  +/-  { np. std(mAPs):.2f} " )
 
So, in this case, our model got better than the random baseline with a single epoch.