< Back to previous page

Dataset

RGB-statistics derived from Nile red-stained reference plastics for the construction of the PIM (Polymer Identification Model)

Dataset containing RGB-statistics extracted from photographed fluorescent reference plastics stained with Nile red. The most abundantly produced plastic polymers worldwide considered for this dataset. The spectral data was used to construct a supervised machine learning model that allows to accurately identify the polymer types microplastics belong to, in a cost- and time-efficient way.,The dataset was built to train and validate the ‘Polymer Identification Model’ (PIM) in R and contains Red, Green and Blue (RGB) statistics extracted from Nile red-stained reference plastics (50-1200 μm) photographed under three different microscope filters (UV: Filter System A S, BP 340-380 nm; blue: Filter System I3 S, BP 450-490 nm; and green: Filter system N2.1 S, BP 515-560 nm) (LEICA DM 1000). Image analysis to extract all RGB-values was performed using a macro in ImageJ. The supervised machine learning model (CART algorithm) trained by and validated with this dataset predicts with high accuracy the polymer types microplastics belong to, in a cost- and time-efficient way. RGB statistics of the most abundantly produced plastic polymers worldwide were compiled into the dataset. The statistics itself were calculated per reference particle as the 10th, 50th and 90th percentile as well as the mean of each of the three different color components extracted from all pixels laying along the maximum Feret diameter of that photographed particle. The dataset contains RGB-statistics calculated through image analysis of 210 plastic particles belonging to seven polymer types (Nylon, PE, PET, PP, PS, PUR and PVC) (30 particles/polymer type), where 168 particles (1/4) were randomly selected and used to serve as training data (24 particles/polymer) (worksheet tab ‘training data’), while the remaining 42 particles (1/5) were kept as independent validation data (6 particles/polymer type) (worksheet tab ‘training data’).
Publication year:2021
Accessibility:open
Publisher:Vlaams Instituut voor de Zee
License:CC-BY-4.0