Published online by Cambridge University Press: 01 February 2011
Standard machine-learning algorithms were used to build models capable of predicting the molecular weights of polymers generated by a homogeneous catalyst. Using descriptors calculated from only the two-dimensional structures of the ligands, the average accuracy of the models on an external validation data set was approximately 70%. Because the models show no bias and perform significantly better than equivalent models built using randomized data, we conclude that they learned useful rules and did not overfit the data.