What makes a good QSAR model?

According to REACH regulation (Annex XI) an assessment using a QSAR model is valid if:

  1. the model is recognised as scientifically valid;
  2. the substance is included in the applicability domain of the model;
  3. results are adequate for classification and labelling and for risk assessment;
  4. adequate documentation of the methods is provided.’

These four ‘conditions’ are specific to the regulatory context, and address important scientific and practical aspects of the use of in silico models.  It is important to note that the four conditions are not just about the model, but also about the particular chemicals it is used for in an assessment, the particular regulatory function it is used for, and how well the model and its use are documented.  

A QSAR model is built on a set of experimental data for particular chemicals (a ‘training set’) and it is trialled by testing other chemicals for which there is also experimental data available (the ‘test set’).  In this way a QSAR model is designed to evaluate a particular set of chemicals in relation to a particular endpoint.  So while it could be used for other chemicals outside this ‘applicability domain’, the evaluation of those chemicals will have a higher level of uncertainty.  

For this reason, REACH and ECHA do not provide approval for models: each use of QSAR models has to be evaluated on its own merit, on a case-by-case basis.  

The first of the four conditions above is nevertheless about the model itself.  In the implementation of REACH, ECHA and others usefully refer to the five OECD principles for QSAR models.  The OECD stated that ‘to facilitate the consideration of a (Q)SAR model for regulatory purposes, it should be associated with the following information:

  1. a defined endpoint;
  2. an unambiguous algorithm;
  3. a defined domain of applicability;
  4. appropriate measures of goodness-of-fit, robustness and predictivity;
  5. a mechanistic interpretation, if possible.’

These five principles refer to the rigour of the model and the transparency of the information provided by the model developer.  The REACH demand to provide ‘adequate documentation of the models’ and their use, is central to demanding and ensuring the quality of models and the quality of their use, now and in the future.  It is ultimately about enabling the regulator to evaluate the chemical safety assessments independently in the interest of human health and the environment.

Related FAQ

What's next