Comparison of MLR, PLS and GA-MLR in QSAR analysis.
The use of the internet has evolved in quantitative structure-activity relationship (QSAR) over the past decade with the development of web based activities like the availability of numerous public domain software tools for descriptor calculation and chemometric toolboxes. The importance of chemometrics in QSAR has accelerated in recent years for processing the enormous amount of information in form of predictive mathematical models for large datasets of molecules. With the availability of huge numbers of physicochemical and structural parameters, variable selection became crucial in deriving interpretable and predictive QSAR models. Among several approaches to address this problem, the principle component regression (PCR) and partial least squares (PLS) analyses provide highly predictive QSAR models but being more abstract, they are difficult to understand and interpret. Genetic algorithm (GA) is a stochastic method well suited to the problem of variable selection and to solve optimization problems. Consequently the hybrid approach (GA-MLR) combining GA with multiple linear regression (MLR) may be useful in derivation of highly predictive and interpretable QSAR models. In view of the above, a comparative study of stepwise-MLR, PLS and GA-MLR in deriving QSAR models for datasets of alpha1-adrenoreceptor antagonists and beta3-adrenoreceptor agonists has been carried out using the public domain software Dragon for computing descriptors and free Matlab codes for data modeling.[1]References
- Comparison of MLR, PLS and GA-MLR in QSAR analysis. Saxena, A.K., Prathipati, P. SAR and QSAR in environmental research. (2003) [Pubmed]
Annotations and hyperlinks in this abstract are from individual authors of WikiGenes or automatically generated by the WikiGenes Data Mining Engine. The abstract is from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.About WikiGenesOpen Access LicencePrivacy PolicyTerms of Useapsburg