Discrimination of the notifiable pathogen Gyrodactylus salaris from G. thymalli (Monogenea) using statistical classifiers applied to morphometric data

E. S. McHUGH a1, A. P. SHINN a2c1 and J. W. KAY a1
a1 Department of Statistics, University of Glasgow, Glasgow G12 8QQ, Scotland, UK
a2 Institute of Aquaculture, University of Stirling, Stirling FK9 4LA, Scotland, UK


The identification and discrimination of 2 closely related and morphologically similar species of Gyrodactylus, G. salaris and G. thymalli, were assessed using the statistical classification methodologies Linear Discriminant Analysis (LDA) and k-Nearest Neighbours (KNN). These statistical methods were applied to morphometric measurements made on the gyrodactylid attachment hooks. The mean estimated classification percentages of correctly identifying each species were 98·1% (LDA) and 97·9% (KNN) for G. salaris and 99·9% (LDA) and 73·2% (KNN) for G. thymalli. The analysis was expanded to include another 2 closely related species and the new classification efficiencies were 94·6% (LDA) and 98·0% (KNN) for G. salaris; 98·2% (LDA) and 72·6% (KNN) for G. thymalli; 86·7% (LDA) and 91·8% (KNN) for G. derjavini; and 76·5% (LDA) and 77·7% (KNN) for G. truttae. The higher correct classification scores of G. salaris and G. thymalli by the LDA classifier in the 2-species analysis over the 4-species analysis suggested the development of a 2-stage classifier. The mean estimated correct classification scores were 99·97% (LDA) and 99·99% (KNN) for the G. salarisG. thymalli pairing and 99·4% (LDA) and 99·92% (KNN) for the G. derjaviniG. truttae pairing. Assessment of the 2-stage classifier using only marginal hook data was very good with classification efficiencies of 100% (LDA) and 99·6% (KNN) for the G. salarisG. thymalli pairing and 97·2% (LDA) and 99·2% (KNN) for the G. derjaviniG. truttae pairing. Paired species were then discriminated individually in the second stage of the classifier using data from the full set of hooks. These analyses demonstrate that using the methods of LDA and KNN statistical classification, the discrimination of closely related and pathogenic species of Gyrodactylus may be achieved using data derived from light microscope studies.

(Received November 5 1999)
(Revised March 23 2000)
(Accepted March 23 2000)

Key Words: Gyrodactylus; Monogenea; Gyrodactylus salaris; statistical classifiers; linear discriminant analysis; k-nearest neighbours.

c1 Corresponding author: Institute of Aquaculture, University of Stirling, Stirling FK9 4LA, Scotland. Tel: +44 1786 473171. Fax: +44 1786 472133. E-mail: