Methodology to Weight Evaluation Areas from Autism
Spectrum Disorder ADOS-G Test with Artificial
Neural Networks and Taguchi Method
M. Reyes * P. Ponce * D. Grammatikou * A. Molina * * Tec de Monterrey, CCM. |
Keywords: Autism Spectrum Disorder (ASD), diagnosis, screening, ADOS-G, Artificial Neural Networks, Feed-forward networks, Taguchi Method, Orthogonal Arrays, classify. |
Correspondencia: |
Palabras clave: Trastorno del Espectro Autista (TEA), diagnostico, detección, ADOS-G, Redes neuronales artificiales, Método de Taguchi, arreglos ortogonales, clasificación. |
Table 1. ASD screening and diagnostic tools | ||||
Tool | Type of tool | Age range | Advantages | Disadvantages |
Checklist for Autism in Toddlers (CHAT) [11] | Screening test | 18 months | Quick application | Low detection capacity |
Modified Checklist for Autism in Toddlers Revised with Follow-Up (MCHAT-R/F) [12] | Screening test | 16 - 30 months | The predictive capacity increases when used with a clinician’s interview | Low positive predictive capacity. Large number of false positives. |
Screening Tool for Autism in Two-Year- Olds (STAT) [13] | Screening test | 12-23 months | High sensibility | Sensitivity and Specificity based only on 12 cases. |
Infant Toddler Checklist (ITC) [14] | Screening test | Less than 18 months | High sensibility | Does not differentiate between ASD and any other developmental disorder. |
Childhood Autism Rating Scale (CARS) [15] | Diagnostic test | Starting from 24 months | Quantitative tool that evaluates the severity of the symptoms. Also useful to control evolution of the patient after treatment. | Can misdiagnose ASD in children with intellectual disabilities. |
Autism Diagnostic Observation Schedule - Generic (ADOS-G) [16] | Diagnostic test | It can be used for children over 2 years of mental age or in adults | Direct observation of the child interaction through specified activities. | Requires clinical training and practice to observe and evaluate. Takes around 30 minutes to perform the activities and then some more time to evaluate the algorithm.
|
Autism Diagnostic Interview-Revised (ADI-R) [17] | Diagnostic test | Starting from 18 months | Interview answered by parents that help distinguish ASD from other disorders. | Takes from 1 to 2 hours to apply because it has 93 questions with multiple options. |
Figure 1. Multi-layered Artificial Neural Network
Figure 2. ANN learning types [21], [22].
Tool name | Reference | Tool description | Advantage | Disadvantage | Minimization of training sample |
Artificial Neural Network (ANN) trained with Backpropagation of the error | Cohen & Sudhalter. (1993) A Neural Network Approach to the Classification of Autism [25] | ANN created to discriminate between Autism and Mental retardation based on the Autism Behavior Interview (ABI) | Increased accuracy from discriminant function analysis with 85% to 92% with ANN. | Based on the DSM-III which evaluates differently Autism compared to the most current version DSM-V. Selection of 11 from 28 questions of the ABI tool which is not validated as a gold standard for Autism diagnosis. 138 samples needed. | No |
Knowledge Based Screener (KBS) | Veeraraghavan & Srinivasan (2007). Exploration of Autism using Expert Systems [26] | Ruled based expert system with factual and heuristic knowledge to analyze children development and identify developmental disorders. | Available through internet | Does not mention if the knowledge is obtained from a standardized tool or only from clinical experience. Does not mention if it was tested. | No |
Neuro fuzzy system | Arthi & Tamilarasi (2008). Prediction of autistic disorder using neuro- fuzzy system by applying ANN technique [27] | Neurofuzzy system converts inputs from a parent answered questionnaire which is converted to fuzzy membership values. Those values are evaluated with if-then rules and the fuzzy output becomes the input for the artificial neural network trained with backpropagation method. | Helps diagnosing autism with an overall performance of 85-90% | Not based on a certified test, depends on the expertise of the clinicians that help to construct the system. Started with 40 samples and needed to increase to 194 to increase training performance. | No |
Alternating Decision tree (ADT) | Wall, Kosmicki, DeLuca, Harstad & Fusaro (2012). Use of machine learning to shorten observation-based screening and diagnosis of autism [28] | ADTree classifier consisting of 8 questions from the ADOS Module1 tool. | Reduction from 29 to 8 items to classify autism with 99.8% accuracy with False positive rate of 0 and True positive rate of 1 | Large sample needed for system training (623 individuals) | No |
Alternating Decision tree (ADT) | Wall, Dally, Luyster, Jung, DeLuca (2012). Use of Artificial Intelligence to shorten the behavioral Diagnosis of Autism [29] | Decision tree classifier to detect autism rapidly through 7 questions from the ADI-R tool | Reduction from 93 to 7 questions to classify autism with 99.9% accuracy with False positive rate of .013 and True positive rate of 1 | Large sample needed for system training (966 individuals) | No |
Table 3. Orthogonal Array L27
METHODThis paper presents the methodology used to find the most significant items from the ADOS-G tool to detect Autism Spectrum Disorders through Feed-forward Artificial Neural Networks with back-propagation training. The number of cases for the network training data was minimized using the Taguchi method with Orthogonal Arrays. The methodology starts by defining the Autism diagnosis tool; in this case, the ADOS-G was selected for being an international validated tool considered one of the gold standards for Autism detection [16]. The algorithm for this tool evaluates 12 items with 3 possible states. That means that the complete factorial design would be of 531, 441 cases. The next step was to reduce the number of cases to train the ANN, it has been mentioned that the L27 orthogonal array should be selected for the number of parameters and states. Since the OA shown in Table 3 considers the states 1, 2 and 3 and the ADOS-G algorithm consists of three states 0, 1 and 2, Table 4 was created as the combination of cases that was used to train the ANN containing the items evaluated with the possible states. Since the information of column 13 is included in the other 12, only 12 columns were used. The 27 cases were evaluated with the ADOS-G algorithm. The sum of the first 5 items should be greater or equal than 4, the sum of the next 7 items should be greater or equal to 7 and the sum of all the 12 items should be greater or equal to 12. Only when these three conditions are met, then the case is diagnosed as Autism. This algorithm evaluation is shown as the last column in Table 4. |
Table 4. L27 Orthogonal array evaluated with ADOS-G test rules.
The next step was to train the ANN. Since both inputs and desired outputs are available, a supervised artificial neural network was created using Matlab software [32]. The ANN was trained using the back-propagation method and it consists of 3 layers, the input layer has 40 neurons, the hidden layer has 60 and the output layer has 1 neuron (see Figure 4). The 12 inputs, which are the same 12 items that the ADOS-G algorithm evaluates, can have values of 0, 1 or 2. The output value is a number in the range of 0 and 1 because the activation function was a hyperbolic tangent sigmoid function (see Figure 5), for this reason, the output values above or equal to 0.5 are considered as Autism spectrum disorders and below 0.5 and zero are considered as non-Autism spectrum disorders. Once the ANN was created, validation of the network was performed. It is important to notice that it is a common practice for ANN training to perform a cross validation method to estimate the performance of the learning algorithm. K-fold cross validation consists of dividing the total number of cases available in k parts, so that the k% of the cases are used only for validation while the 100-k% is used for training. The training is repeated until all k parts have been used for validation. One of the most used k% for machine learning is the 10-fold cross-validation which means that 90% of the samples are used to train the network, and the other 10% are used for testing its accuracy. Another validation form is the hold out validation, which avoids the overlapping of train data and validation data, the available data is held out during training and used only for validation purpose. The problem with this type of validation is that the results are highly dependent on the choice of the training data [33]. For the presented work here, the hold out validation method was used. It makes no sense to divide the orthogonal array of 27 cases into two parts (training and validation), because the 27 cases are meant to be the most representative combinations in this method. Therefore the complete orthogonal array of 27 cases was taken as training data. In order to validate the network, 11 different cases were used. These 11 cases were obtained from real children evaluated with the ADOS-G tool by a Psychologist. That means that for this work, a total of 38 cases were used from which 71% were used for training (27 cases from the orthogonal array) and 29% were held out for validation (11 cases from real children). After the ANN was validated, the following step was to classify the 12 items from the ADOS-G tool into impact degrees for Autism diagnosis. Tests and results from the ANN were observed to find the factors that consistently generate an Autism diagnosis. The 12 items were classified within 3 ranges of impact: low, medium and high. The complete methodology is represented as a flow diagram in Figure 6. |
Figure 4. ANN network designed with OA L27
RESULTS AND DISCUSSIONValidation of the ANN was performed with11 real cases that were not used for training before. First the 11 cases were diagnosed by a Psychologist based on clinical observation of the DSM-V parameters [6], the psychologist diagnosed 6 cases as Autism Spectrum Disorder and 5 were diagnosed as no Autism Spectrum Disorder. The same 11 real cases were diagnosed with the ADOS-G algorithm and the same 6 cases diagnosed as ASD by the psychologist were also diagnosed by the ADOS-G algorithm and the same for the 5 non ASD cases. Finally the 11 cases were tested on the ANN. Remembering that values from the ANN output above or equal to 0.5 are considered ASD, 6 true positive cases were classified as ASD and 5 true negative cases were classified as non-ASD. These results yield to a sensitivity of 1 and specificity of 1. Once the ANN was trained and validated, the following step was to classify the 12 factors through their impact on diagnosis. Observation of the orthogonal array was needed to find the factors that consistently generate an ASD diagnosis. Then using the ANN, several tests were performed to classify the 12 areas within 3 ranges of impact: low, medium and high as shown in Table 6. It can be observed in Table 6 first row, that the factors classified as high (A2, B5 and B9) when assigned a value of 2 and zero for the rest, provide an output of 0.832 which is an Autism diagnosis. Only the combination of those 3 areas already provides an Autism diagnosis. This is the reason why they are called high impact factors. Medium and low impact factors alone diagnose no Autism; see Table 6 rows 2 and 3. When high impact factors are weighted in 2 and medium factors in 1, the diagnosis get a value of 0.996 which is even higher than the high impact factors alone, see row 6 from Table6. It can also be observed that the Low impact areas have a minimum relevance for diagnosing Autism when combined with high or Medium impact factors see rows 6-9 in Table 6. By classifying the areas from the test in three ranges, it allows the user to focus more on the High and Medium impact areas but still considers the Low one for specific cases, see Table 6.
In Table 7, the 12 items from ADOS-G tool algorithm are classified according to their impact range as High, Medium or Low according to the tests performed with the ANN. The Codes column refers to the ADOS-G tool code for easier identification. The order of the items within each impact range was not selected specifically. A comparison was made between the result obtained from the work here presented and the work presented by Wall, Kosmicki, DeLuca, Harstad & Fusaro [28] to reduce from 29 to 8 items in the ADOS-G tool in order to classify autism using an Alternating Decision Tree. Table 8 presents the summary of the 8 items that they found. B2, C1 and C2 are items that are evaluated during the activities in the ADOS-G tool, but they are not included in the diagnosis algorithm. It is interesting to see that the 3 high impact factors A2, B5 and B9, one medium impact factor B1 and one low impact factor B10 are included in Wall´s items as well. |
Table 5. ANN validation
Table 6. Impact area tests
ConclusionsArtificial Neural Networks can be used for Autism Spectrum Disorder detection. Due to the fact that ANN can learn by examples, a Feed-forward network was trained with back-propagation method to approximate Autism diagnosis based on the ADOS-G tool algorithm. The training samples were selected as an orthogonal array using the Taguchi method to pick the least number of combinations that would be a representative sample suitable for training. The Design of Experiments through the Taguchi Method reduces considerably the number of cases used to train the ANN from 531,441 to 27, which reduces as well training time and computer resources. It was observed that the network provides an accuracy of 100% for Autism diagnosis, with specificity and sensitivity of 1, validated against the ADOS-G algorithm and a psychologist evaluation based on the DSM-V. A general advantage of ANN is that they can create approximations of an unknown system when trained by examples. This same advantage can turn into a disadvantage when the model of the system is needed to perform certain actions such as to control or to observe it. As every tool, ANN should be analyzed before using it with each specific situation. The designed ANN was used to classify the 12 items from the ADOS-G tool algorithm into three impact ranges Low, Medium and High. It can be said that Showing, Shared enjoyment in Interaction and Frequency of vocalization directed to others are the three items of high impact for Autism detection. The medium impact items are Stereotyped use of Words or Phrases, Unusual eye contact, Use of other’s body to communicate, Pointing, Facial expression directed to others and Response to joint attention. The items that influence the least are Gestures, Spontaneous initiation of joint attention and Quality of Social overtures. The combination of High impact with Medium impact factors can improve the value obtained during diagnosis. This classification was compared to the work done by [28]. The big difference between both works is that they used 623 individuals to train the ADT while the methodology here presented used only 27 cases using the Taguchi method to select the training data.
References
|