Subscribe to RSS

DOI: 10.1055/s-0045-1810080
Translation and Cross-Cultural Adaptation of the Neck Dissection Assessment Tool to Spanish Language
Authors
Funding The authors received no financial support for the research.
Abstract
Introduction
Training in Head and Neck surgery represents a complex process always in evolution. Due to the absence of specific surgical evaluation tools to assess neck dissection performance by residents during training, finding a proper way to evaluate those standards will be a matter of debate for decades, and continues to be a pending task in the field.
Objective
Validation and cross-cultural adaptation of the TSCND questionnaire to the Spanish language.
Methods
A prospective data collection, about the performance of Spanish TSCND measured with Cronbach α.
Results
Internal consistency of the task-specific checklist section measured was 0.752 (95% CI = 0.638 to 0.839) with an intraclass correlation coefficient of 0.749 (95% CI = 0.651–0.837). Internal consistency of the global rating scale section measured with Cronbach α was 0.788 (95% CI = 0.681 to 0.865) with an intraclass correlation coefficient of 0.789 (95% CI = 0.703–0.857).
Conclusion
The development of specific tools to support surgical training education makes it possible to improve surgical skills evaluation. The Spanish translation of the TSCND is a reliable option for Spanish-Speaking surgical training and trainers in the development of an up-to-date competency-based curriculum.
Keywords
assessment - competency - neck dissection - resident - translation - head neck - otolaryngologyIntroduction
Training in Head & Neck (H&N) surgery represents a complex process always in evolution. Finding a proper way to evaluate those standards will be a matter of debate for decades and continues to be a pending task in the field.
In some countries like the USA, an outcome initiative directed to collect reliable and accurate data that depicts the residents' ability to care for patients in the health care system was proposed in 1998, under the umbrella of the Accreditation Council for Graduate Medical Education (ACGME), requiring for US medical education programs to enhance assessment of resident performance during their training.[1]
In Spanish-speaking countries, despite the search for the same criteria of excellence during residency training in Otolaryngology-Head and Neck Surgery (OTL – HNS), we still have a lack of reliable instruments to objectively evaluate resident competency acquisition. And like in other systems, verbal comments among supervising physicians on a case-by-case basis is the main form of feedback that residents will obtain.[2]
Since its description by Crile in 1906, neck dissection (ND) remains one of the most relevant procedures during training in OTL-HNS residency programs.[3] In a recent systematic review conducted by Mercier et al. [4] the authors highlight the absence of specific surgical evaluation tools to assess neck dissection performance by residents during training.
Following these findings and considering the challenges and requirements of modern surgical education, Yang et al. developed the first tool assessing surgical competency in neck dissection. They provided preliminary evidence of feasibility, reliability, and validity of the Task Specific Checklist for Neck Dissection (TSCND).[5]
The aim of this study is to document the reliability, validity, and cross-cultural adaptation of the TSCND in the Spanish Language.
Methods
This study was approved by the Research and Ethics Committee (register code 2019/CCH011). The study was performed in accordance with the ethical standards laid down in the Declaration of Helsinki.
The TSCND is a questionnaire designed to evaluate resident's competency in neck dissection[5] inspired by the Objective Structured Assessment of Technical Skills (OSATS), commonly used in general surgery training.[6] This assessment tool uses a double checklist methodology: A global rating scale (GRS) which examines the resident's overall technical skills as well as fundamental surgical principles, and a task-specific checklist (TSC) which examines the resident's ability to perform each step of the procedure.
We followed the recommendations of the World Health Organization for the translation and adaptation of instruments and the cross-cultural adaptation of health-related quality of life measures proposed by Guillemin et al,[7] the following way: Stage I: first translation was carried out by two independent Otolaryngologists - Head & Neck Surgeons, Spanish English native speakers. No consensus was needed since the two translations were perfectly similar. Stage II: After translation the survey was sent to a panel of six experts in Head and Neck Surgery throughout Spain and Latin-America. Stage III: After consensus was reached, the definitive version was sent to a third bilingual English Spanish speaker Otorhinolaryngology - Head & Neck Surgeon for back-translation. Afterward, Stage IV: the comprehensibility of the translated questionnaire was evaluated in a group of residents to check the reliability of the questionnaire. Stage V: The definitive version of the questionnaire obtained was finally reviewed again by a panel of experts and a consensus was obtained ([Figs. 1] & [2]).






To assess the reliability of the Spanish translation of this questionnaire, five 3rd year residents (PGY-3) and five 4th year residents (PGY-4) from 2 different hospitals were evaluated during their H&N training rotation, prospectively by 6 different consultants between January 15th of 2020 and January 15th of 2022, when at least 2 different consultants evaluate the same resident on each single surgery. First and second-year residents were excluded given their limited involvement in neck dissection in the departments included. All residents and consultants agreed to participate in the validation process. During all the process, the evaluations remain confidential and written consent was obtained from each participant.
As in the validation study, assessing the feasibility of implementing the TSCND time needed to complete the evaluation was recorded in minutes. Furthermore, the time delay between the day of the surgery and the day the scoring grid was completed was recorded in days.
Neck dissection was performed according to common practice. Dissection of levels IIa and IIb, III and IV were the most often performed during the study period.
Statistical analysis was conducted with SPSS for Macintosh Version 21.0 (IBM Corp, Armonk, NY). Internal consistency was measured with Cronbach α and temporal stability by using the intraclass correlation coefficient (P < 0.05). To analyze potential differences between groups with continuous variables, we performed the Wilcoxon test for nonparametric data and student's t-test for parametric data. Differences between groups with categorical variables were computed by Chi-square test and Fisher's exact test. A p value <0.05 was considered significant.
In the psychometric evaluation of the instruments, Reliability refers to consistency or dependability after repeated measurements. Consistency, to evaluate which each question of the scale relates to the rest, using the Cronbach α method to measure it, and taking a value over 0.70 to suggest a strong construct validity, which gets better the closer it is to 1. And finally, Temporal Stability, known as test-retest reliability, was performed after 8 weeks; and it refers to the concordance between the scores of repeated measurements from the same participant. It can be assessed by the intraclass correlation coefficient, considering 0.70 as a good correlation value.
An exploratory factor analysis (EFA) was conducted to assess the underlying structure of the Spanish version of the TSC and GRS for Neck Dissection. Due to the exploratory nature of the study and the lack of prior evidence regarding the dimensionality of the instrument in the Spanish-speaking context, EFA was deemed appropriate. Prior to the analysis, raw item scores were screened, and non-numeric responses (e.g., "n/c") were treated as missing values. The data were standardized to ensure comparability across items. The factor analysis was performed using Principal Axis Factoring, and the number of factors to retain was determined based on the Kaiser criterion (eigenvalue > 1) and visual inspection of the scree plot. All analyses were conducted using Python 3.9 with the sklearn library.
Results
A total of 40 neck dissections (ND) were performed by 10 residents and evaluated by six supervising surgeons using the TSCND during the study period. Of these, 20 dissections (50%) were performed by five PGY-3 residents, and the remaining 20 (50%) by five PGY-4 residents. On average, each resident performed four neck dissections during the study period. Key steps such as identifying the accessory nerve (cranial nerve XI) and the posterior belly of the digastric muscle, dissecting level IIb, and addressing vascular structures were most strongly associated with higher overall scores.
Regarding the overall performance, the TSCND-TSC score represents the technical skills assessment for ND. The mean score across all residents was 60.2 points. Breaking this down by year of training, PGY-3 residents scored an average of 55.4 points, while PGY-4 residents achieved a mean score of 65 points. This difference was statistically significant (p = 0.001), indicating that more advanced training was associated with better technical performance.
The TSCND-GRS score reflects a general rating scale used to evaluate overall performance during the procedure. The mean score for all residents was 27.6. For PGY-3 residents, the average was 27.7 points, compared to 30.6 points for PGY-4 residents. However, this difference was not statistically significant (p = 0.228), suggesting that the variation in general performance between the two groups was less pronounced.
The Cronbach's α was used to assess internal consistency and reliability. When we used this test, values closer to 1 indicate that the test items measure the same underlying construct. For the TSC section, Cronbach's α was 0.752 (95% CI: 0.638–0.839), which suggests a good level of reliability. After removing individual test items, the α value remained above 0.731, indicating that no single item disproportionately influenced the test's reliability. Items related to defining the inferior and medial boundaries of the neck dissection showed particularly strong contributions to reliability ([Table 1]).
The intraclass correlation coefficient (ICC) measures agreement or consistency between evaluators. For the TSC section, the ICC was 0.749 (95% CI: 0.651–0.837), indicating substantial agreement among evaluators.
When we perform the subgroup analysis by residency year, the internal consistency of the TSC section was analyzed separately for each training year. For PGY-3 residents, Cronbach's α was 0.786 (95% CI: 0.658–0.879), and the ICC was 0.787 (95% CI: 0.650–0.889). For PGY-4 residents, Cronbach's α was 0.744 (95% CI: 0.540–0.782), and the ICC was 0.709 (95% CI: 0.549–0.749). These values indicate good reliability for both training groups. Removing individual test items did not significantly reduce reliability, with minimum Cronbach α values of 0.700 for PGY-3 residents and 0.691 for PGY-4 residents ([Table 1]).
The GRS section also showed strong internal consistency, with a Cronbach's α of 0.788 (95% CI: 0.681–0.865). This suggests that the items in the GRS reliably evaluate general surgical performance. Even after removing items, the α value remained above 0.759, with the highest reliability for items assessing assistant support and surgical progress ([Table 2]). The ICC for the GRS section was 0.789 (95% CI: 0.703–0.857), reflecting high agreement among evaluators.
For PGY-3 residents, Cronbach's α for the GRS section was 0.853 (95% CI: 0.751–0.954), indicating excellent reliability. The ICC was 0.823 (95% CI: 0.698–0.910), reflecting high consistency among evaluators. For PGY-4 residents, Cronbach's α was 0.718 (95% CI: 0.529–0.798), and the ICC was 0.713 (95% CI: 0.578–0.774), indicating good reliability. Also, removing items did not significantly affect consistency, with minimum Cronbach α values of 0.758 for PGY-3 residents and 0.699 for PGY-4 residents. These results demonstrate that the GRS is a reliable tool for evaluating performance across different levels of training ([Table 2]).
Exploratory factor analysis revealed that the Spanish version of the TSCND exhibited a unidimensional structure. The scree plot and eigenvalue analysis (Kaiser criterion) indicated that only one factor had an eigenvalue greater than 1, supporting the retention of a single factor. This factor accounted for most of the variance among the items. All items demonstrated acceptable loadings on factor analysis, with most items exceeding 0.6, indicating strong contributions to the latent construction of technical performance in neck dissection. Items such as "Conservación de la rama marginal" and "Area I" presented particularly high loadings, highlighting their relevance within the checklist framework. ([Table 3]) Exploratory factor analysis was performed also to examine the dimensionality of the GRS. The analysis revealed that the eigenvalue criterion and scree plot supported the retention of a single factor. However, the factor loading varied across the items. "Conocimientos del procedimiento especifico" showed the highest loading (0.79), indicating a strong contribution to the latent factor, presumably representing overall surgical competence. The remaining items, including "Respeto a los tejidos," "Tiempo/Progreso," "Manejo del instrumental," and "Uso de asistentes," exhibited lower loadings (range 0.22–0.34), suggesting a more moderate or weaker contribution to the underlying construct. These findings imply that while the GRS can be considered to measure an overarching domain of surgical competence, some items contribute less strongly, highlighting potential areas for refinement or further validation in larger samples. ([Table 4])
Finally, the average time required to complete the TSCND evaluation was 3.8 minutes (SD: 3.20), making it a quick and efficient tool. Supervising consultants typically completed their evaluations within two days of the surgical procedure (mean: 2 days, SD: 2.23).
Discussion
Neck dissection is considered a diagnostic and therapeutic procedure in H&N cancers patients, that allows eradication of regional lymph node metastasis. Previous data suggests that ND offers an effective and oncological safe surgical procedure in selected patients with clinically positive metastatic nodes in the neck.[8] [9] However, teaching and evaluation methods are different and mostly empirical across the world as well as work-hour restrictions, limitations about surgical availability, burnout or pressure related to clinical productivity make mandatory to design innovative assessment strategies.
Usually, surgical training deficiencies are identified at the end of training or in the first years of practice as a consultant when those are more difficult to improve. The TSCND was the first standardized method designed to assess resident competency in ND and track their progress during the training program. According to the best of our knowledge, this is the first study designed to perform a translation, validation, and cross-cultural adaptation of the TSCND from English to Spanish.
More recently, Dowling et al.[10] focused on selective neck dissection and designed for the evaluation of trainees according to the PGY-Level and the number of H&N surgical rotations. According to the authors, identification of key vascular structures as well as the omohyoid muscle and the posterior belly of the digastric muscle were the most strongly associated factors with the overall score, results in accordance with those observed in our cohort.
About the psychometric evaluation of the translated test, we demonstrated a good internal consistency of the TSCND-TSC and the TSCND-GRS over 0.7, a high specific year of training related consistency, also higher than 0.7 on both and a good intraclass correlation coefficient over 0.7 in all measures. Our results suggest that the questionnaire its reliable and useful for application in H&N Spanish-Speaking teaching environments. ([Table 1] & [2]). Moreover, the results of our factor and Kaiser analyses reinforce the robustness and reliability of the TSC and GRS evaluation frameworks in assessing surgical performance. Both analyses revealed that a single dominant factor accounts for most of the variance in each dataset, emphasizing the coherence of the assessment items and their alignment with a unified construct of surgical competence. The strong contributions of Total and PGY-3 scores highlight the reliability of these measures, while the moderate influence of PGY-4 scores suggests developmental progression in skill acquisition. These findings validate the use of TSC and GRS as effective tools for evaluating residents' technical skills and overall performance across different training levels, providing a solid foundation for refining surgical education and assessment methodologies. ([Table 3] & [4])
Results obtained in this analysis are like previously published data by Mercier et al.[11] and Yang et al.,[5] in the two first publication about the use of the TSCND and allow us to suggest implementation of this tool. As we mentioned above, consistency of each item was good when comparing the evolution from PGY-3 to PGY-4 in our cohort. This also corroborates that the TSCND may be able to measure a difference in scores between groups that are expected to have unequal surgical abilities. Regarding the need to fill both checklists, it is important to understand that each checklist provides different feedback. The TSCND-GRS evaluates fundamental surgical abilities, whereas the TSCND-TSC evaluates the resident's proficiency in each step of ND.
Regarding surgical and educational purposes, the application of methods like this will be interesting for resident evaluation. However, some major obstacles need to be overcome. The need for surgical time, faculty compromise, and coordination with different trainees' schedules and department logistics requires a significant compromise. It is here, where integration of these methods into the surgical curriculum will facilitate the participation in activities such as this.
The translation of this test was done in Spain. Even though Latin American Spanish medical and surgical terms is almost the same as mainland Spain Spanish. However, the main author and some reviewers of this translated version of TSCND are from Latin America and the Caribbean, so words and expressions used in these countries were used.
Finally, there are some issues regarding the application of the TSCND that need to be considered: 1) participating assessors using the tool in this study were not blinded about the residents' identity and level of training; 2) the small sample size due to the limited number of residents available on each participating department 3) the variability in the complexity of neck dissections that may impact scoring, and 4) the lack of surgical skill and instrumentation evolution evaluation using this method during the training process. In this vein, further studies are required to validate whether this discriminates competence or improvement over time, that no correlation can currently be drawn to patient outcomes.
Conclusion
In modern medicine, teaching methods are evolving. The development of specific tools to support surgical training education makes possible to improve case discussion between mentors and learners to stimulate individualized support. Due to the relevance of the Spanish Language, still the second most spoken language regarding the number of native speakers, the TSCND represents an option for Spanish-Speaking surgical training and trainers in the development of an up-to-date competency-based curriculum.
Conflict of Interest
The authors declare that there is no conflict of interest.
Authors' Contributions
CMCE: conceptualization, formal analysis, drafting; MMY: data curation; JRL: data curation; JASS: investigation; JAGG: supervision; EL: investigation; LRR: writing, review, editing; TA: conceptualization, drafting.
Data Availability Statement
The data that support the findings of this study are available from the authors upon reasonable request.
-
References
- 1 Swing SR. The ACGME outcome project: retrospective and prospective. Med Teach 2007; 29 (07) 648-654
- 2 Brown DJ, Thompson RE, Bhatti NI. Assessment of operative competency in otolaryngology residency: Survey of US Program Directors. Laryngoscope 2008; 118 (10) 1761-1764
- 3 Crile G. Excision of cancer of the head and neck. With special reference to the plan of dissection based on 132 patients. JAMA 1906; 47: 1780-1786
- 4 Mercier É, Chagnon-Monarque S, Lavigne F, Ayad T. Objective assessment of technical skills in otorhinolaryngology – head and neck surgery residents: a systematic review. Otolaryngol Head Neck Surg 2018; 158 (01) 54-61
- 5 Yang N, Mercier É, Guertin L. et al. Development, translation, and preliminary validation of the neck dissection assessment tool. Head Neck 2021; 43 (08) 2307-2315
- 6 Faulkner H, Regehr G, Martin J, Reznick R. Validation of an objective structured assessment of technical skill for surgical residents. Acad Med 1996; 71 (12) 1363-1365
- 7 Guillemin F, Bombardier C, Beaton D. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines. J Clin Epidemiol 1993; 46 (12) 1417-1432
- 8 López F, Fernández-Vañes L, García-Cabo P. et al. Selective neck dissection in the treatment of head and neck squamous cell carcinoma patients with a clinically positive neck. Oral Oncol 2020; 102: 104565
- 9 Lassaletta L, García-Pallarés M, Morera E. et al. Functional neck dissection for the clinically negative neck: effectiveness and controversies. Ann Otol Rhinol Laryngol 2002; 111 (02) 169-173
- 10 Dowling E, Larson D, Carlson ML, Price DL. Development and Validation of Instrument for Operative Competency Assessment in Selective Neck Dissection. Ann Otol Rhinol Laryngol 2022; ;34894221081101
- 11 Mercier É, Guertin L, Bissada E. et al. Assessment of surgical competence for neck dissection: a pilot study. Can J Surg 2022; 65 (02) E178-E187
Address for correspondence
Publication History
Received: 02 November 2023
Accepted: 20 June 2025
Article published online:
09 October 2025
© 2025. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution 4.0 International License, permitting copying and reproduction so long as the original work is given appropriate credit (https://creativecommons.org/licenses/by/4.0/)
Thieme Revinter Publicações Ltda.
Rua Rego Freitas, 175, loja 1, República, São Paulo, SP, CEP 01220-010, Brazil
Carlos M. Chiesa-Estomba, Miguel Mayo-Yanez, Jerome R. Lechien, Jon Alexander Sistiaga-Suarez, José Ángel González-García, Ehkiñe Larruscain, Laura Rodrigáñez-Riesco, Tareck Ayad. Translation and Cross-Cultural Adaptation of the Neck Dissection Assessment Tool to Spanish Language. Int Arch Otorhinolaryngol 2025; 29: s00451810080.
DOI: 10.1055/s-0045-1810080
-
References
- 1 Swing SR. The ACGME outcome project: retrospective and prospective. Med Teach 2007; 29 (07) 648-654
- 2 Brown DJ, Thompson RE, Bhatti NI. Assessment of operative competency in otolaryngology residency: Survey of US Program Directors. Laryngoscope 2008; 118 (10) 1761-1764
- 3 Crile G. Excision of cancer of the head and neck. With special reference to the plan of dissection based on 132 patients. JAMA 1906; 47: 1780-1786
- 4 Mercier É, Chagnon-Monarque S, Lavigne F, Ayad T. Objective assessment of technical skills in otorhinolaryngology – head and neck surgery residents: a systematic review. Otolaryngol Head Neck Surg 2018; 158 (01) 54-61
- 5 Yang N, Mercier É, Guertin L. et al. Development, translation, and preliminary validation of the neck dissection assessment tool. Head Neck 2021; 43 (08) 2307-2315
- 6 Faulkner H, Regehr G, Martin J, Reznick R. Validation of an objective structured assessment of technical skill for surgical residents. Acad Med 1996; 71 (12) 1363-1365
- 7 Guillemin F, Bombardier C, Beaton D. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines. J Clin Epidemiol 1993; 46 (12) 1417-1432
- 8 López F, Fernández-Vañes L, García-Cabo P. et al. Selective neck dissection in the treatment of head and neck squamous cell carcinoma patients with a clinically positive neck. Oral Oncol 2020; 102: 104565
- 9 Lassaletta L, García-Pallarés M, Morera E. et al. Functional neck dissection for the clinically negative neck: effectiveness and controversies. Ann Otol Rhinol Laryngol 2002; 111 (02) 169-173
- 10 Dowling E, Larson D, Carlson ML, Price DL. Development and Validation of Instrument for Operative Competency Assessment in Selective Neck Dissection. Ann Otol Rhinol Laryngol 2022; ;34894221081101
- 11 Mercier É, Guertin L, Bissada E. et al. Assessment of surgical competence for neck dissection: a pilot study. Can J Surg 2022; 65 (02) E178-E187







