ABSTRACT
Objective: To evaluate the intra- and interobserver agreement of the Sauvegrain and Greulich and Pyle methods. Material and methods: This is an observational, retrospective, cross-sectional study, ethically approved under opinion no. 6.192.391. One hundred radiographic images of the elbow and 100 of the left wrist and hand of children were collected, selected by a researcher who did not take part in the evaluations. The Sauvegrain and Greulich and Pyle methods were used to determine bone age. Each method was explained in detail, and the evaluators received a file with the study images. After three weeks, the exams were randomized and the radiographs were reevaluated. Of the 100 patients in group A, 61 (61%) were boys and 39 (39%) were girls. In group B, 67 (67%) were boys and 33 (33%) were girls. Four statistical analyses were used: correlation; intraclass correlation; Bland-Altman plot analysis; and differences between groups. Results: Intra- and interobserver agreement between groups was considered excellent. Conclusions: Despite the excellent agreement, group A presented a significantly better value than group B. Biological ages show a greater difference from chronological ages in group A. In group B, skeletal and chronological ages show no statistical difference according to the accuracy test. Level of Evidence III, Cross-Sectional Observational Study.
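The Bland-Altman analysis named in the methods comes down to a mean difference (bias) and 95% limits of agreement between paired readings. The following is a minimal sketch of that calculation, not the study's own procedure. The function name `bland_altman` and the sample bone ages are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev


def bland_altman(first, second):
    """Return (bias, lower_limit, upper_limit) for paired measurements."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equally long series with at least 2 pairs")
    diffs = [a - b for a, b in zip(first, second)]
    bias = mean(diffs)   # systematic difference between the two readings
    sd = stdev(diffs)    # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical bone ages (years) from two readings of the same radiographs
reading_1 = [10.0, 11.5, 12.0, 13.5, 14.0]
reading_2 = [10.5, 11.0, 12.5, 13.0, 14.5]
bias, lower, upper = bland_altman(reading_1, reading_2)
print(f"bias={bias:.2f} years, limits of agreement=({lower:.2f}, {upper:.2f})")
```

A bias close to zero with narrow limits suggests the two readings can be used interchangeably. How narrow is narrow enough is a clinical judgment that the calculation itself does not settle.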
ABSTRACT
INTRODUCTION: Although it is the most widely used exam today, few studies have evaluated the accuracy of findings on non-contrast magnetic resonance imaging (MRI). The primary objective of the study was to evaluate the sensitivity, specificity, positive predictive value, negative predictive value, and accuracy of non-contrast MRI findings in frozen shoulder, alone and in combination. The secondary objectives were to define the interobserver and intraobserver agreement of the assessments and the odds ratio for frozen shoulder associated with the various MRI findings. METHODS: A retrospective diagnostic accuracy study comparing non-contrast MRI findings between a frozen shoulder group and a control group. Sensitivity, specificity, positive and negative predictive values, accuracy, odds ratio, and interobserver and intraobserver agreement were calculated for each finding and their possible associations. RESULTS: Hyperintensity of the capsule in the axillary recess presented 84% sensitivity, 94% specificity, and 89% accuracy. Obliteration of the subcoracoid fat triangle in the rotator interval had 34% sensitivity, 82% specificity, and 58% accuracy. Coracohumeral ligament thickness ≥ 2 mm had 66% sensitivity, 48% specificity, and 57% accuracy. Capsule thickness in the axillary recess ≥ 4 mm resulted in 54% sensitivity, 82% specificity, and 68% accuracy. Regarding interobserver agreement, only the posteroinferior and posterosuperior quadrants showed moderate results; all the others showed strong reliability. The odds ratio for frozen shoulder with hyperintensity in the axillary recess was 82.3. The association of these findings increased specificity (95%). CONCLUSION: The accuracy of non-contrast magnetic resonance imaging is high for diagnosing frozen shoulder, especially when evaluating hyperintensity of the axillary recess. The exam has high reliability and reproducibility. The presence of an association of signs increases the specificity of the test.
LEVEL OF EVIDENCE: Level III, study of diagnostic test.
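The reported measures all derive from a 2x2 table of MRI finding against diagnosis. The sketch below shows those formulas under one loud assumption: equal group sizes of 100, which the abstract does not state. The counts are hypothetical, chosen only to reproduce the 84% sensitivity and 94% specificity reported for axillary recess hyperintensity, and `diagnostic_metrics` is an illustrative name.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Compute standard diagnostic accuracy measures from 2x2 counts.

    Raises ZeroDivisionError if a denominator is zero (e.g. fn * fp == 0).
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
        "odds_ratio": (tp * tn) / (fn * fp),
    }


# Hypothetical counts: 100 frozen shoulders and 100 controls (assumed, not reported)
metrics = diagnostic_metrics(tp=84, fn=16, tn=94, fp=6)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

With equal groups, accuracy is the midpoint of sensitivity and specificity, which is the pattern seen in each of the four findings reported. Under these assumed counts the odds ratio comes out at 82.25, in line with the reported 82.3.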
Subjects
Bursitis, Shoulder Joint, Humans, Retrospective Studies, Reproducibility of Results, Shoulder Joint/pathology, Magnetic Resonance Imaging/methods, Bursitis/diagnostic imaging, Sensitivity and Specificity
ABSTRACT
Objective: To assess the reliability of phase-sensitive inversion recovery (PSIR) magnetic resonance imaging (MRI) and its accuracy for determining the topography of demyelinating cortical lesions in patients with multiple sclerosis (MS). Materials and Methods: This was a cross-sectional study conducted at a tertiary referral center for MS and other demyelinating disorders. We assessed the agreement among three raters for the detection and topographic classification of cortical lesions on fluid-attenuated inversion recovery (FLAIR) and PSIR sequences in patients with MS. Results: We recruited 71 patients with MS. The PSIR sequences detected 50% more lesions than did the FLAIR sequences. For detecting cortical lesions, the level of interrater agreement was satisfactory, with a mean free-response kappa (κFR) coefficient of 0.60, whereas the mean κFR for the topographic reclassification of the lesions was 0.57. On PSIR sequences, the raters reclassified 366 lesions (20% of the lesions detected on FLAIR sequences), with excellent interrater agreement. There was a significant correlation between the total number of lesions detected on PSIR sequences and the Expanded Disability Status Scale score (ρ = 0.35; p < 0.001). Conclusion: It seems that PSIR sequences perform better than do FLAIR sequences, with clinically satisfactory interrater agreement, for the detection and topographic classification of cortical lesions. In our sample of patients with MS, the PSIR MRI findings were significantly associated with the disability status, which could influence decisions regarding the treatment of such patients.
ABSTRACT
Hemoglobin and hematocrit are widely used parameters. They can be obtained from an automated hematology analyzer or from an arterial blood gas analyzer. Their variability is shown in the article "Variability of hemoglobin and hematocrit determined in blood gas equipment." The requested clinical and statistical information is expanded here to allow a better understanding of the article and its conclusions. An analysis of variability across parameters and laboratory equipment is suggested.
Subjects
Hemoglobins, Humans, Hematocrit
ABSTRACT
BACKGROUND: The training needed to perform obstetric ultrasound is rarely reported. The aim of this study was to determine whether the training of the ultrasonographer influences the prenatal diagnostic certainty of some congenital malformations. METHODS: We conducted a retrospective evaluation of the antepartum sonographic findings of newborn infants ultimately found to have a congenital anomaly at a tertiary-level pediatric reference center. Data were collected on admission for consecutive patients. Demographic variables of the mother's pregnancy and delivery and those of the prenatal ultrasound (PUS) were analyzed and correlated with the final diagnosis. RESULTS: Sixty-seven neonates were included. All cases underwent PUS, with a mean of 4.6. Prenatal diagnosis was established in 24 cases (35.8%). Thirteen surgical anomalies were detected, particularly anorectal malformation and gastroschisis. The accuracy of PUS was associated with the training of the physician performing it: the most accurate PUS were performed by gynecologists and maternal-fetal specialists, compared with radiologists and general practitioners (p = 0.005). Patients without an accurate prenatal diagnosis had a greater risk of presenting comorbidities (relative risk [RR]: 1.65; 95% confidence interval [CI]: 1.299-2.106; p < 0.001). CONCLUSIONS: In our setting, prenatal diagnosis of these malformations is directly determined by the training of the person performing the ultrasound.
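The reported relative risk and its interval can be checked for internal consistency. The sketch below assumes the interval is a Wald interval built on the log scale, which is the usual choice for ratios but is not stated in the abstract. The function name `log_scale_summary` is illustrative.

```python
import math


def log_scale_summary(estimate, lower, upper, z_crit=1.96):
    """Recover the log-scale standard error, Wald z, and p-value from a ratio and its CI."""
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(estimate) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided p under a normal approximation
    midpoint = math.sqrt(lower * upper)         # geometric mean of the two limits
    return se, z, p_value, midpoint


# Values reported in the abstract: RR 1.65, 95% CI 1.299-2.106
se, z, p_value, midpoint = log_scale_summary(1.65, 1.299, 2.106)
print(f"log SE={se:.3f}, z={z:.2f}, p={p_value:.1e}, midpoint={midpoint:.3f}")
```

Under that assumption, the geometric midpoint of the limits falls on the reported estimate of 1.65 and the implied p-value is below 0.001, so the three reported numbers agree with one another.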
Subjects
Prenatal Diagnosis, Surgeons, Pregnancy, Female, Newborn Infant, Child, Humans, Retrospective Studies, Prenatal Ultrasonography
ABSTRACT
BACKGROUND: Pediatric hepatic steatosis is a global public health concern, as an increasing number of children are affected by this condition. Liver biopsy is the gold standard diagnostic method; however, this procedure is invasive. Magnetic resonance imaging (MRI)-derived proton density fat fraction has been accepted as an alternative to biopsy. However, this method is limited by cost and availability. Ultrasound (US) attenuation imaging is an emerging tool for noninvasive quantitative assessment of hepatic steatosis in children. A limited number of publications have focused on US attenuation imaging and the stages of hepatic steatosis in children. OBJECTIVE: To analyze the usefulness of US attenuation imaging for the diagnosis and quantification of hepatic steatosis in children. MATERIAL AND METHODS: Between July and November 2021, 174 patients were included and divided into two groups: group 1, patients with risk factors for steatosis (n = 147), and group 2, patients without risk factors for steatosis (n = 27). In all cases, age, sex, weight, body mass index (BMI), and BMI percentile were determined. B-mode US (two observers) and US attenuation imaging with attenuation coefficient acquisition (two independent sessions, two different observers) were performed in both groups. Steatosis was classified into four grades (0: absent, 1: mild, 2: moderate and 3: severe) using B-mode US. Attenuation coefficient acquisition was correlated with the steatosis score using Spearman's correlation. Interobserver agreement of the attenuation coefficient acquisition measurements was assessed using intraclass correlation coefficients (ICC). RESULTS: All attenuation coefficient acquisition measurements were satisfactory, without technical failures. The median values for group 1 were 0.64 (0.57-0.69) dB/cm/MHz for the first session and 0.64 (0.60-0.70) dB/cm/MHz for the second session. The median values for group 2 were 0.54 (0.51-0.56) dB/cm/MHz for the first session and 0.54 (0.51-0.56) dB/cm/MHz for the second. The average attenuation coefficient acquisition was 0.65 (0.59-0.69) dB/cm/MHz for group 1 and 0.54 (0.52-0.56) dB/cm/MHz for group 2. There was excellent interobserver agreement at 0.94 (95% CI 0.92-0.96). There was substantial agreement between both observers (κ = 0.77, P < 0.001). There was a positive correlation between US attenuation imaging and B-mode scores for both observers (r = 0.87, P < 0.001 for observer 1; r = 0.86, P < 0.001 for observer 2). Attenuation coefficient acquisition median values were significantly different for each steatosis grade (P < 0.001). In the assessment of steatosis by B-mode US, the agreement between the two observers was moderate (κ = 0.49 and κ = 0.55, respectively, with P < 0.001 in both cases). CONCLUSION: US attenuation imaging is a promising tool for the diagnosis and follow-up of pediatric steatosis, which provides a more repeatable form of classification, especially at low levels of steatosis detectable in B-mode US.
Subjects
Fatty Liver, Non-alcoholic Fatty Liver Disease, Humans, Child, Liver/diagnostic imaging, Liver/pathology, Fatty Liver/diagnostic imaging, Fatty Liver/pathology, Ultrasonography/methods, Biopsy, Magnetic Resonance Imaging/methods, ROC Curve
ABSTRACT
INTRODUCTION: Proprioception plays an important role in the stability of the shoulder joint. However, clinical practice lacks reliable and user-friendly tools to assess it. OBJECTIVES: To evaluate the intra- and inter-rater reliability of the Laser-Pointer assisted Angle Reproduction Test (LP-ART), to analyze the difference in proprioception between symptomatic and asymptomatic shoulders, and to investigate whether the LP-ART correlates with pain intensity, assessed by the 11-point Numerical Rating Pain Scale (NRPS), and with the level of shoulder pain and disability, assessed by the Shoulder Pain and Disability Index (SPADI-BR). METHODS: Fifty patients (age = 56.2 ± 10.4 years) performed the LP-ART at 90° of shoulder flexion. RESULTS: The intra- and inter-rater reliability of the LP-ART measurements was moderate (intraclass correlation coefficient, ICC(2,3) = 0.41 to 0.65) for both symptomatic and asymptomatic shoulders. There was no difference in the absolute angular deviation between shoulders (mean difference of 0.4°, P = .581). The absolute angular deviation was not significantly correlated with pain intensity (rs = 0.007, P = .962) or with the SPADI-BR (rs = 0.022, P = .881). CONCLUSION: The LP-ART measurement showed moderate reliability in participants with subacromial pain syndrome. Active joint position sense did not differ between symptomatic and asymptomatic shoulders, and proprioception was not correlated with pain intensity or with the level of shoulder pain and disability.
Subjects
Shoulder Joint, Shoulder Pain, Humans, Middle Aged, Aged, Shoulder Pain/diagnosis, Reproducibility of Results, Shoulder, Lasers
ABSTRACT
Objective: To evaluate the intra- and inter-observer agreement of the Neer, AO, and AO/OTA classification systems for proximal humerus fractures in adults. Methods: In total, 100 X-rays of patients with proximal humerus fractures were selected according to the inclusion and exclusion criteria established for this study. They were assessed by four evaluators with different levels of expertise. The evaluation was performed at two distinct moments, with an interval of 21 days between analyses. Images were randomized for the second evaluation by a researcher who did not participate in the image selection process. Fleiss' kappa test was used to evaluate intra- and inter-observer agreement. Results: We observed substantial agreement, with k = 0.669, k = 0.715, and k = 0.780 for the Neer, AO, and AO/OTA classification systems, respectively. Conclusion: Intra-observer agreement improved in the second evaluation. In the first evaluation, we obtained values of k = 0.724, k = 0.490, and k = 0.599 for the Neer, AO, and AO/OTA classifications, respectively. In the second evaluation, the values were k = 0.759, k = 0.772, and k = 0.858. Therefore, agreement went from moderate to substantial for the AO classification and from moderate to practically perfect for the AO/OTA classification. The level of inter-observer agreement was substantial (0.61-0.80), with k = 0.669, k = 0.715, and k = 0.780 for the Neer, AO, and AO/OTA classifications, respectively. Level of Evidence III, Cross-Sectional Observational Study.
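Fleiss' kappa, the statistic used above, compares the observed agreement among several raters with the agreement expected by chance. The sketch below gives the standard formula together with an interpretation helper. The band labels follow the Landis and Koch scale, which the abstract's "substantial (0.61-0.80)" wording appears to use. The rating counts are hypothetical, and both function names are illustrative.

```python
def fleiss_kappa(counts):
    """counts[i][j] = number of raters who assigned subject i to category j."""
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("every subject needs the same number (>= 2) of ratings")
    n_categories = len(counts[0])
    total = n_subjects * n_raters
    # Overall proportion of ratings falling in each category
    p_cat = [sum(row[j] for row in counts) / total for j in range(n_categories)]
    # Per-subject agreement among all rater pairs
    p_subj = [(sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
              for row in counts]
    observed = sum(p_subj) / n_subjects
    expected = sum(p * p for p in p_cat)
    if expected == 1:
        raise ValueError("kappa is undefined when all ratings fall in one category")
    return (observed - expected) / (1 - expected)


def landis_koch(kappa):
    """Map a kappa value to the Landis and Koch agreement label."""
    if kappa < 0:
        return "poor"
    for upper, label in [(0.20, "slight"), (0.40, "fair"), (0.60, "moderate"),
                         (0.80, "substantial")]:
        if kappa <= upper:
            return label
    return "almost perfect"


# Hypothetical example: 4 raters classify 4 fractures into 3 categories
ratings = [[4, 0, 0], [0, 4, 0], [0, 0, 4], [2, 2, 0]]
kappa = fleiss_kappa(ratings)
print(f"kappa={kappa:.3f} ({landis_koch(kappa)})")
```

Applied to the reported values, `landis_koch` returns "moderate" for 0.490 and 0.599, "substantial" for 0.772, and "almost perfect" for 0.858, which matches the interpretation given in the abstract.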
ABSTRACT
Background: In 2013, we developed a scale to evaluate abstracts submitted to the congresses of the Society of Surgeons of Chile (SOCICH). Objective: To determine the internal consistency and interobserver reliability of a scale for evaluating conference abstracts. Material and Methods: Reliability study. Twelve surgeons were trained virtually for 8 hours to apply the scale. Once the training was finished, they were sent a questionnaire assessing the contents of the training, along with several sample abstracts to be evaluated with the scale. Descriptive statistics were applied, and the degree of agreement between observers was then estimated for each item of the scale. Subsequently, the intraclass correlation coefficient (ICC) was evaluated using a two-way mixed model in which the effects of the evaluators are random and the items are fixed, with an absolute agreement definition. In addition, the internal consistency of the items was evaluated using Cronbach's alpha, with 95% confidence intervals (95% CI). Results: After analyzing the measurements of the 9 items by the 12 observers, the ICC was 0.871 (95% CI: 0.700-0.965). The internal consistency value was 0.7 for the 9 items, and removing any item is not recommended. Conclusions: The scale has good internal consistency and interobserver reliability. Therefore, it can be considered a reliable instrument for evaluating congress abstracts.
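Cronbach's alpha, the internal consistency measure reported above, compares the sum of the item variances with the variance of the total score. The sketch below is a minimal version of that formula, assuming one row per rated abstract and one column per scale item. The scores are hypothetical and the function name `cronbach_alpha` is illustrative.

```python
from statistics import variance


def cronbach_alpha(scores):
    """scores[i][j] = score given to subject i on item j."""
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 subjects")
    # Sample variance of each item across subjects
    item_variances = [variance([row[j] for row in scores]) for j in range(n_items)]
    # Sample variance of the summed score per subject
    total_variance = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical scores: 4 abstracts rated on 3 items
example = [[3, 4, 3], [2, 2, 3], [4, 5, 4], [1, 2, 2]]
alpha = cronbach_alpha(example)
print(f"alpha={alpha:.3f}")
```

Recomputing alpha with each item left out in turn is the usual way to decide whether an item should be dropped, which is the kind of check behind the statement that no item needs to be removed.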
Subjects
Humans, Male, Female, Reproducibility of Results, Congresses as Topic, Observer Variation, Sex Distribution
ABSTRACT
BACKGROUND: Interpretation of the chest radiograph varies with the reader, and findings in tuberculosis (TB) are non-specific. We aimed to assess the reproducibility of a standardized chest radiograph reading protocol in contacts, under 5 years of age, of patients with pulmonary TB. METHODS: Descriptive, cross-sectional study of children under the age of five who were household contacts of patients with confirmed pulmonary TB in Medellín, Bello and Itagüí (Colombia) between January 1, 2015 and May 31, 2016. Standardized reading protocol: two radiologists, blinded independent reading, and use of a template (Dr. Andronikou design); in case of disagreement, a third reading was performed. The kappa coefficient was estimated for intra- and inter-observer agreement, and prevalence ratios were estimated for sociodemographic characteristics, TB exposure and chest X-ray interpretation. RESULTS: Of 278 children, standardized reading found 255 (91.7%) normal X-rays, 10 (3.6%) consistent with TB, and 13 (4.7%) with other alterations. Global agreement was 91.3% (Kappa = 0.51). Inter-observer agreement was 90.0% (Kappa = 0.59) between readers 1 and 2 and 93.2% (Kappa = 0.59) between readers 1 and 3. Intra-observer agreement was 95.5% (Kappa = 0.86) for reader 1, 84.0% (Kappa = 0.51) for reader 2, and 94.7% (Kappa = 0.68) for reader 3. The greatest inter-observer disagreement between readers 1 and 2 was for soft tissue density suggestive of adenopathy (4.6%), airspace opacification (1.17%) and pleural effusion (0.58%); between readers 1 and 3, it was for soft tissue density suggestive of adenopathy (4.2%), airspace opacification (2.5%) and cavities (0.8%). CONCLUSIONS: Chest radiographs are an affordable tool that contributes to the diagnosis of TB; the standardized reading protocol showed good agreement and improves the reproducibility of radiograph interpretation.
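The results above pair percent agreement above 90% with kappa values near 0.5 to 0.6. A minimal sketch of Cohen's kappa shows how that can happen when most films are normal. The reader labels below are invented for illustration and are not the study's data; the function name is hypothetical.

```python
from collections import Counter


def cohen_kappa(reader_a, reader_b):
    """Return (observed agreement, Cohen's kappa) for two equal-length label lists."""
    n = len(reader_a)
    observed = sum(a == b for a, b in zip(reader_a, reader_b)) / n
    counts_a, counts_b = Counter(reader_a), Counter(reader_b)
    # Agreement expected by chance from each reader's marginal frequencies.
    expected = sum(
        counts_a[label] * counts_b[label]
        for label in set(reader_a) | set(reader_b)
    ) / n ** 2
    return observed, (observed - expected) / (1 - expected)


# Hypothetical example: 100 films, 90% read as normal by each reader.
reader_1 = ["normal"] * 85 + ["normal"] * 5 + ["abnormal"] * 5 + ["abnormal"] * 5
reader_2 = ["normal"] * 85 + ["abnormal"] * 5 + ["normal"] * 5 + ["abnormal"] * 5
agreement, kappa = cohen_kappa(reader_1, reader_2)
```

In this toy case the readers agree on 90% of films, but because both call 90% of films normal, chance alone would produce 82% agreement, which leaves a kappa of about 0.44.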
Subjects
Lymphadenopathy, Pulmonary Tuberculosis, Child, Cross-Sectional Studies, Humans, Observer Variation, Thoracic Radiography/methods, Reproducibility of Results, Pulmonary Tuberculosis/diagnostic imaging, X-Rays
ABSTRACT
Objective: To assess interobserver agreement among radiologists regarding the current Fleischner Society diagnostic criteria for usual interstitial pneumonia (UIP) patterns on computed tomography (CT). Materials and Methods: Using the Fleischner Society criteria for UIP CT patterns, five raters, working independently, categorized the high-resolution CT (HRCT) scans of 44 patients with interstitial lung disease who underwent lung biopsy. The raters also evaluated the presence, extent, and distribution of the most relevant imaging findings, and indicated their level of confidence in the most likely diagnosis and in up to three diagnostic hypotheses. Results: There was moderate to substantial interobserver agreement regarding the UIP patterns on HRCT (kappa statistic [κ] = 0.59-0.61). Interobserver agreement for the binary scores was substantial (κ = 0.77-0.79), whereas that for the presence of honeycombing was almost perfect (κ = 0.81-0.96). There was agreement regarding at least one of the three diagnostic hypotheses in only 36.4% of the cases. For the level of confidence in the most likely diagnosis, there was only slight to fair agreement (κ = 0.19-0.21). Conclusion: Interobserver agreement regarding the current Fleischner Society CT criteria for UIP was moderate to substantial among raters with varying levels of experience. There was only slight to fair agreement regarding the diagnostic hypotheses and the level of confidence in the most likely diagnosis.
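The verbal labels in this abstract (slight, fair, moderate, substantial, almost perfect) match the widely used Landis and Koch bands for kappa. The abstract does not name its scale, so the mapping below is an assumption, and the function name is hypothetical.

```python
def landis_koch_label(kappa):
    """Map a kappa value to a Landis and Koch agreement label (assumed scale)."""
    if kappa < 0:
        return "poor"
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    for upper_bound, label in bands:
        if kappa <= upper_bound:
            return label
    raise ValueError("kappa cannot exceed 1")


# The range endpoints reported in the abstract, mapped to labels.
labels = {k: landis_koch_label(k) for k in (0.19, 0.21, 0.59, 0.61, 0.77, 0.96)}
```

Under this assumption, each reported range straddles or sits within the band the abstract names: 0.59-0.61 crosses from moderate into substantial, and 0.19-0.21 crosses from slight into fair.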
ABSTRACT
Objective: To assess intra- and interobserver agreement among non-expert pathologists in identifying features of the eosinophilic esophagitis histologic scoring system (EoEHSS) in pediatric patients. Patients and methods: The authors used 50 slides from patients (aged 1-15 years; 72% male) with EoE. The EoEHSS evaluates eosinophilic inflammation and other features, including epithelial basal zone hyperplasia, eosinophilic abscesses, eosinophil surface layering, dilated intercellular spaces, surface epithelial alteration, dyskeratotic epithelial cells, and lamina propria fibrosis. Grade and stage of abnormalities are scored using a 4-point scale (0 normal; 3 maximum change). Four pathologists determined EoEHSS findings on two occasions. Intra- and interobserver agreement was assessed using Kappa (κ) statistics and intra-class correlation coefficients. Results: Intra- and interobserver agreement for the identification of eosinophil counts ≥ 15/high power field (HPF) was excellent, but it varied for the additional features of the EoEHSS. For the more experienced pathologist, agreement for most EoEHSS items and the composite scores was substantial to excellent. For the less experienced pathologists, intraobserver agreement ranged from absent to substantial for individual features and from moderate to substantial for the composite scores. Conclusion: Most items of the EoEHSS had substantial to excellent reliability when assessed by a pathologist experienced in the diagnosis of EoE but showed lower repeatability among less experienced pathologists. These findings suggest that specific training of pathologists is required for the identification of EoEHSS characteristics beyond eosinophil count, as these features are considered useful in evaluating response to treatment and correlation with clinical manifestations and endoscopic findings.
Subjects
Humans, Male, Female, Infant, Preschool Child, Child, Adolescent, Adult, Eosinophilic Esophagitis/diagnosis, Eosinophilic Esophagitis/drug therapy, Observer Variation, Reproducibility of Results, Eosinophils/pathology
ABSTRACT
OBJECTIVE: To determine the test-retest reliability of the TGlittre-P in children and adolescents with cystic fibrosis (CFG) and healthy controls (HCG), to establish the minimal detectable change for TGlittre-P time, and to compare TGlittre-P performance between these populations. METHOD: A cross-sectional study evaluated 36 children and adolescents aged 6 to 13. Anthropometric and spirometric evaluations were performed, as well as two TGlittre-P tests on the same day with a 30-minute interval between them. RESULTS: Test-retest reliability of TGlittre-P time was excellent for both groups (CFG: intraclass correlation coefficient [ICC] = 0.849, p < 0.001; HCG: ICC = 0.913, p < 0.001). Regarding absolute reliability, test time showed small variability, with a standard error of measurement of 8.4 seconds (s) for the CFG and 5.3 s for the HCG. The minimal detectable change at the 95% confidence level (MDC95) was 23.2 s and 14.6 s, respectively. There was no difference between the groups in TGlittre-P performance (CFG 179.1 s ± 25.7 s vs. HCG 174.7 s ± 22.3 s; p = 0.589). CONCLUSION: The TGlittre-P is a reliable tool in children and adolescents with CF and in healthy controls. The TGlittre-P appears not to be sensitive enough to discriminate a group of children and adolescents with mild cystic fibrosis from healthy counterparts.
IMPLICATIONS FOR REHABILITATION
TGlittre-P is a multitasking test that has been used to assess the functional capacity of children and adolescents with chronic diseases.
TGlittre-P has excellent reliability in children and adolescents with and without CF.
Differences in TGlittre-P time greater than 12% could indicate changes in the functional capacity of children and adolescents with CF.
Other functional capacity tests may be preferred to detect continuous increases in functional capacity through rehabilitation or training when children and adolescents obtain performance values close to 100% of predicted.
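The abstract reports both the standard error of measurement (SEM) and the MDC95 but not how one was derived from the other. A minimal sketch, assuming the conventional formula MDC95 = 1.96 × √2 × SEM (an assumption, since the abstract does not state it; the function name is hypothetical):

```python
import math


def mdc95(sem):
    """Minimal detectable change at 95% confidence from the SEM,
    assuming the conventional formula 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2) * sem


# SEM values reported in the abstract, in seconds.
mdc_cf = mdc95(8.4)        # cystic fibrosis group
mdc_healthy = mdc95(5.3)   # healthy control group
```

With the rounded SEM values this gives roughly 23.3 s and 14.7 s, close to the reported 23.2 s and 14.6 s; the small gap is presumably due to rounding of the published SEM.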
Subjects
Cystic Fibrosis, Adolescent, Child, Cross-Sectional Studies, Cystic Fibrosis/diagnosis, Exercise Test, Humans, Reproducibility of Results, Spirometry
ABSTRACT
Objective: To evaluate the intra- and inter-observer agreement of the Neer, AO, and AO/OTA classification systems for proximal humerus fractures in adults. Methods: In total, 100 X-rays of patients with proximal humerus fractures were selected according to the inclusion and exclusion criteria established for this study. They were assessed by four evaluators with different levels of expertise at two distinct moments, 21 days apart. Images were randomized for the second evaluation by a researcher who did not participate in the image selection process. Fleiss' kappa was used to evaluate intra- and inter-observer agreement. Results: In the first evaluation, we obtained k = 0.724, k = 0.490, and k = 0.599 for the Neer, AO, and AO/OTA classifications, respectively; in the second evaluation, the values were k = 0.759, k = 0.772, and k = 0.858. Intra-observer agreement therefore improved in the second evaluation. Conclusion: Agreement went from moderate to substantial for the AO classification and from moderate to practically perfect for the AO/OTA classification. Inter-observer agreement was substantial (0.61-0.80), with k = 0.669, k = 0.715, and k = 0.780 for the Neer, AO, and AO/OTA classifications, respectively. Level of Evidence III, Cross-Sectional Observational Study.
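Fleiss' kappa, the statistic used in this study, extends chance-corrected agreement to more than two raters. The sketch below is a plain implementation of the textbook formula; the function name and the toy count table are hypothetical and do not reproduce the study's ratings.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa. counts[i][j] is the number of raters who assigned
    subject i to category j; every row must sum to the same rater count."""
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    n_categories = len(counts[0])
    # Overall proportion of assignments falling in each category.
    p_category = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]
    # Per-subject agreement: share of rater pairs that agree.
    p_subject = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(p_subject) / n_subjects
    p_expected = sum(p * p for p in p_category)
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical toy table: 4 radiographs, 4 raters, 2 fracture categories.
toy_counts = [[4, 0], [2, 2], [0, 4], [3, 1]]
kappa = fleiss_kappa(toy_counts)
```

For the toy table the result is 11/27, about 0.41; a table in which all four raters always choose the same category gives exactly 1.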