Improvement in Strain Concordance between Two Major Vendors after the Strain Standardization Initiative




Background


Disagreement of strain measurements among different vendors has provided an obstacle to the clinical use of strain. A joint standardization task force between professional societies and industry was initiated to reduce intervendor variability of strain. Although feedback from this process has been used in software upgrades, little is known about the effects of efforts to improve conformity. The aim of this study was to assess whether intervendor agreement for global longitudinal strain (GLS) has improved after standardization initiatives.


Methods


Eighty-two subjects (mean age, 52 ± 21 years; 55% men) prospectively underwent two sequential examinations using two most common ultrasound systems (Vivid E9 and iE33). GLS was calculated using proprietary software (EchoPAC-PC BT12 [E12] and BT13 [E13] vs QLAB version 8.0 [Q8], QLAB version 9.0 [Q9], and QLAB version 10.0 [Q10]). Agreements in GLS were evaluated with Bland-Altman plots. Coefficients of variation (CVs) were compared using the Friedman test and compared with CVs of left ventricular volumes and ejection fraction (LVEF).


Results


Median GLS using E12 was −19.2% (interquartile range [IQR], −15.2% to −23.2%), compared with −19.3% (IQR, −14.9% to −23.7%) for E13, −15.7% (IQR, −11.4% to −20%) for Q8, −19% (IQR, −15.7% to −22.3%) for Q9, and −18.7% (IQR, −15.7% to −21.7%) for Q10. The CVs of prestandardization GLS (12 ± 8% [E12/Q8] and 14 ± 8 [E13/Q8]) were significantly larger than that of LVEF (5 ± 5) ( P < .001). Since standardization, the CVs of GLS have shown improvement (6 ± 4 [E12/Q9], 7 ± 4 [E12/Q10], 6 ± 4 [E13/Q9], and 7 ± 4 [E13/Q10]) and are similar to those of LVEF.


Conclusions


Subsequent to the joint standardization task force, there has been improvement in between-vendor concordance in GLS between two leading ultrasound manufactures, the variability of which is now analogous to that of LVEF. The removal of concerns about measurement variability should allow wider use of GLS.


Measurement of strain using two dimensional speckle-tracking echocardiography (STE) is a relatively new test for the assessment of cardiac function, especially for the quantitative evaluation of global and regional myocardial function. STE has largely replaced tissue Doppler–derived strain because of reductions of angle and operator dependence, and STE has become an integrated application in most commercially available ultrasound systems. However, variations in proprietary software among vendors, causing poor intervendor agreement in measurements, have become a significant limitation to the implementation of STE. This limitation has raised concerns with regard to STE’s becoming an acceptable mainstream methodology in daily clinical application, especially in laboratories with echocardiographic instruments from multiple vendors.


To achieve a consensus on methodology for the quantitative evaluation of cardiac mechanics, the European Association of Cardiovascular Imaging and the American Society of Echocardiography (ASE) invited vendors to participate in a concerted effort to reduce intervendor variability of strain measurement. Guidance has been provided on the steps necessary to reduce variability, and this information has been used in software upgrades, but little is known about the effects of this process. Thus, the aim of this study was to elucidate whether there has been an improvement in intervendor global strain (GLS) agreement using STE since the implementation of standardization initiatives. Despite variability of left ventricular (LV) volume and LV ejection fraction (LVEF) measurements, these are used clinically for serial evaluation, so variations in these parameters were used as a frame of reference to compare variations in GLS.


Methods


Study Population


Adult participants who underwent clinically indicated echocardiography were prospectively recruited in the outpatient clinics at Royal Hobart Hospital, Australia, and Takasaki General Medical Center, Japan, from June 2013 to November 2013. A group of adult healthy volunteers was also included in the study. We included patients >18 years of age, without atrial fibrillation or flutter. All subjects underwent two transthoracic echocardiographic studies using two ultrasound systems by the same experienced sonographers. The study protocol was approved by the relevant institutional review boards.


Standard Echocardiography


Transthoracic echocardiography was performed using commercially available ultrasound systems from two vendors (S5-1 probe, iE33 [Philips Medical Systems, Andover, MA]; M5S probe, Vivid E9, and M4S probe, Vivid 7 Dimension [GE Medical Systems, Milwaukee, WI]). Each participant first underwent extensive standard assessments of cardiac anatomy and cardiac function according to clinical protocol with one ultrasound system. This was repeated using the other ultrasound system. Acquisition was obtained at the highest possible frame rate, with optimization of image depth and sector width. Multiple consecutive cardiac cycles of the three standard apical views (apical four-chamber, apical two-chamber, and apical long-axis views) were acquired and digitally stored as raw data for offline analysis. LV end-diastolic volume (LVEDV) and LV end-systolic volume (LVESV) were determined using the biplane method of disks. The baseline assessment included standard two-dimensional, M-mode, color Doppler, pulsed-wave and continuous-wave Doppler, and Doppler tissue imaging modalities using standard parasternal, apical, subcostal, and suprasternal windows.


Measurement of Myocardial Strain


Measurement of GLS has been previously described. Briefly, two-dimensional images from three apical views (apical four-chamber, apical two-chamber, and apical long-axis views) were used. Readings were obtained by averaging six segments in each view. GLS was determined from the average of all 18 segments. Figure 1 is a schematic description of image acquisition, strain analyses, and comparisons using the respective three generations of proprietary software packages from the two vendors. In the images acquired with the Vivid E9 ( Figure 1 A), speckle-tracking analyses were performed using proprietary software (EchoPAC-PC BT12 [E12], released March 2012, and BT13 [E13], released May 2013; GE Medical Systems). After tracing of the endocardial border, the region of interest was adjusted to include the entire myocardial thickness and avoid the pericardium. The software then selected stable speckles within the myocardium and performed speckle-tracking on a frame-to-frame basis throughout the entire cardiac cycle. The adequacy of tracking was verified manually. The cardiac cycle with the best tracking and visually most credible strain curves was selected for analysis. In segments with poor tracking, the border was readjusted manually until optimal tracking was achieved. After adjustment, segments with consistently poor tracking were excluded. Final GLS was calculated as the averaged value of GLS values from each apical view, using peak negative longitudinal strain during the cardiac cycle.




Figure 1


Schematic description of ultrasound systems, software versions and combinations of comparisons. (A) In the between-vendor comparisons, echocardiographic images were acquired with two ultrasound systems for each patient. The images were analyzed with vendor-specific software versions (E12 and E13 for Vivid E9; Q8, Q9, and Q10 for iE33). (B) A separate group of normal volunteers was imaged using GE Vivid 7 and analyzed and compared using E11 and E12. Software release dates were shown in the brackets.


Because EchoPAC BT11 (E11; released May 2010) cannot analyze the images acquired using the current Vivid E9 system, because of a lack of compatibility with older versions of software, a separate group of normal volunteers were imaged using the Vivid 7 Dimension (mean frame rate, 72 ± 6 Hz) only for the comparison of E11 and E12 ( Figure 1 B). All offline measurements with E11, E12, and E13 were performed by a single observer (H.Y.).


Images acquired on the iE33 were analyzed using proprietary software from before the standardization initiative (QLAB version 8.0 [Q8], released April 2010; Philips Medical Systems) and after the standardization initiative (QLAB version 9.0 [Q9], released February 2012, and version 10.0 [Q10], released August 2013; Philips Medical Systems). For each view, endocardial and epicardial borders were manually traced in the end-systolic frame using three software versions with the same image view. All three software versions were used to trace the borders automatically frame by frame throughout the same cardiac cycle. Visual inspection of the tracking was carefully performed, and if automated tracking was unsatisfactory, manual point-to-point and frame-to-frame adjustments were carefully made until satisfactory tracking was achieved. Electrocardiographic tracing was used to estimate the timing of end-systole and early diastole. All offline measurement with QLAB was performed by the same observer (H.Y.). The analyses using QLAB were performed 4 weeks after analysis with EchoPAC, and the operator was blinded to the previous measurements.


Statistical Analysis


Continuous variables are presented as mean ± SD. Categorical variables are expressed as percentages. Normality was evaluated using the Shapiro-Wilk W test. Because strain was not normally distributed, Spearman’s ρ was used to express the correlation coefficient. The agreement between two vendors and with two software versions was studied using Bland-Altman analysis to quantify a systemic difference (bias) between two techniques and the spread of differences of mean bias (limits of agreement [LOA]). Intraclass correlation coefficients (ICCs) were also used for the assessment of agreement. Coefficients of variation (CV) were calculated for all measurements. The difference of each set of paired variables was also assessed by percentage error. Percentage error was derived by dividing the LOA by the mean. The CVs of different systems among four series of parameters (LVEDV, LVESV, LVEF, and GLS) were compared using the Friedman test, followed by pairwise tests with the Bonferroni correction.


To assess intraobserver variability, the same operator performed measurements after an interval of ≥2 weeks in 20 randomly selected subjects, blinded to previous measurements. For the assessment of interobserver variability, a second operator (K.N.) measured the same group of subjects in a blinded fashion. Intraobserver and interobserver variability values were summarized as the absolute difference, CV, ICC, and Bland-Altman statistics and by calculating the CVs.


Statistical analysis was performed using IBM SPSS Statistics version 20 (IBM Corporation, Armonk, NY) and MedCalc version 12.0.1.0 (MedCalc Software, Mariakerke, Belgium). Statistical significance was accepted at a P value of <.05.




Results


Characteristics of the Study Population


Two patients were excluded at the time of image acquisition because of inadequate and suboptimal image quality; a total of 82 subjects (45 men [55%]) were included in the main comparison between the vendors ( Figure 1 A). Their baseline demographics, conventional echocardiographic parameters, and indications for echocardiography are summarized in Table 1 . Among them, 16 (20%) were healthy volunteers who were free of clinical symptoms and signs. The study population had wide ranges of age (18–95 years; mean, 52 ± 20.3 years) and body mass index (16–43 kg/m 2 ; mean, 25 ± 6 kg/m 2 ). Their mean LVEF was 58 ± 12% (range, 14%–72%). Frame rates of images acquired with the iE33 (mean, 60 ± 6 Hz) were significantly higher ( P < .0001) than those acquired with the Vivid E9 (mean, 56 ± 6 Hz).



Table 1

General characteristics and conventional echocardiographic parameters ( n = 82)




































































































Clinical characteristic Value
Age (y) 52 ± 20.6
Men 45 (55%)
Weight (kg) 71 ± 19
Height (cm) 166 ± 11
Body mass index (kg/m 2 ) 25 ± 6
Body surface area (m 2 ) 1.78 ± 0.25
Systolic blood pressure (mm Hg) 128 ± 21
Diastolic blood pressure (mm Hg) 72 ± 15
Heart rate (beats/min) 67 ± 11
Echocardiographic data
LVEDV (mL/m 2 ) 54 ± 23
LVESV (mL/m 2 ) 25 ± 20
LVSV (mL/m 2 ) 29 ± 7
LVEF (%) 58 ± 12
Left atrial volume (mL/m 2 ) 30 ± 11
Deceleration time (msec) 204 ± 58
E/A ratio 1.24 ± 0.64
e′ 0.08 ± 0.04
E/e′ ratio 10.3 ± 6.6
Indications for echocardiography
Ischemic heart disease 11 (13.4%)
Arrhythmia 10 (12.2%)
Heart murmur 7 (8.5%)
Shortness of breath 9 (11.0%)
Heart failure 8 (9.8%)
Cardiomyopathy 7 (8.5%)
Pericarditis 2 (2.4%)
Post chemotherapy 3 (3.7%)
Systemic disease 4 (4.9%)
Miscellaneous 5 (6.1%)
Normal volunteer 16 (19.5%)

Data are expressed as mean ± SD or as number (percentage). LVEDV , Left ventricular end-diastolic volume; LVEF , left ventricular ejection fraction; LVESV , left ventricular end-systolic volume.


Measurement of GLS Using Various Software Versions


The measurement of GLS was feasible with each software version in all cases. The median value of GLS using E12 was −19.2% (interquartile range [IQR], −15.2% to −23.2%), compared with −19.3% (IQR, −14.9% to −23.7%) for E13, −15.7% (IQR, −11.4% to −20%) for Q8, −19% (IQR, −15.7% to −22.3%) for Q9, and −18.7% (IQR, −15.7% to −21.7%) for Q10. These are summarized in Supplemental Table 1 . Overall, GLS showed a good correlation between EchoPAC versions (E12 and E13) and QLAB versions (Q8, Q9, and Q10). However, the bias and LOA were greater between E12 and E13 versus Q8 than between E12 and E13 versus Q9 and Q10 ( Table 2 ; Supplemental Figures 1–4 ).



Table 2

Analyses of variation between variables













































































































Variable ICC (95% CI) ρ Bias LOA Percentage error (mean ± SD)
LVEDV (GE/Philips) 0.96 (0.93 to 0.97) 0.79 −6.4 33.8 0.39 ± 0.12
LVESV (GE/Philips) 0.98 (0.96 to 0.98) 0.77 −2.9 22.8 0.68 ± 0.29
LVEF (GE/Philips) 0.95 (0.93 to 0.97) 0.79 −0.1 9.8 0.18 ± 0.08
GLS (E12_Q8) 0.79 (0.67 to 0.86) 0.81 −2.7 4.5 −0.28 ± 0.14
GLS (E13_Q8) 0.78 (0.65 to 0.86) 0.81 −2.8 4.5 −0.29 ± 0.11
GLS (E12_Q9) 0.91 (0.86 to 0.94) 0.81 −0.2 3.5 −0.21 ± 0.08
GLS (E13_Q9) 0.95 (0.92 to 0.97) 0.78 −0.2 3.7 −0.22 ± 0.09
GLS (E12_Q10) 0.95 (0.92 to 0.97) 0.76 −0.5 3.5 −0.22 ± 0.09
GLS (E13_Q10) 0.94 (0.91 to 0.96) 0.74 −0.6 3.7 −0.23 ± 0.09
GLS (E13_E12) 0.99 (0.98 to 0.99) 0.94 0.1 2.0 −0.12 ± 0.05
GLS (Q9_Q10) 0.97 (0.96 to 0.98) 0.90 0.4 2.5 −0.15 ± 0.06
GLS (Q9_Q8) 0.75 (0.61 to 0.84) 0.68 −2.6 5.0 −0.33 ± 0.12
GLS (Q10_Q8) 0.80 (0.68 to 0.87) 0.67 2.2 4.9 −0.32 ± 0.12
GLS (E11_E12) 0.94 (0.91 to 0.96) 0.89 0.1 2.0 −0.098 ± 0.01

CI , Confidence interval; GLS , global longitudinal strain; LVEDV , left ventricular end-diastolic volume; LVEF , left ventricular ejection fraction; LVESV , left ventricular end-systolic volume.

A separate group of normal volunteers ( n = 76) comparing E11 and E12 only.



Results of GLS between Q9 (after standardization) and Q8 (before standardization) showed an association but a significant difference (ρ = 0.68, bias = −2.6%, LOA = 5.0%). A similar result was observed comparing Q10 (after standardization) with Q8 (ρ = 0.67, bias = 2.2%, LOA = 4.9%). A good correlation was observed comparing E12 with E13 (ρ = 0.94, bias = 0.1%, LOA = 2.0%) and Q9 with Q10 (ρ = 0.90, bias = 0.4%, LOA = 2.5%); both were developed after the standardization task force ( Table 2 ; Supplemental Figures 1–4 ).


GLS was also analyzed and compared using E11 and E12 in a separate group of normal volunteers ( n = 76; Figure 1 B). The comparability of E11 and E12 showed a good correlation (ρ = 0.883, bias = 0.1%, LOA = 2.0%; Figure 2 ). The CV between E11 and E12 was −0.03 ± 0.02, and the ICC was 0.94 (0.91–0.96).




Figure 2


Scatterplot ( left ) and Bland-Altman plot ( right ): comparison between E11 and E12.


Comparison between GLS and LV Volumes


Using the Vivid E9, the mean LVEDV (95 ± 41 mL), LVESV (43 ± 35 mL), and LVEF (58.0 ± 11.8%) were largely similar compared with those obtained using the iE33 (102 ± 48 mL, 46 ± 41 mL, and 58.1 ± 11.9%, respectively; Supplemental Table 1 ). The CV and percentage error of LVEDV, LVESV, and LVEF between the two machines are summarized in Table 2 . Using the Friedman test with the Bonferroni correction, the CV of LVEF ws significantly different from those of LVEDV ( P < .001) and LVESV ( P < .001). The CVs of LVEDV and LVESV showed no significant difference.


The CVs of prestandardization GLS (0.12 ± 0.08 [E12 vs Q8] and 0.14 ± 0.08 [E13 vs Q8]) were significantly larger than that of LVEF (0.05 ± 0.05) ( P < .001). Since standardization, the CVs of GLS have shown improvement (0.06 ± 0.04 [E12 vs Q9], 0.07 ± 0.04 [E12 vs Q10], 0.06 ± 0.04 [E13 vs Q9], and 0.07 ± 0.04 [E13 vs Q10]). There was no significant difference in the CV of differences in LVEF and that of GLS using poststandardization software (E12 vs Q9, P = .99); E13 vs Q9, P = .81; E12 vs Q10, P = .22; and E13 vs Q10, P = .091; Figure 3 a, Supplemental Table 2 ). Figure 3 b describes comparison of CVs of within-vendor software versions (Q8 vs Q9, Q8 vs Q10, Q9 vs Q10, and E12 vs E13) with the CV of LVEF.




Figure 3


Comparison of CVs. (A) Comparisons among the CVs from LVEF and between-vendor software comparisons of GLS (Q8 vs E12, Q8 vs E13, Q9 vs E12, Q9 vs E13, Q10 vs E12, and Q10 vs E13). Software before standardization: Q8; software after standardization: Q9, Q10, E12, and E13. (B) Comparisons among the CVs of LVEF and within-vendor software comparisons of GLS (Q8 vs Q9, Q8 vs Q10, Q9 vs Q10, and E12 vs E13).


Intraobserver and Interobserver Variability


The intraobserver and interobserver variability of GLS from E12, E13, Q8, Q9, and Q10 was acceptable ( Table 3 ). There were no significant differences among the CVs in intraobserver ( P = .96) and interobserver ( P = .11) observer variability.



Table 3

Reproducibility analysis



































































Version Intraobserver reproducibility Interobserver reproducibility
Absolute mean difference (%) CV ± SD ICC (95% CI) Bias (LOA) (%) Absolute mean difference (%) CV ± SD ICC (95% CI) Bias (LOA) (%)
E12 0.08 ± 1.03 −0.03 ± 0.03 0.98 (0.97 to 0.99) 0.10 (2.0) 0.41 ± 1.36 −0.04 ± 0.03 0.98 (0.94 to 0.99) 0.4 (2.6)
E13 −0.09 ± 0.71 −0.02 ± 0.02 0.994 (0.986 to 0.997) −0.09 (1.4) 0.40 ± 0.82 −0.04 ± 0.02 0.995 (0.988 to 0.998) −0.3 (1.1)
Q8 −0.19 ± 0.71 −0.03 ± 0.02 0.99 (0.98 to 0.99) −0.20 (1.3) −0.52 ± 0.7 −0.04 ± 0.03 0.99 (0.94 to 0.99) −0.5 (1.3)
Q9 0.37 ± 1.16 −0.04 ± 0.04 0.98 (0.96 to 0.99) 0.40 (2.3) −0.27 ± 2.27 −0.08 ± 0.07 0.94 (0.85 to 0.98) −0.3 (4.4)
Q10 0.20 ± 0.72 −0.03 ± 0.02 0.99 (0.98 to 0.99) 0.20 (2.1) 0.55 ± 1.30 −0.05 ± 0.05 0.96 (0.91 to 0.99) −0.7 (2.9)

Only gold members can continue reading. Log In or Register to continue

Stay updated, free articles. Join our Telegram channel

Apr 21, 2018 | Posted by in CARDIOLOGY | Comments Off on Improvement in Strain Concordance between Two Major Vendors after the Strain Standardization Initiative

Full access? Get Clinical Tree

Get Clinical Tree app for offline access