RESUMEN
Lung cancer is one of the most commonly occurring malignant tumours worldwide. Although some reference methods such as X-ray, computed tomography or bronchoscope are widely used for clinical diagnosis of lung cancer, there is still a need to develop new methods for early detection of lung cancer. Especially needed are approaches that might be non-invasive and fast with high analytical precision and statistically reliable. Herein, we developed a swab "dip" test in saliva whereby swabs were analysed using attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy harnessed to principal component analysis-quadratic discriminant analysis (QDA) and variable selection techniques employing successive projections algorithm (SPA) and genetic algorithm (GA) for feature selection/extraction combined with QDA. A total of 1944 saliva samples (56 designated as lung-cancer positive and 1888 designed as controls) were obtained in a lung cancer-screening programme being undertaken in North-West England. GA-QDA models achieved, for the test set, sensitivity and specificity values of 100.0% and 99.1%, respectively. Three wavenumbers (1422 cm-1, 1546 cm-1 and 1578 cm-1) were identified using the GA-QDA model to distinguish between lung cancer and controls, including ring C-C stretching, CîN adenine, Amide II [δ(NH), ν(CN)] and νs(COO-) (polysaccharides, pectin). These findings highlight the potential of using biospectroscopy associated with multivariate classification algorithms to discriminate between benign saliva samples and those with underlying lung cancer.