Abstract

Identifying the emotional state of individuals has useful applications, particularly to reduce the risk of suicide. Users’ thoughts on social media platforms can be used to find cues on the emotional state of individuals. Clinical approaches to suicide ideation detection primarily rely on evaluation by psychologists, medical experts, etc., which is time-consuming and requires medical expertise. Machine learning approaches have shown potential in automating suicide detection. In this regard, this study presents a soft voting ensemble model (SVEM) by leveraging random forest, logistic regression, and stochastic gradient descent classifiers using soft voting. In addition, for the robust training of SVEM, a hybrid feature engineering approach is proposed that combines term frequency-inverse document frequency and the bag of words. For experimental evaluation, “Suicide Watch” and “Depression” subreddits on the Reddit platform are used. Results indicate that the proposed SVEM model achieves an accuracy of 94%, better than existing approaches. The model also shows robust performance concerning precision, recall, and F1, each with a 0.93 score. ERT and deep learning models are also used, and performance comparison with these models indicates better performance of the SVEM model. Gated recurrent unit, long short-term memory, and recurrent neural network have an accuracy of 92% while the convolutional neural network obtains an accuracy of 91%. SVEM’s computational complexity is also low compared to deep learning models. Further, this study highlights the importance of explainability in healthcare applications such as suicidal ideation detection, where the use of LIME provides valuable insights into the contribution of different features. In addition, k-fold cross-validation further validates the performance of the proposed approach.

Keywords

Ensemble learning, feature extraction, feature fusion, machine learning, suicide ideation

Document Type

Journal Article

Date of Publication

12-1-2026

Volume

19

Issue

1

Publication Title

International Journal of Computational Intelligence Systems

Publisher

Springer

School

Centre for Artificial Intelligence and Machine Learning (CAIML) / School of Science

Funders

European University of Atlantic

Creative Commons License

Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.

Comments

Kina, E., Choi, J., Ishaq, A., Shafique, R., Villar, M. G., Alvarado, E. S., De La Torre Diez, I., & Ashraf, I. (2026). Suicide ideation detection using social media data and ensemble machine learning model. International Journal of Computational Intelligence Systems, 19. https://doi.org/10.1007/s44196-025-01123-9

Share

 
COinS
 

Link to publisher version (DOI)

10.1007/s44196-025-01123-9