Education, Science, Technology, Innovation and Life
Open Access
Sign In

Bayesian-Grid Hyperparameter Optimization with Bootstrap Validation for Ensemble Risk Stratification on Tabular Data

Download as PDF

DOI: 10.23977/acss.2026.100203 | Downloads: 2 | Views: 74

Author(s)

Weiyi Zhu 1, Can Wu 1, Hongxun Ye 1, Shengjun Xu 1, Ting Zhong 1

Affiliation(s)

1 School of Big Data and Statistics, Sichuan Tourism University, Chengdu, Sichuan, China

Corresponding Author

Ting Zhong

ABSTRACT

Constructing structured knowledge representations from large-scale unstructured text streams remains a fundamental challenge in natural language processing, requiring the joint solution of entity recognition, relation extraction, event evolution analysis, and graph visualization within a coherent end-to-end pipeline. This paper presents a knowledge graph construction framework that integrates pretrained language model based named entity recognition with rule-augmented relation triple extraction and force-directed graph layout for interactive visualization of the resulting structure. The proposed architecture employs a five-class entity schema covering events, persons, organizations, locations, and temporal expressions, applies a transformer-based sequence labeling model for entity boundary detection, and constructs subject-predicate-object triples through a dependency-guided extraction module that resolves coreference and disambiguates polysemous mentions. Event temporal evolution is recovered by chaining triples along their timestamp annotations to expose causal and sequential dependencies. The pipeline was evaluated on a corpus of 4,328 domain-specific documents collected through web crawling and preprocessed by deduplication, tokenization, and stopword removal, yielding 18,754 entity mentions and 12,463 relation triples. The proposed framework achieves a named entity recognition F1 score of 91.4% and a relation extraction F1 of 87.6%, exceeding BiLSTM-CRF and rule-based baselines by 5.2 and 7.8 percentage points respectively. The constructed graph supports interactive exploration through force-directed visualization with optimized label placement.

KEYWORDS

Named Entity Recognition, Relation Triple Extraction, Knowledge Graph Construction, Force-Directed Graph Layout, Event Temporal Evolution, Pretrained Language Model

CITE THIS PAPER

Weiyi Zhu, Can Wu, Hongxun Ye, Shengjun Xu, Ting Zhong. Bayesian-Grid Hyperparameter Optimization with Bootstrap Validation for Ensemble Risk Stratification on Tabular Data. Advances in Computer, Signals and Systems (2026). Vol. 10, No. 2, 19-29. DOI: http://dx.doi.org/10.23977/acss.2026.100203.

REFERENCES

[1] Kaur, N., Saha, A., Swami, M., Singh, M. and Dalal, R. (2024) BERT-NER: A transformer-based approach for named entity recognition. Proceedings of the 2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT), 1-7.
[2] Wu, B., Ding, Z. and Huang, J. (2026) A review of continual learning in edge AI. IEEE Transactions on Network Science and Engineering.
[3] Gardazi, N.M., Daud, A., Malik, M.K., Bukhari, A., Alsahfi, T. and Alshemaimri, B. (2025) BERT applications in natural language processing: A review. Artificial Intelligence Review, 58, 166.
[4] Wu, B., Ding, Z., Ostigaard, L. and Huang, J. (2025) Reinforcement learning-based energy-aware coverage path planning for precision agriculture. Proceedings of the 2025 ACM Research on Adaptive and Convergent Systems (RACS), 1-8.
[5] Salih, M.I., Mohammed, S.M., Ibrahim, A.K., Ahmed, O.M. and Haji, L.M. (2025) Fine-tuning BERT for automated news classification. Engineering, Technology & Applied Science Research, 15, 22953-22959.
[6] Wu, B., Cai, Z., Wu, W. and Yin, X. (2023) AoI-aware resource management for smart health via deep reinforcement learning. IEEE Access, 11, 81180-81195.
[7] Sreenivas, S.C., Chowdhury, S. and Masum, M. (2025) Enhancing clinical named entity recognition via fine-tuned BERT and dictionary-infused retrieval-augmented generation. Electronics, 14, 3676.
[8] Wu, B. and Wu, W. (2023) Model-free cooperative optimal output regulation for linear discrete-time multi-agent systems using reinforcement learning. Mathematical Problems in Engineering, 6350647.
[9] Zengeya, T., Naidoo, K.E., Fonou-Dombeu, J.V. and Gwetu, M. (2025) A multimodal transformer-based fusion model for enhanced relation extraction from texts. IEEE Access.
[10] Chaturvedi, R., Baghershahi, P., Medya, S. and Di Eugenio, B. (2025) Temporal relation extraction in clinical texts: A span-based graph transformer approach. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 25765-25788.
[11] Mouiche, I. and Saad, S. (2025) Entity and relation extractions for threat intelligence knowledge graphs. Computers & Security, 148, 104120.
[12] Premasiri, D., Ranasinghe, T., Mitkov, R., El-Haj, M. and Frommholz, I. (2025) Survey on legal information extraction: Current status and open challenges. Knowledge and Information Systems, 67, 11287-11358.
[13] Ferreira, P., Martins, E., Silva, J. and Teixeira, P. (2025) Feature selection and XGBoost for enhanced intrusion detection: A comparative study across benchmark datasets. Proceedings of the 2025 13th International Symposium on Digital Forensics and Security (ISDFS), 1-6.
[14] Huang, J., Wu, B., Duan, Q., Dong, L. and Yu, S. (2025) A fast UAV trajectory planning framework in RIS-assisted communication systems with accelerated learning via multithreading and federating. IEEE Transactions on Mobile Computing.
[15] Kumar, R., Singhal, N. and Chhabra, A. (2025) Hybrid optimization algorithm with the combination of PSO and genetic algorithm for task scheduling in cloud computing. E-Learning and Digital Media, 20427530251331082.
[16] Nathiya, N., Rajan, C. and Geetha, K. (2025) A hybrid optimization and machine learning based energy-efficient clustering algorithm with self-diagnosis data fault detection and prediction for WSN-IoT application. Peer-to-Peer Networking and Applications, 18, 13.
[17] Wu, B., Huang, J. and Yu, S. (2026) 'X of Information' continuum: A survey on AI-driven multi-dimensional metrics for next-generation networked systems. IEEE Communications Surveys & Tutorials.
[18] Wu, B., Huang, J., Duan, Q., Dong, L. and Cai, Z. (2025) Enhancing vehicular platooning with wireless federated learning: A resource-aware control framework. IEEE/ACM Transactions on Networking, 33, 1-16.
[19] Isah, M.A. and Kim, B.S. (2025) Question-answering system powered by knowledge graph and generative pretrained transformer to support risk identification in tunnel projects. Journal of Construction Engineering and Management, 151, 04024193.
[20] Wu, B., Huang, J. and Duan, Q. (2025) FedTD3: An accelerated learning approach for UAV trajectory planning. Proceedings of the International Conference on Wireless Artificial Intelligent Computing Systems and Applications (WASA), 13-24.
[21] Auer, S., D'Souza, J., Farfar, K.E., Jaradeh, M.Y., Jiomekong, A., Karras, O. and Vogt, L. (2025) Open research knowledge graph: A large-scale neuro-symbolic knowledge organization system. Handbook on Neurosymbolic AI and Knowledge Graphs, SAGE Publications, 385-420.
[22] Shim, M., Choi, H., Koo, H., Um, K., Lee, K.H. and Lee, S. (2025) OmEGa (Ω): Ontology-based information extraction framework for constructing task-centric knowledge graph from manufacturing documents with large language model. Advanced Engineering Informatics, 64, 103001.
[23] Wu, B., Huang, J. and Duan, Q. (2025) Real-time intelligent healthcare enabled by federated digital twins with AoI optimization. IEEE Network, 1.
[24] Menaouer, B., Fairouz, S., Meriem, M.B., Mohammed, S. and Nada, M. (2025) A sentiment analysis of the Ukraine-Russia War tweets using knowledge graph convolutional networks. International Journal of Information Technology, 1-18.
[25] Pan, D., Wu, B.-N., Sun, Y.-L. and Xu, Y.-P. (2023) A fault-tolerant and energy-efficient design of a network switch based on a quantum-based nano-communication technique. Sustainable Computing: Informatics and Systems, 37, 100827.
[26] Agrawal, S.K. (2026) Adaptive density-aware clustering of high-dimensional patient data in electronic health records. International Journal of Engineering Development and Research, 14, 361-367.

Downloads: 47799
Visits: 1089553

Sponsors, Associates, and Links


All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.