Petro-chemical Equipment Technology, 2024, Vol. 45, Issue (6): 48-53. doi: 10.3969/j.issn.1006-8805.2024.06.010

• EQUIPMENT MANAGEMENT •

Research on Data Analysis Model for Design Documents Compliance Checking of Old Static Equipment

Yuan Dandan, Chen Jiayi, Yuan Xuyang, Liu Yang, Zhang Zhiyun   

  1. SINOPEC Engineering Incorporation, Beijing, 100101
  • Received: 2024-08-23 Accepted: 2024-10-31 Online: 2024-11-15 Published: 2024-11-15

Abstract: Compliance checking of the design documents of old equipment helps to accurately control the risks that have accumulated as the refining and chemical industry has developed, and it is a necessary requirement for coordinating development and safety. In this paper, machine learning algorithms are applied to the text data in the static equipment design documents of old units, and a data analysis model is constructed to extract and analyze key information on where these design documents fail to meet compliance requirements. The research consists of three parts: first, data preprocessing methods are used to clean the data; second, the TextRank algorithm is used to automatically extract keywords and key phrases from a large volume of text; third, the LDA algorithm is used to train a topic model that automatically generates the key topics in the text. The results obtained with this method can assist experts in rapidly and accurately investigating and assessing safety risk information for old equipment.

Key words: compliance checking, machine learning, data cleaning, TextRank algorithm, LDA topic model
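
The abstract does not give implementation details, so the following is only a minimal sketch of the first two steps it describes (data cleaning, then TextRank keyword extraction), not the authors' implementation. It assumes English tokens, a small hypothetical stopword list, an unweighted co-occurrence graph, and a sample document invented purely for illustration; the names clean, textrank_keywords, and STOPWORDS are likewise hypothetical.

```python
import re
from collections import defaultdict

# Hypothetical stopword list; a real project would use a domain-specific one.
STOPWORDS = {"the", "of", "and", "is", "a", "an", "to", "in", "for",
             "with", "not", "be", "on", "at", "by"}


def clean(text):
    """Lowercase the text, keep alphabetic tokens, drop stopwords and very short words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS and len(t) > 2]


def textrank_keywords(tokens, window=3, damping=0.85, iterations=50, top_k=5):
    """Rank words by running PageRank on an undirected co-occurrence graph."""
    # Link each word to the words that follow it inside the sliding window.
    neighbors = defaultdict(set)
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + window]:
            if other != word:
                neighbors[word].add(other)
                neighbors[other].add(word)

    # Fixed number of synchronous PageRank updates (no convergence test).
    scores = {w: 1.0 for w in neighbors}
    for _ in range(iterations):
        scores = {
            w: (1 - damping)
            + damping * sum(scores[n] / len(neighbors[n]) for n in neighbors[w])
            for w in neighbors
        }

    # Highest score first; ties are broken alphabetically for reproducibility.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:top_k]


# Invented sample text standing in for a design-document review note.
doc = (
    "The design pressure of the pressure vessel is not stated. "
    "The vessel material certificate is missing. "
    "Design pressure and design temperature of the vessel exceed the nameplate rating."
)
tokens = clean(doc)
keywords = textrank_keywords(tokens)
print(keywords)
```

Words that co-occur with many different neighbors accumulate the highest scores, which is why such a ranking can surface recurring equipment terms without labeled data. The window size, damping factor, and iteration count here are illustrative defaults and would need tuning on real design documents.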