面向航空标准的大语言模型迭代检索增强生成方法
CSTR:
作者:
作者单位:

1中国航空综合技术研究所标准数据技术研究部, 北京 100028;2中国飞机强度研究所强度与结构完整性全国重点实验室,西安 710065

作者简介:

通讯作者:

曾江辉,男,研究员,E-mail:zengjh2014@sina.com。

中图分类号:

TP18

基金项目:

中国飞机强度研究所强度与结构完整性全国重点实验室项目。


Iterative Retrieval-Augmented Generation Method for Aviation Standards Based on Large Language Models
Author:
Affiliation:

1Department of Standard Data Technology Research, China Aero-polytechnology Establishment, Beijing 100028, China;2National Key Laboratory of Strength and Structural Integrity, Aircraft Strength Research Institute of China, Xi’an 710065, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    航空标准数据具有结构复杂、语义严谨和跨文档引用频繁等特点,为实现高效、精准的知识获取与问答应用带来挑战,本文提出一种面向航空标准的大语言模型(Large language models, LLMs)迭代检索增强生成(Retrieval-augmented generation, RAG)方法,设计了基于结构路径感知的标准向量知识库构建与检索机制,结合标准文档的章节结构与标题链条构建支持语境追溯的知识库,并提出基于关键词与语义融合的知识检索机制。在此基础上,设计LLM驱动的自动迭代检索与生成机制,使模型能够自主判断是否需要发起子问题拆解与深层意图识别,并结合多轮检索与动态调度策略,实现问题拆解、信息获取、自主判断与生成控制的一体化闭环,提升对多知识点聚合型、语义递进型等复杂标准问答任务的生成质量与覆盖深度。实验基于7 459份航空标准文档构建知识库,针对500条专家标注问答对,在4类涵盖不同参数规模、模型类型及中英文语言能力的主流开源大语言模型上开展对比实验。结果表明,对于中大型参数规模的大模型,此方法在回答准确性、覆盖度和表达质量等指标上均显著优于传统方法。在大模型DeepSeek-R1-70B上,双语评估替补(Bilingual evaluation understudy, BLEU)指标平均提升27.97%,模拟主观评分提升7.99%;在大模型Qwen-2.5-32B上,BLEU指标平均提升54.67%,模拟主观评分提升8.58%。本文所提方法不仅适用于航空标准场景,也可推广至适航规章、维修手册等其他航空结构化文档场景,以及法律、医疗等对回答效果、可信度与可溯源性要求极高的领域,为相关问答系统的构建提供通用的技术框架与实现路径。

    Abstract:

    Aviation standards are characterized by complex data structures, rigorous semantics, and frequent cross-document references, which pose significant challenges for achieving efficient and accurate knowledge acquisition and question-answering applications. In response, an iterative retrieval-augmented generation (RAG) method for aviation standards based on large language models (LLMs) is proposed. A structure-aware vectorized knowledge base construction and retrieval mechanism is designed, which leverages the hierarchical structure and title chains of standard documents to support context-traceable knowledge representation. A hybrid retrieval strategy combining keyword matching and semantic similarity is further introduced. On this basis, an LLM-driven automatic iterative retrieval and generation mechanism is developed, enabling the model to autonomously determine whether sub-question decomposition and deep intent recognition are required. By integrating multi-turn retrieval with dynamic scheduling, the proposed framework forms a closed-loop process of problem decomposition, information acquisition, self-assessment, and content generation, effectively improving generation quality and semantic coverage for complex queries such as multi-point aggregation and semantic progression. Experiments are conducted on a knowledge base constructed from 7 459 aviation standard documents and a benchmark set of 500 expert-annotated question-answer pairs, evaluated across four mainstream open-source LLMs with varying parameter scales, model types, and bilingual capabilities. The results indicate that, for models with medium or large scales, the proposed method significantly outperforms traditional retrieval-augmented generation approaches in answer accuracy, coverage, and expression quality. Specifically, on DeepSeek-R1-70B, the bilingual evaluation understudy(BLEU) score improved by 27.97% and the simulated human preference score increased by 7.99%; on Qwen-2.5-32B, BLEU improved by 54.67% and the simulated score increased by 8.58%. The proposed method is applicable not only to aviation standards, but also to other structured document scenarios in the aviation field, including airworthiness regulations and maintenance manuals, as well as to domains such as law and healthcare that require high answer quality, reliability, and traceability. It offers a versatile technical framework and implementation approach for the construction of domain-specific question-answering systems.

    参考文献
    相似文献
    引证文献
引用本文

秦晓瑞,何柳,安然,曾江辉,刘姝妍,王少枫,田宇.面向航空标准的大语言模型迭代检索增强生成方法[J].南京航空航天大学学报,2026,58(1):235-248

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-02
  • 最后修改日期:2025-11-03
  • 录用日期:
  • 在线发布日期: 2026-03-10
  • 出版日期:
文章二维码
您是第位访问者
网站版权 © 南京航空航天大学学报
技术支持:北京勤云科技发展有限公司