中文农机大语言模型耒耜构建方法
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家自然科学基金项目(52175258)和中国博士后科学基金项目(2023M743790)


Construction Method of Large Language Model for Chinese Agricultural Machinery LeiSi
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    大语言模型拥有强大的生成、学习和推理能力,是加快现代农机装备绿色、智能和高效发展的强力助手。然而,由于缺乏用于训练的农业机械相关数据集,极大限制了大模型在农机研发、制造和推广。因此,面向农机装备科研人员、设计制造工程师、用户等不同群体具体需求,提出了中文农机大模型---耒耜。首先,提出了耒耜大模型总体架构设计方案,包括耒耜·薪火、耒耜·匠心和耒耜·耕耘3个版本,旨在为目标群体提供农机专业知识问答、设计制造建议和田间作业管控等多样化、定制化服务;其次,构建了国内首个中文农机数据集,并以耒耜·薪火大模型为例,阐明了模型训练和评估方法,分别以LLaMA 3.1-8B-Instruct、Mistral-7B-Instruct-v0.3 和Qwen 2.5-7B-Instruct 为基座模型进行监督式微调,利用ROUGE 和BLEU 作为评价指标评估微调后模型性能;最后,采用人工评估对LLaMA 3.1、GPT-4o、Mistral、Qwen 2.5 和薪火大模型的问答结果进行评价,自动评估和人工评估结果表明薪火大模型在准确性、专业性和可用性等方面表现最优。研究成果为农机装备全生命周期管控及智慧农业发展提供了有力工具和手段。

    Abstract:

    The development of large language models (LLMs) has significantly propelled the latest advancements in natural language processing (NLP). These models have been built upon complex deep learning architectures, typically Transformer, characterized by billions of parameters and extensive training data, enabling them to achieve high precision across a variety of tasks. However, the absence of agricultural machinery-specific textual data for training in existing general large models has severely limited their performance in the research, development, manufacturing, and application of agricultural machinery. To address this issue, the specific needs of agricultural machinery for large models were analyzed and a Chinese-compatible agricultural machinery large model named "LeiSi" was proposed, catering to various groups such as university faculty, students, designers, and users. The overall architectural design of the LeiSi LLM was outlined, which included three parts: LeiSi-torch, LeiSi-ingenuity, and LeiSi-plough, aiming to provide target groups with diversified and customized services such as agricultural machinery professional knowledge Q&A, agricultural machinery design and manufacturing advice, and agricultural machinery field operation control. Subsequently, the LeiSi-torch large language model was taken as an example to elucidate the construction of a Chinese agricultural machinery dataset and the methods for model fine-tuning and automatic evaluation. Utilizing LLaMA 3.1-8B-Instruct, Mistral-7B-Instruct-v0.3, and Qwen 2.5-7B-Instruct as base models, supervised fine-tuning on each was conducted and the performance of the fine-tuned models was evaluated by using ROUGE and BLEU as evaluation metrics. Finally, manual evaluation was employed to assess the Q&A results of LLaMA 3.1, GPT-4o, Mistral, Qwen 2.5, and the LeiSi-torch model. The results from both automatic and manual evaluations indicated that the LeiSi-torch model demonstrated superior performance in terms of accuracy, professionalism, and usability. The research outcomes can provide insights and references for the development of intelligent agricultural machinery and smart agriculture.

    参考文献
    相似文献
    引证文献
引用本文

栗晓宇,王玉聪,杜岳峰,李国润,刘磊,宋正河.中文农机大语言模型耒耜构建方法[J].农业机械学报,2026,57(5):387-397. LI Xiaoyu, WANG Yucong, DU Yuefeng, LI Guorun, LIU Lei, SONG Zhenghe. Construction Method of Large Language Model for Chinese Agricultural Machinery LeiSi[J]. Transactions of the Chinese Society for Agricultural Machinery,2026,57(5):387-397.

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-11-29
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-03-01
  • 出版日期:
文章二维码