返回文献集

结论

Conclusion

本文将语义训练鸿沟识别为工业 AI 智能体系统中的结构性失效类别。鸿沟源于统计语言模型通过训练获取领域词汇,但不学习赋予该词汇操作含义的关系本体结构。本文演示两种失效模式:单智能体系统中的工具调用幻觉(对照实验 72 次查询、六种行业配置中 43% 无约束工具调用参数为伪造标识符),以及多智能体系统中的语义漂移(无共享本体约束的智能体渐进发散)。为应对这些失效模式,本文提出基于本体约束的工具架构:类型化关系配置(每领域 45 导出、700–770 行),运行时由三操作接口契约(resolve、contextualize、annotate)消费,不变量由 AIOps 编排层强制。本体约束的工具参数消除工具调用幻觉(0% 与 43%),覆盖全部六种行业配置和全部 12 个分析工具。架构在数字孪生仿真环境中验证,演示跨领域可配置性:单一应用代码库配合领域特定本体配置,在航空航天、制药、汽车、电子、食品饮料和仓储自动化配置中产生正确、语义锚定的结果。未来工作包括与 OPC UA 和 AutomationML 集成以实现自动本体填充、配置接口形式公理化以进行设计时一致性检查、多模型幻觉基准测试,以及对照既有工厂制造环境生产 MES 数据的验证。


利益冲突声明

作者受雇于西门子数字工业软件。研究作为作者职责的一部分进行。雇主未参与研究设计、数据分析或投稿决定。

生成式 AI 与 AI 辅助技术声明

在本文准备过程中,作者使用 Claude(Anthropic)进行起草、编辑和文稿结构化。使用该工具后,作者按需审阅和编辑内容,并对发表文章的全部内容承担完整责任。

CRediT 作者贡献

Grama Chethan:概念化、方法论、软件、验证、调查、数据管理、撰写—原稿、撰写—审阅与编辑、可视化。

数据可用性

六种本体配置模块、72 查询幻觉实验数据集(两条件下查询文本、工具调用参数和结果)及校准结果(60 次仿真运行)可向通讯作者索取。仿真框架源代码维护于西门子私有仓库,审阅人可应要求获取。

致谢

本工作于西门子数字工业软件完成。作者感谢平台工程团队在多智能体平台验证期间提供的基础设施支持。

参考文献
  1. ISA. ANSI/ISA-5.1-2022: Instrumentation symbols and identification. International Society of Automation; 2022.
  2. IEC. IEC 61131-3:2013: Programmable controllers – Part 3: Programming languages. International Electrotechnical Commission; 2013.
  3. IPC. IPC-9850: Surface mount placement equipment characterization. Association Connecting Electronics Industries; 2020.
  4. Wang L, Ma C, Feng X, Zhang Z, Yang H, Zhang J, et al. A survey on large language model based autonomous agents. Front Comput Sci 2024;18(6):186345.
  5. Xi Z, Chen W, Guo X, He W, Ding Y, Hong B, et al. The rise and potential of large language model based agents: a survey. arXiv:2309.07864; 2023.
  6. Gruber TR. Toward principles for the design of ontologies used for knowledge sharing. Int J Hum-Comput Stud 1995;43(5–6):907–28.
  7. Guarino N, Oberle D, Staab S. What is an ontology? In: Handbook on ontologies. Springer; 2009.
  8. Lemaignan S, Siadat A, Dantan JY, Semenenko A. MASON: a proposal for an ontology of manufacturing domain. Proc IEEE Workshop on Distributed Intelligent Systems. 2006.
  9. Usman Z, Young RIM, Chungoora N, Palmer C, Case K, Harding JA. Towards a formal manufacturing reference ontology. Int J Prod Res 2013;51(22):6553–72.
  10. Biffl S, Lüder A, Gerhard D, editors. Multi-disciplinary engineering for cyber-physical production systems. Springer; 2017.
  11. ISA. ANSI/ISA-95 (IEC 62264): Enterprise-control system integration. International Society of Automation; 2010.
  12. MESA International. B2MML: Business to Manufacturing Markup Language, Version 7.0. 2018.
  13. Scholten B. The road to integration: a guide to applying the ISA-95 standard in manufacturing. ISA; 2007.
  14. Vegetti M, Leone HP, Henning GP. PRONTO: an ontology for comprehensive and consistent representation of product information. Eng Appl Artif Intell 2011;24(8):1305–27.
  15. IEC. IEC 62541: OPC Unified Architecture. International Electrotechnical Commission; 2020.
  16. IEC. IEC 62714: AutomationML – Engineering data exchange format. International Electrotechnical Commission; 2018.
  17. Li X, Liu H, Wang W, Zheng Y, Lv H, Lv Z. Big data analysis of the internet of things in the digital twins of smart city based on deep learning. Future Gener Comput Syst 2022;128:167–77.
  18. Zhou B, Bao J, Li J, Lu Y, Liu T, Zhang Q. A novel knowledge graph-based optimization approach for resource allocation in discrete manufacturing workshops. Robot Comput-Integr Manuf 2022;71:102160.
  19. Pan S, Luo L, Wang Y, Chen C, Wang J, Wu X. Unifying large language models and knowledge graphs: a roadmap. IEEE Trans Knowl Data Eng 2024;36(7):3580–99.
  20. Schick T, Dwivedi-Yu J, Dessì R, Raileanu R, Lomeli M, Hambro E, et al. Toolformer: language models can teach themselves to use tools. Adv Neural Inf Process Syst 2023;36.
  21. Patil SG, Zhang T, Wang X, Gonzalez JE. Gorilla: large language model connected with massive APIs. arXiv:2305.15334; 2023.
  22. Anthropic. Tool use (function calling) with Claude. Anthropic Documentation; 2024.
  23. Rebedea T, Dinu R, Sreedhar M, Parisien C, Cohen J. NeMo Guardrails: a toolkit for controllable and safe LLM applications with programmable rails. EMNLP System Demonstrations. 2023.
  24. Ji Z, Lee N, Frieske R, Yu T, Su D, Xu Y, et al. Survey of hallucination in natural language generation. ACM Comput Surv 2023;55(12):1–38.
  25. Huang L, Yu W, Ma W, Zhong W, Feng Z, Wang H, et al. A survey on hallucination in large language models. arXiv:2311.05232; 2023.
  26. Negri E, Fumagalli L, Macchi M. A review of the roles of digital twin in CPS-based production systems. Procedia Manuf 2017;11:939–48.
  27. Riddick F, Lee YT. Representing layout information in the CMSD specification. Proc Winter Simulation Conference. 2011.
  28. Patki N, Wedge R, Veeramachaneni K. The Synthetic Data Vault. Proc IEEE DSAA. 2016.
  29. Chethan G. The data layer nobody builds: how template-as-ontology alignment enables cross-domain synthetic data for industrial AI validation. Manuscript in preparation; 2025.
  30. Hodge G. Systems of knowledge organization for digital libraries. CLIR; 2000.
  31. PRI. NADCAP: National Aerospace and Defense Contractors Accreditation Program. Performance Review Institute; 2024.
  32. Sculley D, Holt G, Golovin D, Davydov E, Phillips T, Ebner D, et al. Hidden technical debt in machine learning systems. Adv Neural Inf Process Syst 2015;28:2503–11.
  33. Noy NF, McGuinness DL. Ontology development 101. Stanford KSL Technical Report KSL-01-05; 2001.
  34. FDA. 21 CFR Part 11: Electronic records; electronic signatures. U.S. FDA; 2003.
  35. IPC. IPC-A-610: Acceptability of electronic assemblies. 2021.
  36. IATF. IATF 16949:2016: Quality management system requirements for automotive production. 2016.
  37. Dang Y, Lin Q, Huang P. AIOps: real-world challenges and research innovations. Proc ICSE-SEIP. 2019.
  38. Notaro P, Cardoso J, Gerndt M. A systematic mapping study in AIOps. Proc ICSOC. 2020.
  39. Anthropic. Model Context Protocol specification. 2024.
  40. Bender EM, Gebru T, McMillan-Major A, Shmitchell S. On the dangers of stochastic parrots. Proc ACM FAccT. 2021.