第5章

先进传感、感知与分析

Advanced Sensing, Perception, and Analytics for Manufacturing

Lingbao Kong*, Qiyuan Wang, Xinlan Tang
复旦大学未来信息创新学院，中国上海
电子邮件：LKong@fudan.edu.cn

现状

在工业5.0持续推进的背景下，智能制造已成为工业制造领域的前沿焦点。在此语境下，传感技术作为连接物理世界与数字信号系统的关键桥梁，正经历从传统范式向智能范式的深刻变革。在当前多域制造主导的时代，传统的单模态传感技术因其单一信息维度、抗干扰能力弱、校准与维护成本高等局限而面临严峻挑战。相应地，如图1所示，多模态传感技术凭借信息丰富、抗干扰冗余度高、校准与维护成本低等优势，正在各类复杂或动态场景中逐步取代单模态传感技术。这一转变有效规避了单模态传感在新工业环境中遭遇的挑战，同时更好地契合了现代先进制造的迫切需求。

与单模态传感相比，多模态传感技术已在三个关键方面取得重大突破。其一在于感知能力的全面提升。通过利用多元传感探测器，多模态传感实现了跨模态信息互补。例如，如图1所示，在汽车零部件表面划痕或涂层缺陷检测中，整合工业相机、三维激光扫描仪、红外热成像仪等多种传感模态，可将缺陷检出率从90%提升至99.5%，同时将误报率降低60%[3]。其二是鲁棒性与抗干扰能力的显著改善。在机器人抓取场景中，即使发生视觉遮挡或盲区，机器人仍可通过触觉与力反馈动态调整动作[4]。这种适应性确保了任务连续性与准确性不受环境扰动影响。其三聚焦于智能与系统集成创新。一方面，通过多模态深度学习实现跨模态语义对齐，引入自监督机制以降低对标注数据的依赖。例如，在视频数据处理中，视觉、听觉与运动信息被自动关联，从而增强模型的泛化能力[5]。另一方面，采用边缘计算实现系统集成中的实时处理，减轻了对云端基础设施的依赖。在智能物流中，借助AGV（自动导引车）导航与避障技术，障碍响应时间可缩短至仅100毫秒[6]。

当前及未来挑战

尽管多模态传感技术在智能制造领域展现出巨大潜力，其发展仍面临诸多挑战。

在硬件层面，最突出的障碍在于不同传感器物理特性的显著差异，这极大地增加了硬件组件无缝集成与校准的复杂性。例如，在智能物流应用中，LiDAR（光检测与测距）传感器的高功耗与相机的低功耗需求形成鲜明对比，亟需设计复杂的电路与精密的冷却方案。此外，昂贵的生产成本构成另一 formidable 障碍。在预测性维护场景中，单个传感器的部署成本可超过200万货币单位，且在分析后续数据时还需投入大量资源用于AI模型训练与维护。

转向数据与算法领域，跨模态数据融合面临重重困难。这些挑战包括不同传感器间的语义对齐问题——例如，在自动化焊接过程中，弧焊传感器、高速相机、红外测温仪与声发射传感器的数据在数据类型与物理意义上差异巨大[7]——以及多协议异构网络的整合。例如，在工业物联网背景下，5G网络与现有工业总线的有效融合仍是难以实现的目标[8]。此外，计算复杂度与实时性要求极为严苛。以自动上料任务为例，LiDAR点云数据（每帧点数）与4K分辨率相机图像的融合必须在严格的100毫秒时间窗口内完成[9]，给计算资源带来巨大压力。

图1. 以传统单模态技术为中心的制造范式与现代先进制造的发展日益不兼容。相应地，多模态传感技术通过利用多维信息协调与互补数据融合，常产生超越孤立模态累积性能的协同效应（即实现"1+1>2"效果）。这些技术正在复杂工业场景中逐步取代单模态方法，并已成为当代先进制造系统的显著特征。

应对挑战的科学技术进展

为应对上述挑战，多模态传感技术必须在多个前沿实现突破。数字孪生技术的集成为高昂的硬件成本提供了可行解决方案。通过利用多模态传感器数据驱动虚拟工厂仿真，该技术实现了生产策略优化，最终达到降低成本与提升效率的目标[10]。

边缘智能与自学习系统的持续发展为缓解跨模态数据融合复杂度与计算需求提供了有效解决方案。边缘智能通过集成AI芯片，在终端设备层面实现多模态数据的实时融合。这不仅满足了严格的实时性要求，还确保了数据的有效整合[11]。另一方面，自学习系统采用强化学习动态优化多传感器间的权重分配，从而降低计算复杂度、增强数据可靠性、减少冗余数据——最终减轻计算负担[12]。

此外，人机协作监测技术体现了新工业范式以人为中心的理念。该技术利用卡尔曼滤波进行空间对齐，构建动态安全区域实时追踪工人手部位置。由此降低了人机协作事故发生率并提升了生产效率。这些技术的发展不仅弥补了多模态传感在实际应用中的当前缺陷，更推动了机械制造技术向更高效率、智能化与和谐化方向发展[13]。

展望未来，智能制造技术将与数字孪生、边缘智能、自学习系统及人机协作监测实现更深层次融合。这一融合将推动制造系统向"零缺陷"、"自我感知"与"以人为本"的目标迈进，标志着制造范式演进的重要飞跃。

结语

多模态传感技术推动了传统单模态传感方法向更高效率与智能化的演进。通过整合多元信息获取模态、增强抗干扰能力、开创新型智能化集成系统，该技术有效突破了制造环境中的鲁棒性壁垒。它不仅强化了制造工艺标准化与缺陷检测精度，还为制造系统提供了高保真度、高可靠性的数据基础。因此，多模态传感技术已成为智能制造生态系统中智能感知的基石。

同时，数字孪生技术、边缘集成架构与自学习系统的融入进一步加速了多模态传感技术的智能化与小型化轨迹，为当代工业奠定了不可或缺的基础设施。除这些技术进步外，以人为本原则的深度融合仍是多模态传感技术发展中的关键里程碑。通过将以人为本置于智能制造的核心，这一范式转变巩固了智能制造与时代精神无缝契合的基础，确保技术进步与人类需求及愿景内在关联。

致谢

作者谨对国家自然科学基金（52375414）与上海市科委创新专项（23ZR1404200）的支持表示衷心感谢。

参考文献

[1] . By digitizing domain expertise and leveraging large-scale process data, autonomous systems provide scalable and adaptive alternatives. Recent advances in artificial intelligence (AI) have enabled these systems to autonomously incorporate real-time feedback, allowing for predictive quality assurance, anomaly detection, and self-optimization of process parameters. The implementation of AI-driven autonomy has been shown to significantly enhance operational efficiency, reduce overhead costs, and improve system resilience—particularly in globally distributed manufacturing environments where access to expert knowledge is limited. Empirical evidence highlighted the effectiveness of such technologies; for example, the implementation of an autonomous quality management system in the automotive manufacturing sector resulted in a 52% reduction in production costs and a 78% decrease in inspection expenses

[2] . Moreover, autonomous manufacturing technologies are expected to exhibit broad applicability across diverse operational domains, including quality control, logistics, energy management, equipment maintenance, and comprehensive process optimization. Current and future challenges Achieving truly AI-enabled autonomous manufacturing requires seamless integration of three foundational components—sensing, reasoning, and action—while also establishing a robust platform for managing the integrated autonomous manufacturing system. In the sensing stage, manufacturing systems must establish robust and scalable data pipelines capable of reliably extracting, pre-processing, storing, and managing diverse multimodal sensor data. Despite the abundance of available data, current pipeline architectures are often underdeveloped compared to the overall maturity of production systems. These pipelines are frequently designed without sufficient consideration for downstream reasoning and control tasks. Consequently, the acquired data suffers from data availability issues—such as noise, low resolution, inconsistent sampling, an excessive amount of data and poor synchronization with system context—which hinders the systems’ ability to transmit only relevant, high-quality data necessary for real-time decision-making and autonomous operation

[3] . The reasoning stage involves deriving actionable insights support process-level decisions. At this stage, two central challenges arise: ensuring the interpretability and generalization of AI models. For AI systems to contribute effectively to manufacturing operations, they must provide structured information across key categories, including current and predicted system states (system assessment), identified operational tasks (problem definition), causal factors (root cause diagnosis), and prescriptive recommendations (decision- making). However, many AI models operate as “black boxes,” hindering engineers' ability to verify or trust the inferred outputs. Generalization also remains problematic, as models often struggle to maintain robust performance under domain shifts, such as variations in operating conditions, product configurations, or factory environments, leading to physically inconsistent or non-representative results

[4] . The action stage requires translating reasoning outputs into executable operations, such as control commands, optimal setpoint selection, or human-readable decision reports. Despite recent progress in AI, current AI models often produce outputs in abstract or model-centric forms that lack the semantic clarity necessary for effective interpretation and implementation within manufacturing systems. Without additional Journal XX (XXXX) XXXXXX A Author et al 31 contextualization, these outputs are not readily actionable, requiring engineers to manually interpret the reasoning results and determine appropriate interventions, thereby increasing cognitive burden and delaying operational response

[5] . Lastly, current platforms such as manufacturing execution system (MES), and programmable logic controller (PLC) are hierarchical and lack the flexibility to support autonomous manufacturing operations. Key challenges include poor interoperability across distributed manufacturing components, limited support for real-time self-organization and manufacturing lifecycle integration. Advances in science and technology to meet challenges In the sensing stage, data pipelines integrated with extract-transform-load (ETL) mechanisms are employed to convert raw signals into structured, analysis-ready formats

[6] . Virtual sensing techniques are utilized to estimate difficult-to-measure variables by leveraging data acquired from the manufacturing process

[7] . To enhance data quality and contextual fidelity, pre-processing methods such as noise removal, sampling rate alignment, and synchronization of heterogeneous data sources are applied

[8] . Additionally, ontology-based technologies have been developed to define the identity of collected data and establish contextual relationships among correlated information

[9] . By enabling context-aware data linkage and semantic interpretation, it facilitates data filtering and selection in subsequent stages, despite the abundance and heterogeneity of manufacturing data. In the reasoning stage, interpretability has been advanced through explainable AI techniques, including pre-modelling strategies such as domain-informed feature extraction, as well as post-modelling tools such as attention mechanism analysis, Shapley additive explanations (SHAP)

[10] . To improve generalization under domain shifts, lifecycle-aware learning strategies are employed to support data drift detection and continual learning

[19] . Building on these advances, a decentralized autonomous manufacturing (DAM) platform architecture was introduced to enable autonomous decision-making, decentralized control, and self-organizing production capabilities

[20] . By utilizing multi-agent systems and secure communication protocols, the platform allows distributed manufacturing nodes to collaborate effectively, respond to disruptions, and execute manufacturing tasks without centralized coordination.