ZORO – PROS

ZORO is a zero-shot multimodal framework that enables process mining analysis of robotic behavior by leveraging foundation models to perform activity recognition from visual, auditory, and textual data. The framework includes a fusion module that integrates activities across modalities to produce the final holistic event log. ZORO runs foundation models locally, preserving the privacy of fine-grained multimodal data.

Source Code

Validation Data

The Framework

ZORO follows a pipeline-based architecture that transforms fine-grained multimodal data into structured event logs. Each modality is processed independently of the others and yields its own modality-specific event log. Depending on the available data and the requirements of the analysis, the system can operate on all supported modalities or on any subset of them. It also does not assume a one-to-one correspondence between modalities and inputs: multiple inputs of the same modality (for example, several video streams) can be processed. This design lets ZORO adapt to heterogeneous sensing configurations without imposing strict assumptions on the number or type of available inputs. Finally, the fusion module integrates the modality-specific event logs into a single multimodal event log.
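To make the fusion step concrete, the following is a minimal sketch of one possible fusion strategy, not ZORO's actual implementation. It assumes each input has already produced a list of timed events, and it merges events that carry the same activity label and overlap in time, whichever input they came from. The `Event` class, the `fuse_logs` function, and the overlap rule are all hypothetical names and choices introduced for illustration.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List


@dataclass(frozen=True)
class Event:
    """A recognized activity with a time span (hypothetical schema)."""
    activity: str
    start: float              # seconds from the start of the recording
    end: float
    sources: FrozenSet[str]   # modalities that observed this activity


def fuse_logs(logs: Dict[str, List[Event]]) -> List[Event]:
    """Fuse per-input event logs into one log.

    Assumed strategy: events with the same activity label whose time
    spans overlap are merged into a single event that covers both spans
    and records every contributing modality.
    """
    # Flatten all inputs and order events chronologically.
    events = sorted(
        (e for log in logs.values() for e in log),
        key=lambda e: (e.start, e.end),
    )
    fused: List[Event] = []
    latest: Dict[str, int] = {}  # activity -> index of its latest fused event
    for e in events:
        idx = latest.get(e.activity)
        if idx is not None and e.start <= fused[idx].end:
            prev = fused[idx]
            # Overlap with the latest event of the same activity: merge.
            fused[idx] = Event(
                prev.activity,
                prev.start,
                max(prev.end, e.end),
                prev.sources | e.sources,
            )
        else:
            latest[e.activity] = len(fused)
            fused.append(e)
    return fused


# Two video inputs and one audio input: several inputs may share a modality.
logs = {
    "camera_1": [Event("pick", 0.0, 2.0, frozenset({"visual"})),
                 Event("move", 3.0, 5.0, frozenset({"visual"}))],
    "microphone_1": [Event("pick", 1.0, 2.5, frozenset({"auditory"}))],
    "camera_2": [Event("move", 6.0, 7.0, frozenset({"visual"}))],
}
fused = fuse_logs(logs)
```

Keying the input dictionary by input name rather than by modality mirrors the point above that inputs and modalities need not correspond one-to-one.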

The Implementation

The tool supports two complementary execution modes. An interactive mode is offered through a graphical user interface, which enables exploratory analysis, configuration of modalities, prompts, and fusion strategies, and inspection of final results. In addition, a batch mode is available to support automated analyses and experimentation, allowing the framework to be applied programmatically to collections of robotic data.
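As an illustration of how a batch run over a collection of robotic data might be prepared, the sketch below scans a root folder with one subfolder per recording and groups each recording's files by modality. It does not use ZORO's batch interface, which is not described here. The folder layout, the `plan_batch` function, and the extension-to-modality mapping are assumptions made for this example only.

```python
from pathlib import Path
from typing import Dict, List

# Assumed mapping from file extension to modality (illustrative only).
MODALITY_BY_SUFFIX = {
    ".mp4": "visual",
    ".wav": "auditory",
    ".txt": "textual",
}


def plan_batch(root: Path) -> List[dict]:
    """Build one job per recording folder found under `root`.

    Each job lists the recording's input files grouped by modality.
    A modality may have several inputs, and a recording may lack some
    modalities entirely. Folders with no recognized input are skipped.
    """
    jobs: List[dict] = []
    for recording in sorted(p for p in root.iterdir() if p.is_dir()):
        inputs: Dict[str, List[Path]] = {}
        for f in sorted(recording.iterdir()):
            modality = MODALITY_BY_SUFFIX.get(f.suffix.lower())
            if f.is_file() and modality is not None:
                inputs.setdefault(modality, []).append(f)
        if inputs:
            jobs.append({"recording": recording.name, "inputs": inputs})
    return jobs
```

Each returned job could then be handed to whatever analysis entry point the batch mode exposes; separating planning from execution keeps the input-discovery logic testable on its own.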

The figure shows an example of the ZORO graphical user interface, where input data have been selected, a fusion strategy has been defined, and the analysis can be executed.


Monitoring & Controlling / Tool · December 29, 2025
