Version: v2.4

Application Overview

When you first log in to Snowflake, you can find your apps under the Data Products / Apps menu. Click the "mpmX" app to open it.

Welcome Page

The Snowflake Process Mining App starts with the Welcome page. Here you can view all available process mining scenarios, create new ones, and edit or delete existing ones. Each scenario contains all the details needed to mine your data, including the data source definitions within Snowflake and the process mining parameters.

The "✎" button takes you to the Data Source screen, where you can start editing an existing Process Scenario.

The "⛌" button allows you to delete an existing Process Scenario. You must confirm the deletion.

Create a Process Scenario

The "➕ Create a Process Scenario" button on the Welcome page takes you to a new screen.

Here you enter the Process scenario name and Process scenario description of the new Process Scenario.

You can also decide whether you want to create an OCPM Process Scenario and whether you want to Enable Time Travel.

Data Sources Tab

Data sources are the starting point for process mining.

Each data source must adhere to a defined data structure. Our validation process ensures that everything is formatted correctly before moving forward.

Select an Event Log

An event log is the central and mandatory input for process mining. It is a table that contains case and event information. To further enrich the event log with your domain specific data, you can also add additional Case and Event Dimensions.
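To make the idea of "case and event information" concrete, here is a minimal sketch of an event log as rows. The column names (`case_id`, `activity`, `timestamp`) are assumptions chosen for illustration; the structure the app actually validates may use different names and additional columns.

```python
from datetime import datetime

# Hypothetical column names for illustration only; the structure the app
# actually validates may differ.
event_log = [
    {"case_id": "PO-1", "activity": "Create Order", "timestamp": datetime(2024, 1, 1, 9, 0)},
    {"case_id": "PO-2", "activity": "Create Order", "timestamp": datetime(2024, 1, 1, 9, 30)},
    {"case_id": "PO-1", "activity": "Approve Order", "timestamp": datetime(2024, 1, 1, 10, 0)},
    {"case_id": "PO-1", "activity": "Ship Goods", "timestamp": datetime(2024, 1, 2, 8, 0)},
]


def activities_by_case(rows):
    """Group activities per case, ordered by timestamp."""
    cases = {}
    for row in sorted(rows, key=lambda r: r["timestamp"]):
        cases.setdefault(row["case_id"], []).append(row["activity"])
    return cases


paths = activities_by_case(event_log)
```

Each case ends up with the ordered path of activities it went through, which is the raw material for process variants.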

The "➕ Select Table as Event Log" button allows you to reference a table that you can access as an event log. You can change the type of event log before selecting a reference. The dropdown options are Table or View. Apache Iceberg Tables are supported through the Table option.

Click the "🗑️" button to remove an existing reference, which then allows you to select a different Event Log.

Dimensions

The "➕ Select Table as Case Dimensions" and "➕ Select Table as Event Dimensions" buttons allow you to reference tables that you can access as additional case and event dimensions respectively. As with the event log, you can change the source type before selecting a reference. The dropdown options are Table or View. Apache Iceberg Tables are supported through the Table option.

Click the "🗑️" button to remove an existing reference, which then allows you to select a different Case Dimensions or Event Dimensions table.

Grouping Tab

By defining groups, you can combine related activities to get a better overview of your processes in the process analyzer. Groups can be defined in a hierarchy by configuring one group as the parent of another.

To add a new group, click the "➕" button.

Fill out the Name field with the name of the group.

The Parent Group field lets you define a parent group. Parent groups are optional. A group cannot reference itself as its parent.

Use the Activities field to define which activities belong to your group.

Click on the "˅" to open a list of activities to choose from.
Select as many activities as you like, but each activity can only belong to one group.
Click the "X" next to an activity name to delete it.

To edit an existing group, simply change the field values as required.

To delete an existing group, click the "🗑️" button next to the group.

Click the blue "Save" button when you are finished.
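The two grouping rules described above (a group cannot be its own parent, and each activity can belong to only one group) can be sketched as a small check. This is an illustrative sketch, not the app's validation logic; the dictionary layout and the function name `validate_groups` are assumptions.

```python
def validate_groups(groups):
    """Return a list of problems found in a group configuration.

    `groups` maps a group name to a dict with an optional "parent" name
    and a list of "activities" (hypothetical layout for illustration).
    """
    problems = []
    owner = {}  # activity -> first group that claimed it
    for name, spec in groups.items():
        if spec.get("parent") == name:
            problems.append(f"group '{name}' references itself as parent")
        for activity in spec.get("activities", []):
            if activity in owner:
                problems.append(
                    f"activity '{activity}' is in both '{owner[activity]}' and '{name}'"
                )
            else:
                owner[activity] = name
    return problems


groups = {
    "Ordering": {"parent": None, "activities": ["Create Order", "Approve Order"]},
    "Approval": {"parent": "Ordering", "activities": ["Approve Order"]},
    "Payment": {"parent": "Payment", "activities": ["Pay Invoice"]},
}
problems = validate_groups(groups)
```

In this example, `problems` reports the activity shared by "Ordering" and "Approval" and the self-referencing "Payment" group.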

Subprocess Leadtime Tab

Define subprocesses to calculate lead times and other time-related measures for partial processes. To understand why this is useful, take a look at the Subprocess Leadtime » Use Cases.

Add a New Subprocess

To add a new subprocess, click the "➕" button and fill out the fields appropriately.

The Subprocess field lets you define a name for your subprocess.

Use the Start Activity dropdown to select one or more activities that mark the start of the subprocess.

The Include Start Activity checkbox below the Start Activity field defines whether the duration of the activity is included in or excluded from the calculated lead time.

Sometimes more than one activity marked as the Start Activity shows up in a process variant, either because two different start activities appear or because a single start activity is repeated. The Min/Max Start Activity field then defines whether the first or the last matching activity is used for the lead time calculation. Min uses the first matching activity (resulting in a longer subprocess lead time), while Max uses the last matching activity (resulting in a shorter subprocess lead time).

Use the End Activity field to select one or more activities that mark the end of the subprocess.

The Include End Activity checkbox defines whether the duration of the activity is included in or excluded from the calculated lead time.

By configuring the Min/Max End Activity field, you can define whether the first or the last matching end activity is used for the lead time calculation. Here the impact is the opposite of that for the Start Activity: Min uses the first matching activity (resulting in a shorter subprocess lead time), while Max uses the last matching activity (resulting in a longer subprocess lead time).
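The effect of the Min/Max settings can be illustrated with a small sketch. It treats each event as a single point in time, so the Include Start Activity and Include End Activity options are not modelled here; the function name and the tuple layout are assumptions, not the app's implementation.

```python
from datetime import datetime, timedelta


def subprocess_lead_time(events, start_activities, end_activities,
                         start_pick="Min", end_pick="Max"):
    """Lead time between the chosen start and end events of one case.

    `events` is a list of (activity, timestamp) tuples. "Min" picks the
    first matching event, "Max" the last. Returns None if no start or
    no end event is found.
    """
    starts = [ts for act, ts in events if act in start_activities]
    ends = [ts for act, ts in events if act in end_activities]
    if not starts or not ends:
        return None
    start = min(starts) if start_pick == "Min" else max(starts)
    end = min(ends) if end_pick == "Min" else max(ends)
    return end - start


case = [
    ("Create Order", datetime(2024, 1, 1, 9, 0)),
    ("Create Order", datetime(2024, 1, 1, 11, 0)),   # repeated start activity
    ("Ship Goods", datetime(2024, 1, 1, 15, 0)),
    ("Ship Goods", datetime(2024, 1, 1, 17, 0)),     # repeated end activity
]

longest = subprocess_lead_time(case, {"Create Order"}, {"Ship Goods"}, "Min", "Max")
shortest = subprocess_lead_time(case, {"Create Order"}, {"Ship Goods"}, "Max", "Min")
```

With Min for the start and Max for the end, the lead time spans 09:00 to 17:00 (8 hours); with Max for the start and Min for the end, it spans 11:00 to 15:00 (4 hours).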

The Target Time [d] and Target Time Operator fields define the desired lead time for the subprocess. They are used to calculate whether the target lead time is missed.
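As a rough illustration of how a target time and an operator combine into a "missed" flag, consider the sketch below. The set of operators is an assumption; this guide does not list the options the Target Time Operator field offers.

```python
import operator

# Assumed operator set; the app's Target Time Operator options may differ.
OPERATORS = {
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
}


def target_missed(lead_time_days, target_days, target_operator="<="):
    """Return True if the lead time does not satisfy the target condition."""
    return not OPERATORS[target_operator](lead_time_days, target_days)


late = target_missed(6.5, 5)      # 6.5 days against a target of <= 5 days
on_time = target_missed(4.0, 5)   # 4 days against a target of <= 5 days
```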

Don't forget to Save your changes.

End of Process Tab

By default, the mpmX app shows lead times and other information as an average over all cases, but this can sometimes be misleading.

For example, cases that are not finished will have shorter lead times simply because they have not yet reached the end.
The number of process variants (the specific path each case takes) will also be greater if you include both open and closed cases than if you count only closed cases.

On the End of Process screen you can define when a process is considered to be completed.

If the End of process condition is set to None, all processes are considered to be completed.
Select using Activity to choose an activity that marks a process as completed.
Select using Custom Field to define more complex conditions.
- On the left, define which field of the activity log should be used.
- In the middle, select the appropriate operator (= or IN).
- On the right, define the value (for =) or the comma-separated list of values (for IN) that marks a process as completed.

[Figure: End of Process - using Custom Field]
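The three condition types can be sketched as follows. This is an illustration under assumptions: the mode strings, the event dictionary layout, and the rule that a single matching event is enough to mark a case as completed are not taken from the app.

```python
def is_completed(case_events, mode="None", activity=None,
                 field=None, op="=", value=None):
    """Decide whether a case counts as completed.

    `case_events` is a list of dicts, one per event (hypothetical layout).
    Assumes that one matching event is enough to complete the case.
    """
    if mode == "None":
        return True
    if mode == "Activity":
        return any(e["activity"] == activity for e in case_events)
    if mode == "Custom Field":
        if op == "=":
            allowed = {value}
        elif op == "IN":
            # Comma-separated list of values, surrounding spaces ignored.
            allowed = {v.strip() for v in value.split(",")}
        else:
            raise ValueError(f"unsupported operator: {op}")
        return any(str(e.get(field)) in allowed for e in case_events)
    raise ValueError(f"unknown mode: {mode}")


case = [
    {"activity": "Create Order", "status": "OPEN"},
    {"activity": "Close Order", "status": "CLOSED"},
]

by_activity = is_completed(case, mode="Activity", activity="Close Order")
by_field = is_completed(case, mode="Custom Field", field="status",
                        op="IN", value="CLOSED, CANCELLED")
```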

Don't forget to Save your changes.

Time Travel Tab

Time Travel allows you to keep advanced historical information to enable a more detailed analysis of changes over time.

Info: Time Travel is only available for Snowflake Enterprise Edition accounts (and higher).

The Data Retention Time [d] field allows you to configure for how many days historical information is retained. The maximum value is 90 days.

Warning: Retaining historical information results in additional storage costs.

Don't forget to Save your changes.

Additional Customizations Tab

On the Additional Customizations screen you can configure some rework and automation settings as well as some miscellaneous settings.

Rework

A rework event is any activity that indicates an unexpected change, such as a Purchase Order being adjusted or deleted.

By selecting a Rework definition option you can define which kinds of events should be taken into account when calculating rework related measures. Options are:

<empty> - rework related measures will not be calculated
ReworkEvent - events are considered if they are defined as rework (see Rework Event Expression (by activity type) below)
RepeatedEvent - events are considered if they are repeated in a loop
ReworkAndRepeatedEvent - events are considered if they are defined as rework AND are repeated
ReworkOrRepeatedEvent - events are considered if they are defined as rework OR are repeated

An event is regarded as rework if its activity type matches the Rework Event Expression (by activity type) field.
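The way the Rework definition options combine the two criteria can be sketched as below. Two points are assumptions for illustration: the Rework Event Expression is treated as a regular expression, and an event counts as "repeated" when its activity has already occurred earlier in the same case.

```python
import re


def rework_flags(events, definition, rework_pattern):
    """Flag each event of one case according to the rework definition.

    `events` is an ordered list of (activity, activity_type) tuples.
    Returns one boolean per event.
    """
    seen = set()
    flags = []
    for activity, activity_type in events:
        is_rework = re.search(rework_pattern, activity_type) is not None
        is_repeated = activity in seen
        seen.add(activity)
        if definition == "ReworkEvent":
            flags.append(is_rework)
        elif definition == "RepeatedEvent":
            flags.append(is_repeated)
        elif definition == "ReworkAndRepeatedEvent":
            flags.append(is_rework and is_repeated)
        elif definition == "ReworkOrRepeatedEvent":
            flags.append(is_rework or is_repeated)
        else:
            # Empty definition: rework measures are not calculated.
            flags.append(False)
    return flags


events = [
    ("Create PO", "create"),
    ("Change PO", "change"),
    ("Approve PO", "approve"),
    ("Change PO", "change"),
    ("Approve PO", "approve"),
]

and_flags = rework_flags(events, "ReworkAndRepeatedEvent", "change")
or_flags = rework_flags(events, "ReworkOrRepeatedEvent", "change")
```

In this example only the second "Change PO" is both rework and repeated, while the OR option also flags the first "Change PO" and the repeated "Approve PO".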

Automation

The Automation limit [%] indicates the percentage of automated events that a case needs to have to be labeled as an automated case.
An event is regarded as automated if its user name field matches the Automated Event Expression (by user name) field.
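A case-level automation check along these lines might look like the following sketch. It assumes the Automated Event Expression is a regular expression and that a case is labeled automated when its share of automated events is greater than or equal to the limit; the app may compare differently.

```python
import re


def is_automated_case(user_names, automated_pattern, limit_percent):
    """Return True if the share of automated events reaches the limit.

    `user_names` holds the user name of each event in one case.
    """
    if not user_names:
        return False
    automated = sum(1 for u in user_names if re.search(automated_pattern, u))
    share = 100.0 * automated / len(user_names)
    return share >= limit_percent


# Three of four events were performed by a batch user: 75 % automated.
users = ["BATCH_01", "BATCH_02", "jdoe", "BATCH_03"]
automated_at_70 = is_automated_case(users, r"^BATCH", 70)
automated_at_80 = is_automated_case(users, r"^BATCH", 80)
```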

Miscellaneous

If Reduce timestamps? is checked, the number of distinct timestamps is reduced by rounding seconds and milliseconds to minutes. The lead and process times are not influenced, just the final timestamp output format is shortened to minutes.
If Suppress loops in process paths? is checked, process variants are considered equal if they only differ in how often an activity occurs in succession. All repetitions are kept in the event log but there will be fewer distinct variants.
Check Run pareto analysis to enable Pareto/ABC Analysis.
Don't forget to Save your changes.
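The two options Reduce timestamps? and Suppress loops in process paths? can be illustrated with a short sketch. Truncating to the minute is an assumption here (the app may round instead), and the function names are hypothetical.

```python
from datetime import datetime
from itertools import groupby


def reduce_timestamp(ts):
    """Drop seconds and sub-second precision (assumed truncation)."""
    return ts.replace(second=0, microsecond=0)


def suppress_loops(variant):
    """Collapse direct repetitions of the same activity in a path."""
    return [activity for activity, _ in groupby(variant)]


reduced = reduce_timestamp(datetime(2024, 1, 1, 9, 15, 42, 123000))
collapsed = suppress_loops(["A", "B", "B", "B", "C", "B"])
```

Here `collapsed` is `["A", "B", "C", "B"]`: only repetitions in direct succession are merged, so the later, separate occurrence of "B" is kept.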

Task Execution and History Tab

Task Execution Scheduling

This section allows you to configure a schedule for regular automatic process mining runs by defining a Cron expression and a time zone. To understand how a cron expression is structured, refer to the Snowflake documentation on scheduling with cron: SCHEDULE ... USING CRON <expr> <time_zone>.
Don't forget to Save your changes.
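As a rough guide, a Snowflake cron expression has five space-separated fields: minute, hour, day of month, month, and day of week. The sketch below checks only the field count and shows how the expression and time zone combine in Snowflake's `USING CRON` syntax; the helper name is hypothetical, and the app itself takes the two values as separate inputs.

```python
def snowflake_schedule(cron_expr, time_zone):
    """Build a 'USING CRON <expr> <time_zone>' schedule string.

    Only checks that the expression has five fields; it does not
    validate the individual field values.
    """
    fields = cron_expr.split()
    if len(fields) != 5:
        raise ValueError(
            "expected 5 cron fields: minute hour day-of-month month day-of-week"
        )
    return f"USING CRON {' '.join(fields)} {time_zone}"


# Every day at 06:00 in the Europe/Berlin time zone.
schedule = snowflake_schedule("0 6 * * *", "Europe/Berlin")
```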

Ad-hoc Task Execution

The Start Mining Process button allows you to manually start a process mining task.

Mining Summary of last execution

This section shows the basic KPIs of the process model after a successful mining run.

Task Execution History

The Task Execution History section displays the status of the most recent process mining runs.

The Start (UTC) and End (UTC) columns contain the start and end timestamps of the mining run.
Duration (hh:mm:ss) contains the total time of the mining run.
The States column contains one of the following values:
- Running - The process mining task is running.
- Failed - The process mining task has failed. Error details can be found in the Error Message column.
- Succeeded - The process mining task was successful.
- Scheduled - The process mining task is scheduled to run at the displayed Start (UTC) time.
An Error Message will appear if anything went wrong.
