Discover model

Apromore allows us to discover a process map or a BPMN model from an event log. A process map (a.k.a. directly-follows graph) is a visual representation of the log as a graph where nodes capture process activities and directed arcs between them capture sequential order relations between the activities. For example, an arc going from activity Accept order to activity Check order indicates that in the log, we can observe that process cases flow from Accept order to Check order. Process maps are a simple yet effective means to understand the basic order of relationships between process activities. As such, they are the most common type of model discovered by commercial process mining tools.

View process map

To view the process map, double-click on a log. Alternatively, we can right-click the log Discover model > From full log .

../_images/Discov002.png

A window will open up, showing the process map discovered from the log.

../_images/Discov003.png

We can rearrange an activity by clicking and dragging it accordingly. We can also rearrange multiple activities at once. To select multiple activities, press Ctrl (for Windows) or Command (for Mac) and drag the mouse over  multiple activities.

../_images/Discov026.png

The selected activities can be rearranged by dragging them accordingly.

../_images/Discov056.png

Process map layout

To better understand a process map, displaying the most frequent activities along a horizontal line may be useful. We can horizontally align and lock the most frequent activities in one specific Y-coordinates when displaying the process map in Process Discoverer.

There are three layout options when configuring the process map layout in Process Discoverer.

  • Legacy left-to-right: This is the default option where Apromore randomly places activities along the Y-coordinates.

  • Auto-discovered backbone: When we select this option, Apromore displays the most frequent activities in the process in a specific Y-coordinate (called the backbone line). Less frequent activities are displayed in a Y-coordinate above or below the backbone line.

  • User-defined backbone: When we select this option, we first will select at least two activities and all their connecting arcs. The selected activities will form the backbone line.

To specify the process map layout in Process Discoverer, click on the layout icon.

../_images/Proces002.png

This displays a list of options to set the process map layout.

../_images/Proces003.png

By default, the process map is opened with Legacy left-to-right.

../_images/Proces004.png

To display the most frequent activities on the backbone line, select Auto-discovered backbone.

../_images/Proces005.png

Apromore automatically detects the most frequent activities and align them on the backbone line.

Note

When a filter is applied, Apromore displays the most frequent activity of the filtered process map along the backbone line. In the same way, when the abstraction slider is changed, the most frequent activities at their abstraction setting is aligned on the backbone line.

To display a user-defined backbone, first select at least two activities with their connecting arcs.

Note: The first and last element selected must be an activity not an arc.

../_images/Proces006.png

Select User-defined backbone.

../_images/Proces007.png

Apromore now display the process map with the selected elements forming the backbone line.

../_images/Proces008.png

Note

Two activities in a short loop cannot be part of the same backbone.

Abstraction settings

We can adjust the complexity of the discovered map by increasing or decreasing the frequency or duration of nodes and arcs visualized in our process map. The default values for the nodes and arcs sliders are 100% and 10%, respectively.

We can abstract a process map by Case frequency or Average Duration. For example, if we abstract by case frequency and shift the arcs or nodes slider towards the left, more edges/nodes with low case frequency will be removed from the process map.

../_images/Discov004.png

Similarly, if we abstract by Average Duration and shift the arcs or nodes slider towards the left, more edges/nodes with low average duration will be removed from the process map.

../_images/Discov005.png

We recommend you to use the abstraction sliders as follows.

First, use the Nodes slider to select the percentage of nodes you wish to display in the canvas. For example, if you set the Nodes slider to 80%, Apromore will display only the 80% most frequent nodes and remove the 20% most infrequent.

Next, you may adjust the number of arcs displayed on the canvas. If you set the Arcs slider to 100%, then Apromore will display all the arcs between the current set of nodes. As you move the Arcs slider down from 100% down to 0%, Apromore will hide the most frequent arcs. Importantly, when Apromore hides an arc, it will always ensure the following two properties:

  1. The percentage of displayed nodes does not change. In other words, as you move the Arcs slider down from 100 to 0%, no node will be hidden along the way. If the Nodes slider is set to 80%, you will always see the 80% most frequent nodes, regardless of the position of the Arcs slider. For example, if the Nodes slider is set to 80% and the Arcs slider is set to 10%, Apromore will display 80% of the nodes, and a small number of arcs between these nodes.

  2. Every node has at least one incoming arc and one outgoing arc. In other words, there will be no disconnected nodes or disconnected fragments. This property is important. It ensures that if you switch the current view to the BPMN view, the BPMN model will be fully connected.

Because of the above two requirements, some arcs cannot be hidden. For example, it may happen that a node has many incoming arcs, each of which is relatively infrequent. As you move the slider from 100% to 0%, these infrequent arcs will be hidden. At some point, it may be that only one of the incoming arcs is displayed. This last remaining arc will not be deleted because otherwise the node would be disconnected from the rest.

Note

In fact, Apromore’s arcs slider is designed to ensure a stronger property: Every node will always be on a path from the start node to the end node. This property is required in order to transform the process map into a correct BPMN model.

You can also use the abstraction sliders to remove the most infrequent nodes and the most infrequent arcs, instead of removing the most frequent nodes and arcs. To remove the most infrequent arcs, you should click the invert order toggle icon Discov050. When the invert order to toggle is on, Apromore will:

  • Hide the most infrequent nodes when you move the Nodes slider down from 100% to 0%

  • Hide the most infrequent arcs when you move the Arcs slider from 100% to 0%

Similarly, you can use the abstraction sliders to remove the nodes or arcs with the highest duration. To achieve this behavior, select Average duration in the menu Abstract by located above the sliders. When the Average duration option is selected, Apromore will hide the nodes or arcs with highest average duration (the slowest arcs) when you move the corresponding slider towards 0%. Conversely, if the invert order toggle is in the On state and the option Average duration is selected, Apromore will hide the nodes or arcs with lowest average duration (the faster arcs) when you move the corresponding slider towards 0%.

Note

Due to limitations in screen sizes and resolution, if a discovered process map or BPMN model contains more than 250 nodes, only the most frequent 250 nodes are displayed in Process Discoverer. Similarly, if the discovered process map or model contains more than 1000 arcs, only the most frequent 1000 arcs are displayed. The remaining ones are not visible and cannot be retrieved using the search box in Process Discoverer.

View log statistics

The top bar of Process Discoverer displays the following statistics of the event log:

  • The number of cases in the log

  • The number of case variants of the log. A case variant is a distinct pathway in the process. For example, in an insurance process, the sequence of activities [“Create Insurance Claim”, “Review Insurance Claim”, “Approve Insurance Claim”] is a case variant. In an event log, there are usually other cases that follow the same pathway (case variant).

  • Variance ratio: The variance ratio is a score between 0% and 100%, indicating how “rigid” or “flexible” the process is. A score of 0% means that the process is rigid: All the cases follow exactly the same sequence of activities. A score of 100% means the process is highly unstructured: Every case follows the same sequence of activities. The variance ratio is calculated based on the number of cases and the number of case variants using the following formula: 1 - (number of case variants) / (number of cases)

  • The number of activity instances in the event log.

  • The number of activities in the log.

Note

If we select the Resource perspective, instead of displaying the number of activities in the log, the Process Discoverer will display the number of Resources in the log (and the same applies if we select another perspective).

../_images/Discov057.png

For the number of cases, case variants, activity instances, and activities, Process Discoverer displays the elements in the currently filtered log / the total number of elements in the unfiltered log.

In the screenshot below, no filter is applied. Thus, the number of elements in the filtered and unfiltered log are the same. 19.9% indicates the percentage of cases variants (121) to the total cases (608) is 19.9 percent.

../_images/Discov058.png

In the screenshot below, a filter was applied. 556/608 in Cases, indicates there are 556 cases of the 608 cases in the currently filtered log. This translates to 91.4% of the cases.

../_images/Discov059.png

The top bar also displays:

  • The average, minimum, and maximum case duration

  • The average, minimum, and maximum case cost.

  • The log timeframe which goes from the earliest start timestamp in the log to the latest end timestamp in the log.

../_images/Discov060.png

Note

If no Cost Center has been associated with the log, the average, minimum, and maximum case cost appear as N/A. To learn more on how to associate the cost center to a log, see View by cost.

Inspect cases

We can inspect individual cases using the Case Inspector. To open the Case Inspector, click on the number of cases under the Log statistics section.

../_images/Discov006.png

The Case Inspector window opens.

../_images/Discov007.png

Click on a specific case to visualize it exclusively in the Process Discoverer.

../_images/Discov008.png

To reveal the statistics of an individual activity, we can hover our mouse on the activity.

../_images/Discov061.png

To check the length of the cases, click on the Activity instances column. Cases will be sorted from the shortest to the longest.

../_images/Discov009.png

We can also sort them by the duration of the case by clicking the Duration column.

../_images/Discov010.png

Note

We can sort the case inspector table in ascending or descending order of a column by clicking the column header multiple times.

To download the list of cases and their statistics, click on Download.

../_images/Discov011.png

We can also view a case timeline from the Case Inspector. A case timeline is a chart that displays the timeline of activities as a Gantt chart. To display the case timeline, double-click a case in the case inspector.

../_images/Discov012.png

This displays the case timeline for the selected case containing the case timeline chart (upper part) and the case data from the log (lower part).

../_images/Discov013.png

By default, the case timeline includes the waiting times in the case. We can, however, remove the waiting times by checking “Remove waiting time”. Now, the chart is displayed, ignoring waiting times between activities.

../_images/Discov014.png

In addition, we can display the case timeline for only specific activities. In the case data, select the activities we wish to display.

../_images/Discov015.png

To display the case data alone, click the inverted triangle button right above the slider. We can also drag it to adjust the height of the case data dynamically.

../_images/Discov016.png

We can adjust it back to the default view by clicking Discov051.

../_images/Discov017.png

By default, Apromore displays the case timeline chart with the minimum width. We can however increase this chart width by entering our preferred width and clicking OK.

../_images/Discov018.png

The timeline now appears wider. We can reset to the default width by clicking the reset button Discov052.

../_images/Discov019.png

Finally, we can download the case timeline or the case data.

Click the download button in the top-right corner.

../_images/Discov020.png

Click Download timeline as PNG to download the chart or Download case data as CSV to download the case data.

../_images/Discov021.png

The file is zipped and downloaded.

Inspect case variants

Case variants are cases in an event log with a similar sequence of activities. We can inspect the case variants in Apromore with Process Discoverer. To inspect the case variants, click the number of case variants in the log statistics section.

../_images/Inspec002.png

This opens the Case Variant Inspector, which displays the following statistics:

  • Case variant ID: A unique number assigned to a case variant.

  • Activity instances: The number of activities in the log.

  • Average duration: The average duration of all cases in the case variant.

  • Cases: The number of cases in the case variant.

  • Percentage: The percentage of cases in the case variant to the total number of cases in the log.

../_images/Inspec003.png

We can click a row to display the process map of that case variant.

../_images/Inspec004.png

We can also hover our mouse on a specific activity to see the statistics of that activity.

../_images/Inspec005.png

The case variants in the Case Variant Inspector can also be downloaded locally. To download the case variants, click Download.

../_images/Inspec006.png

This downloads case variants as a CSV file.

Inspect activities

We can inspect an individual activity in Process Discoverer with the the Activity Inspector. To open the Activity Inspector, click on the number of activities under the Activities chart in Log statics section.

../_images/Discov022.png

The Activity Inspector window will open.

image4.8

To check the length of the activity, click on the Activity instances column. To check activity variant frequency, click on Percentage(%) column.

../_images/Discov024.png

To download the list of activities and their statistics, click Download.

../_images/Discov025.png

In business process mining, it is useful for stakeholders to have a detailed understanding of each transition between consecutive activities in a process to enhance operational efficiency and decision making. To analyze the activity transitions in detail, we need to have a way to investigate each incoming and each outgoing arc of an activity.

When we double-click an activity in Process Discover, the Activity Inspector displays the activities that directly follow and the activities that directly precede the selected activity, as well as relevant metrics: relative frequency, average frequency and average duration. Note that in the activity inspector, all predecessors and all successors of an activity are listed, irrespective of the current setting of the abstraction sliders.

../_images/Discov062.png

Note that in the activity inspector, all predecessors and all successors of an activity are listed, irrespective of the current setting of the abstraction sliders.

View by frequency

We can also view the event log’s simple statistics, such as the total, median, minimum, maximum, and average number of times an activity is executed. This information is provided as a label on the activities/arcs, color of activities, and arcs’ thickness. The darker the blue color, the higher the number of times that activity has been observed in the log. The thicker arc, the higher the frequency of that arc. We can use the Frequency drop-down list in the Visualization settings section to view different statistics.

../_images/Discov027.png

View by duration

Additionally, we can view the statistics on the time performance of the activities and arcs in the process map using the Duration drop-down list in the Visualization settings section. These are total, mean, median, minimum, maximum, and average duration of each arc (indicating the waiting time before starting a given activity, once the previous one has been completed), and total, mean, median, minimum, maximum, and average duration of each arc duration of an activity (a.k.a. the activity’s processing time). Suppose the log only has completion timestamps for each activity and not their start timestamp. In that case, these performance statistics will combine both processing time and waiting time into a single time statistics visualized on the arc. At the same time, activities will be shown as having an instantaneous duration.

Like frequency statistics, we can also visualize time performance statistics via labels on activities and arcs and via colors and line thickness (on a red scale) for activities and arcs.

../_images/Discov028.png

View by cost

Apromore allows displaying cost metrics on top of a process map or a BPMN model in the Process Discoverer.

Note

The Cost overlay functionality can be used only with the logs having the Roles column present while uploading the event log to Apromore.

To check the cost metrics, click on Manage cost centers button placed right above the Abstraction Settings panel.

../_images/Discov030.png

The Cost center management window appears, consisting of Roles, Cost per hour and a Calendar. A Currency can be defined below.

../_images/Discov031.png

Enter the cost values next to each Role

../_images/Discov032.png

We can also assign calendars to each role. For instance, other roles may work Mondays to Fridays while the “Supplier” work Tuesdays to Fridays. Their working schedule affects the costs so we can create and assign respective calendar to roles. To assign a calendar, click the assign calendar button.

../_images/Discov033.png

This brings the calendar management window which displays the list of available calendars. Click on a calendar and click Apply calendar to assign it to the role.

../_images/Discov034.png

We can do this for all the roles.

../_images/Discov035.png

Note

Before assigning a calendar to a role, the calendar must first be created. To learn more about creating calendars, see Manage calendars

We can also choose a currency from the Currency drop-down and click Apply.

../_images/Discov036.png

To see how the Resource costs are distributed among different process activities, click on the Cost overlay available under the Visualization settings section, Overlay subsection placed top left in the Process Discoverer.

../_images/Discov037.png

View rework metrics

Rework is a common type of waste in a business process. The occurrence of rework increases the cycle time and cost of cases in a process.

To help identify and quantify rework in a process, we can display rework metrics in Process Discoverer. There are three kinds of rework metrics in Apromore:

  • Total rework: Total number of times an activity has been repeated after having been executed once in a case.

  • Rework duration: Total duration of the reworked activity instances.

  • Rework cost: Total cost of the reworked activity instances.

To display rework metrics in the Process Discoverer, open an event log from the Portal. In the Overlay section, click Frequency to specify working in the Frequency overlay. Click the dropdown, and select Total rework.

../_images/Rework002.png

The process map now displays the total rework for each task.

../_images/Rework003.png

We can also display the total duration the resources spend on the rework. Click Duration to switch to the duration overlay and select Total rework.

../_images/Rework004.png

The process map displays the duration of the total rework for each task.

../_images/Rework005.png

Similarly, we can display the total rework cost for each task. To display the cost of the total rework, the event log must have a Role attribute, and a cost table must be defined for this event log.

In the Overlay section, click Cost to switch to the cost overlay. Select Total rework from the dropdown.

../_images/Rework006.png

The process map displays the cost of the total rework for each task.

../_images/Rework007.png

View two metrics simultaneously

We can visualise two metrics of different types simultaneously. For example, we can view both Frequency and Duration at the same time by clicking the Discov053. If we are viewing our process map in Frequency mode, we can also view the Duration as a secondary metric by clicking on the Discov053.

../_images/Discov029.png

This can be done for any two metric.

View by different perspectives

Visualizing the handover between activities is not the only way to analyze a process. We may also be interested in assessing if a specific resource or group of resources are overloaded with work. We can use the Perspective dropdown to determine which log attribute will be displayed as the process map.

../_images/View002.png

For example, we can select the Resource perspective to visualize the handover of work among a group of resources. This option will map each resource in our process to a node and connect two nodes if a handover of work occurs between the two nodes.

../_images/View003.png

We can also visualize the flow of work from one role to another. Select the Role perspective.

../_images/View004.png

The Activity, Resource, and Role perspectives are the default perspectives in Apromore Process Discoverer. However, we can have other attributes appear as perspectives. To make an attribute appear as a perspective, select Event Attribute (Perspective) as the column type when importing the log.

For example, we can tag the “Stages” column as a perspective attribute during import, allowing us to display how the process moves across stages.

../_images/View005.png

Stages now appear as a perspective, and we can see the process across different stages.

../_images/View006.png

Similarly, suppose the log represents a process performed across multiple software systems, and we have tagged the System column as a perspective attribute during import. We can switch to the System perspective to see the workflow across multiple systems.

../_images/View007.png

Search activity

We can also search for an activity by using the search bar on the top-right.

../_images/Discov048.png

For ease of view, the search results get highlighted in the process map.

image15

View BPMN model

Whenever the insights deriving from the analysis of a process map are not sufficient, the same functionalities are offered on top of a BPMN model. Changing the view from Process Map to BPMN model will automatically discover a BPMN model from an event log. When visualizing a BPMN model, the slider Parallelism offers the possibility to adjust the amount of parallelism (e.g., AND and OR gateways) discovered by the plugin.

../_images/Discov040.png

Save BPMN Model

A BPMN model can also be saved from Process Discoverer. Click the save icon (first ribbon) in the top left corner. Click Save BPMN model.

../_images/Discov044.png

Note

To save a BPMN model, the BPMN model view must be toggled on.

We will be prompted to enter the name of the model, a short description and the version number. Additionally, we can specify the folder where the BPMN model will be stored. We can also specify whether the BPMN model should include the simulation parameters.

../_images/Discov045.png

Click OK to save.

Save PDF, PNG and JSON

Apromore allows us to download a BPMN model or filtered log in different file formats. We can download the model or log in PDF, PNG or JSON formats. To download a model/log, click the download icon in the top left ribbon.

../_images/Discov046.png

Specify the file format in which the model/log to be downloaded.

../_images/Discov047.png

The file is downloaded in the specified format.