Part 1 out of 4
In the roughly twenty years that Tethys-based GAStech has been operating a natural gas production site in the island country of Kronos, it has produced remarkable profits and developed strong relationships with the government of Kronos. However, GAStech has not been as successful in demonstrating environmental stewardship.
In January, 2014, the leaders of GAStech are celebrating their new-found fortune as a result of the initial public offering of their very successful company. In the midst of this celebration, several employees of GAStech go missing. An organization known as the Protectors of Kronos (POK) is suspected in the disappearance, but things may not be what they seem.
Both historical vehicle tracking data and transaction data from loyalty and credit card will be used to observe the following issues:
The data source are available publicly on VAST Challenge 2021 website under the sub section Mini-Challenge 2. The data used for the project are as follows:
library("tidyverse")
library("readxl")
emp_records <- read_excel("datasets/EmployeeRecords.xlsx")
car <- read_csv("datasets/car-assignments.csv")
cc <- read_csv("datasets/cc_data.csv")
gps <- read_csv("datasets/gps.csv")
loyalty <- read_csv("datasets/loyalty_data.csv")
LastName | FirstName | BirthDate | BirthCountry | Gender |
---|---|---|---|---|
Bramar | Mat | 1981-12-19 | Tethys | Male |
Ribera | Anda | 1975-11-17 | Tethys | Female |
Pantanal | Rachel | 1984-08-22 | Tethys | Female |
Lagos | Linda | 1980-01-26 | Tethys | Female |
Mies Haber | Ruscella | 1964-04-26 | Kronos | Female |
Forluniau | Carla | 1981-06-02 | Kronos | Female |
LastName | FirstName | CarID | CurrentEmploymentType | CurrentEmploymentTitle |
---|---|---|---|---|
Calixto | Nils | 1 | Information Technology | IT Helpdesk |
Azada | Lars | 2 | Engineering | Engineer |
Balas | Felix | 3 | Engineering | Engineer |
Barranco | Ingrid | 4 | Executive | SVP/CFO |
Baza | Isak | 5 | Information Technology | IT Technician |
Bergen | Linnea | 6 | Information Technology | IT Group Manager |
timestamp | location | price | last4ccnum |
---|---|---|---|
1/6/2014 7:28 | Brew’ve Been Served | 11.34 | 4795 |
1/6/2014 7:34 | Hallowed Grounds | 52.22 | 7108 |
1/6/2014 7:35 | Brew’ve Been Served | 8.33 | 6816 |
1/6/2014 7:36 | Hallowed Grounds | 16.72 | 9617 |
1/6/2014 7:37 | Brew’ve Been Served | 4.24 | 7384 |
1/6/2014 7:38 | Brew’ve Been Served | 4.17 | 5368 |
Timestamp | id | lat | long |
---|---|---|---|
01/06/2014 06:28:01 | 35 | 36.07623 | 24.87469 |
01/06/2014 06:28:01 | 35 | 36.07622 | 24.87460 |
01/06/2014 06:28:03 | 35 | 36.07621 | 24.87444 |
01/06/2014 06:28:05 | 35 | 36.07622 | 24.87425 |
01/06/2014 06:28:06 | 35 | 36.07621 | 24.87417 |
01/06/2014 06:28:07 | 35 | 36.07619 | 24.87406 |
timestamp | location | price | loyaltynum |
---|---|---|---|
01/06/2014 | Brew’ve Been Served | 4.17 | L2247 |
01/06/2014 | Brew’ve Been Served | 9.60 | L9406 |
01/06/2014 | Hallowed Grounds | 16.53 | L8328 |
01/06/2014 | Coffee Shack | 11.51 | L6417 |
01/06/2014 | Hallowed Grounds | 12.93 | L1107 |
01/06/2014 | Brew’ve Been Served | 4.27 | L4034 |
4.1 Past MITB Visual Analytics project were reviewed and evaluated prior to the assignment.
4.2 The solutions submitted for VAST challenge 2014 were also reviewed on their repository webpage(“VAST Challenge 2014:MC2 - Patterns of Life Analysis” 2014).
Submission entry from the University of Buenos Aires - Tralice (Villordo et al. 2014) utilised a multi-layered horizontal bar graph that showed the GPS movement for each employment type. The background highlight to indicate the weekend provided a good contrast and representation for the differentiation between weekdays and weekends.
Submission entry from KU Leuven (Chua et al. 2014) used a boxplot to visualise the credit card spending price at each location. Boxplot allows for distinct and clear visualisation of outliers in the transaction price. However, boxplot also provides informative details such as the median, 25 and 75 percentile price for each location which was not reflected in the boxplot. Furthermore, the 10,000 dollars outliers caused the y-axis tick marks to be large and each individual boxplot became too small on the plot.
Submission entry from the University of Calgary (Sahaf et al. 2014) utilised parallel coordinate plot to show the interaction and relationship between different categorical and numerical variables. The visualisation provides story telling insights between the different variables.
Most past submission utilised map and overlay with GPS lines and points to show the movement of each car. I would like to highlight the submission from Central South University (Zhao et al. 2014) where the map utilised different colors for lines and dots to present their findings. The variation in colors allowed for better visualisation and clarity of the different employees information to be highlighted to gain insights. However, due to the overlap of GPS data such as the location and GPS lines, an interactive map with tooltip will allow for better interpretation of the findings.
Submission from Fraunhofer IAIS and City University London (Andrienko, Andrienko, and Fuchs 2014) and RBEI-Bangalore (Singhal et al. 2014) both used network cluster and analysis to investigate the relationships between GAStech employees. Fraunhofer IAIS university used an ego-centric graph whereas RBEI used a combination of fragmented and node-only layout to visual the relationship by connecting employees. Network analysis is an informative visualisation that provides an overview of potential relationships between employees or even connecting employees to different mediums such as the locations or emails.
Submission from University of Buenos Aires - Alcoser (Flores, Lopez, and Forero 2014) used the sankey diagrams to visualise the locations where employees frequently visits. Sankey diagram shows how the quantities flow from one state to another and is usually used to show flows or processes.
Submission from University of Bueons Aires - Croceri (Croceri and Guzzi 2014) used a scatter plot to show the distance average speed against the speed for each employees route. The visualisation displayed extreme outliers effectively based on the car speed.
The various use cases and visualisation techniques were all reviewed and evaluated to integrate into the investigation works for the report.
For attribution, please cite this work as
Lim (2021, July 23). Yong Kai: Assignment: VAST Mini-Challenge 2. Retrieved from https://limyongkai.netlify.app/posts/2021-07-23-vastmc2part1/
BibTeX citation
@misc{lim2021assignment:, author = {Lim, Yong Kai}, title = {Yong Kai: Assignment: VAST Mini-Challenge 2}, url = {https://limyongkai.netlify.app/posts/2021-07-23-vastmc2part1/}, year = {2021} }