Difference between revisions of "Exploring Clustering In QGIS"

From CUOSGwiki
Jump to navigationJump to search
(Added data)
Line 1: Line 1:
== Purpose ==
+
= Purpose =
   
 
The purpose of this project is to explore the capabilities of open source software such as QGIS. QGIS is one of the many open source softwares we have used in GEOM 4008. In this tutorial which I am writing in 2021, I will be going into vector analysis tools in QGIS which are used to create clusters. I will go step by step going through the Density Based and K-means methods of clustering and I will give the results of my findings.
 
The purpose of this project is to explore the capabilities of open source software such as QGIS. QGIS is one of the many open source softwares we have used in GEOM 4008. In this tutorial which I am writing in 2021, I will be going into vector analysis tools in QGIS which are used to create clusters. I will go step by step going through the Density Based and K-means methods of clustering and I will give the results of my findings.
   
== Introduction ==
+
= Introduction =
   
 
The use of clusters is a practical tool in GIS, it can help group vector data points into separate clusters or groups which the amount is fully configurable by the user. This can be helpful if you have a vast area and you want to divide a task into multiple areas and divide and conquer to accomplish a certain goal. The [https://pro.arcgis.com/en/pro-app/2.8/tool-reference/spatial-statistics/an-overview-of-the-mapping-clusters-toolset.htm option] is given in ArcPro and while it may have a bigger collection of clustering options, QGIS still has clustering capabilities which are DBSCAN(density based) and K-Means. We will be using the newest version of [https://qgis.org/en/site/forusers/download.html QGIS] at the time of tutorial, version 3.22. The most important part for the start of the project is downloading the latest version, getting the proper data and proper projections which I will go step by step in detailing below. The scope of my project will be going as a City of Ottawa official that wants to investigate incidents(vehicle, bicycle, etc...) and he feels the city is too big and the goal of this tutorial will be on how we can designate multiple smaller areas for him to work with.
 
The use of clusters is a practical tool in GIS, it can help group vector data points into separate clusters or groups which the amount is fully configurable by the user. This can be helpful if you have a vast area and you want to divide a task into multiple areas and divide and conquer to accomplish a certain goal. The [https://pro.arcgis.com/en/pro-app/2.8/tool-reference/spatial-statistics/an-overview-of-the-mapping-clusters-toolset.htm option] is given in ArcPro and while it may have a bigger collection of clustering options, QGIS still has clustering capabilities which are DBSCAN(density based) and K-Means. We will be using the newest version of [https://qgis.org/en/site/forusers/download.html QGIS] at the time of tutorial, version 3.22. The most important part for the start of the project is downloading the latest version, getting the proper data and proper projections which I will go step by step in detailing below. The scope of my project will be going as a City of Ottawa official that wants to investigate incidents(vehicle, bicycle, etc...) and he feels the city is too big and the goal of this tutorial will be on how we can designate multiple smaller areas for him to work with.
   
== Data ==
+
= Data =
   
 
The data that we will be using will be in two forms. Referencing data so we can visualize where we are and data we will use for the clustering.
 
The data that we will be using will be in two forms. Referencing data so we can visualize where we are and data we will use for the clustering.
   
=Referencing=
+
==Referencing==
   
 
*The [https://open.ottawa.ca/datasets/road-centrelines/explore Roads] dataset
 
*The [https://open.ottawa.ca/datasets/road-centrelines/explore Roads] dataset
 
*The [https://open.ottawa.ca/datasets/wards/explore?location=45.249481%2C-75.800844%2C1.20 Wards] dataset
 
*The [https://open.ottawa.ca/datasets/wards/explore?location=45.249481%2C-75.800844%2C1.20 Wards] dataset
   
=Clustering=
+
==Clustering==
   
 
*The [https://open.ottawa.ca/datasets/traffic-collisions-by-location-2013/explore Traffic Collisions by Location in 2013] dataset
 
*The [https://open.ottawa.ca/datasets/traffic-collisions-by-location-2013/explore Traffic Collisions by Location in 2013] dataset

Revision as of 09:15, 8 December 2021

Purpose

The purpose of this project is to explore the capabilities of open source software such as QGIS. QGIS is one of the many open source softwares we have used in GEOM 4008. In this tutorial which I am writing in 2021, I will be going into vector analysis tools in QGIS which are used to create clusters. I will go step by step going through the Density Based and K-means methods of clustering and I will give the results of my findings.

Introduction

The use of clusters is a practical tool in GIS, it can help group vector data points into separate clusters or groups which the amount is fully configurable by the user. This can be helpful if you have a vast area and you want to divide a task into multiple areas and divide and conquer to accomplish a certain goal. The option is given in ArcPro and while it may have a bigger collection of clustering options, QGIS still has clustering capabilities which are DBSCAN(density based) and K-Means. We will be using the newest version of QGIS at the time of tutorial, version 3.22. The most important part for the start of the project is downloading the latest version, getting the proper data and proper projections which I will go step by step in detailing below. The scope of my project will be going as a City of Ottawa official that wants to investigate incidents(vehicle, bicycle, etc...) and he feels the city is too big and the goal of this tutorial will be on how we can designate multiple smaller areas for him to work with.

Data

The data that we will be using will be in two forms. Referencing data so we can visualize where we are and data we will use for the clustering.

Referencing

Clustering

Acquiring QGIS (version 3.20.3)

Set up the Environment

Add vector data

Projection

Symbology