Difference between revisions of "Spatial Pattern Analysis with CartoDB"

From CUOSGwiki
Jump to navigationJump to search
 
(27 intermediate revisions by 2 users not shown)
Line 2: Line 2:
 
For many general GIS users CartoDB may seem complex and very daunting. But the great thing with CartoDB is that being a master at SQL, CSS and databases is not always required for using CartoDB. For more basic use cases it can be used with limited coding experience. It is extremely powerful and versatile with a wide array of use cases, namely data visualization, spatial analysis, and other geospatial applications.
 
For many general GIS users CartoDB may seem complex and very daunting. But the great thing with CartoDB is that being a master at SQL, CSS and databases is not always required for using CartoDB. For more basic use cases it can be used with limited coding experience. It is extremely powerful and versatile with a wide array of use cases, namely data visualization, spatial analysis, and other geospatial applications.
   
In this tutorial it will provide a basic run down of how to set up CartoDB and provide some information on performing spatial analysis in CartoDB.
+
This tutorial will provide a basic run down of how to set up CartoDB and provide some information on performing spatial analysis in CartoDB.
   
 
== Install Guide for Ubuntu 20.04 LTS x64 ==
 
== Install Guide for Ubuntu 20.04 LTS x64 ==
   
In this tutorial it will guide you through the basic process of installing CartoDB on Ubuntu 20.04.
+
This tutorial will guide you through the basic process of installing CartoDB on Ubuntu 20.04.
 
Most will likely install CartoDB inside a VM on a local PC but other commercial options exist.
 
Most will likely install CartoDB inside a VM on a local PC but other commercial options exist.
The creators of CartoDB have a basic, 12 month free trial and paid options. Alternative hosting options are also available from Digital Ocean.
+
The creators of CartoDB have a basic, 12 month [https://carto.com/pricing/ free trial] and paid options. Alternative hosting options are also available from Digital Ocean.
   
 
CartoDB has several dependencies which it uses to run:
 
CartoDB has several dependencies which it uses to run:
Line 31: Line 31:
 
Then install PostgreSQL
 
Then install PostgreSQL
   
<code>sudo apt-get install postgresql-10 \
+
<code>sudo apt-get install postgresql-10 \</code><br>
postgresql-plpython-10 \
+
<code> postgresql-plpython-10 \</code><br>
postgresql-server-dev-10</code>
+
<code> postgresql-server-dev-10</code>
   
 
==== PostgreSQL Configuration ====
 
==== PostgreSQL Configuration ====
Line 44: Line 44:
 
And make sure that the lines appear as below:
 
And make sure that the lines appear as below:
   
<code>local all postgres trust
+
<code>local all postgres trust</code><br>
local all all trust
+
<code>local all all trust</code><br>
host all all 127.0.0.1/32 trust</code>
+
<code>host all all 127.0.0.1/32 trust</code><br>
   
 
Once done restart PostgreSQL:
 
Once done restart PostgreSQL:
Line 54: Line 54:
 
Then add the required CARTO users to PostgreSQL:
 
Then add the required CARTO users to PostgreSQL:
   
<code>sudo createuser publicuser --no-createrole --no-createdb --no-superuser -U postgres
+
<code>sudo createuser publicuser --no-createrole --no-createdb --no-superuser -U postgres</code><br>
sudo createuser tileuser --no-createrole --no-createdb --no-superuser -U postgres</code>
+
<code>sudo createuser tileuser --no-createrole --no-createdb --no-superuser -U postgres</code>
   
 
====Install PostgreSQL Helper Extensions====
 
====Install PostgreSQL Helper Extensions====
 
Finally install some PostgreSQL extensions that expand upon PostgreSQL to work better with the other dependencies.
 
Finally install some PostgreSQL extensions that expand upon PostgreSQL to work better with the other dependencies.
   
<code>git clone https://github.com/CartoDB/cartodb-postgresql.git
+
<code>git clone https://github.com/CartoDB/cartodb-postgresql.git</code><br>
cd cartodb-postgresql
+
<code>cd cartodb-postgresql</code><br>
git checkout <LATEST cartodb-postgresql tag>
+
<code>git checkout <LATEST cartodb-postgresql tag></code><br>
sudo make all install</code>
+
<code>sudo make all install</code>
   
 
===Install GDAL and PostGIS===
 
===Install GDAL and PostGIS===
   
<code>sudo add-apt-repository ppa:cartodb/gis && sudo apt-get update
+
<code>sudo add-apt-repository ppa:cartodb/gis && sudo apt-get update</code><br>
sudo apt-get install gdal-bin libgdal-dev postgis</code>
+
<code>sudo apt-get install gdal-bin libgdal-dev postgis</code>
   
Configure PostGIS Database:
+
Configure PostGIS Database:<br>
<code>sudo createdb -T template0 -O postgres -U postgres -E UTF8 template_postgis
+
<code>sudo createdb -T template0 -O postgres -U postgres -E UTF8 template_postgis</code><br>
psql -U postgres template_postgis -c 'CREATE EXTENSION postgis;CREATE EXTENSION postgis_topology;'
+
<code>psql -U postgres template_postgis -c 'CREATE EXTENSION postgis;CREATE EXTENSION postgis_topology;'</code><br>
sudo ldconfig</code>
+
<code>sudo ldconfig</code>
   
 
===Install Redis===
 
===Install Redis===
<code>sudo add-apt-repository ppa:cartodb/redis-next && sudo apt-get update
+
<code>sudo add-apt-repository ppa:cartodb/redis-next && sudo apt-get update</code><br>
sudo apt-get install redis</code>
+
<code>sudo apt-get install redis</code>
   
 
===Install Node.js===
 
===Install Node.js===
<code>curl -sL https://deb.nodesource.com/setup_10.x | sudo -E bash -
+
<code>curl -sL https://deb.nodesource.com/setup_10.x | sudo -E bash -</code><br>
sudo apt-get install -y nodejs</code>
+
<code>sudo apt-get install -y nodejs</code>
   
Some more dependencies:
+
Some more dependencies:<br>
 
<code>sudo apt-get install libpixman-1-0 libpixman-1-dev libcairo2-dev libjpeg-dev libgif-dev libpango1.0-dev</code>
 
<code>sudo apt-get install libpixman-1-0 libpixman-1-dev libcairo2-dev libjpeg-dev libgif-dev libpango1.0-dev</code>
   
 
I promise we're getting close to the end.
 
I promise we're getting close to the end.
  +
 
===Install SQL API===
 
===Install SQL API===
<code>git clone git://github.com/CartoDB/CartoDB-SQL-API.git
+
<code>git clone git://github.com/CartoDB/CartoDB-SQL-API.git</code><br>
cd CartoDB-SQL-API
+
<code>cd CartoDB-SQL-API</code><br>
npm install</code>
+
<code>npm install</code>
   
Configure our local development environment and then Start Node.js
+
Configure our local development environment and then Start Node.js:<br>
<code>cp config/environments/development.js.example config/environments/development.js
+
<code>cp config/environments/development.js.example config/environments/development.js</code><br>
node app.js development</code>
+
<code>node app.js development</code>
   
 
===Install MAPS API===
 
===Install MAPS API===
<code>git clone git://github.com/CartoDB/Windshaft-cartodb.git
+
<code>git clone git://github.com/CartoDB/Windshaft-cartodb.git</code><br>
cd Windshaft-cartodb
+
<code>cd Windshaft-cartodb</code><br>
npm install</code>
+
<code>npm install</code>
   
Then configure the API and start it:
+
Then configure the API and start it:<br>
<code>cp config/environments/development.js.example config/environments/development.js
+
<code>cp config/environments/development.js.example config/environments/development.js</code><br>
mkdir logs
+
<code>mkdir logs</code><br>
node app.js development</code>
+
<code>node app.js development</code>
   
 
===Install Ruby===
 
===Install Ruby===
*No not the mineral type*
+
*No not the mineral type<br>
   
<code>sudo apt-add-repository ppa:brightbox/ruby-ng && sudo apt-get update
+
<code>sudo apt-add-repository ppa:brightbox/ruby-ng && sudo apt-get update</code><br>
sudo apt-get install ruby2.4 ruby2.4-dev ruby-bundler
+
<code>sudo apt-get install ruby2.4 ruby2.4-dev ruby-bundler</code><br>
sudo gem install compass</code>
+
<code>sudo gem install compass</code>
   
 
===And Finally Install Builder and CartoDB===
 
===And Finally Install Builder and CartoDB===
   
Download and install CartoDB with additonal dependencies:
+
Download and install CartoDB with additonal dependencies:<br>
<code>git clone --recursive https://github.com/CartoDB/cartodb.git
+
<code>git clone --recursive https://github.com/CartoDB/cartodb.git</code><br>
cd cartodb
+
<code>cd cartodb</code><br>
sudo apt-get install python-pip imagemagick unp zip libicu-dev
+
<code>sudo apt-get install python-pip imagemagick unp zip libicu-dev</code><br>
RAILS_ENV=development bundle install
+
<code>RAILS_ENV=development bundle install</code><br>
sudo pip install --no-use-wheel -r python_requirements.txt</code>
+
<code>sudo pip install --no-use-wheel -r python_requirements.txt</code>
   
Finish up:
+
Finish up:<br>
<code>npm install
+
<code>npm install</code><br>
npm run carto-node && npm run build:static</code>
+
<code>npm run carto-node && npm run build:static</code>
   
Configure CartoDB:
+
Configure CartoDB:<br>
<code>cp config/app_config.yml.sample config/app_config.yml
+
<code>cp config/app_config.yml.sample config/app_config.yml</code><br>
cp config/database.yml.sample config/database.yml</code>
+
<code>cp config/database.yml.sample config/database.yml</code>
   
Start remaining services and initialize database:
+
Start remaining services and initialize database:<br>
<code>sudo systemctl start redis-server
+
<code>sudo systemctl start redis-server</code><br>
RAILS_ENV=development bundle exec rake db:create
+
<code>RAILS_ENV=development bundle exec rake db:create</code><br>
RAILS_ENV=development bundle exec rake db:migrate
+
<code>RAILS_ENV=development bundle exec rake db:migrate</code><br>
RAILS_ENV=development bundle exec rails server
+
<code>RAILS_ENV=development bundle exec rails server</code>
   
And finally close that console and then in a new one run:
+
And finally close that console and then in a new one run:<br>
 
<code>RAILS_ENV=development bundle exec ./script/resque</code>
 
<code>RAILS_ENV=development bundle exec ./script/resque</code>
   
 
===First Time CartoDB Run===
 
===First Time CartoDB Run===
   
Create user account and development environment:
+
Create user account and development environment:<br>
<code>cd cartodb
+
<code>cd cartodb</code><br>
export SUBDOMAIN=development
+
<code>export SUBDOMAIN=development</code>
   
# Add entries to /etc/hosts in development
+
<code># Add entries to /etc/hosts in development</code><br>
echo "127.0.0.1 ${SUBDOMAIN}.localhost.lan" | sudo tee -a /etc/hosts
+
<code>echo "127.0.0.1 ${SUBDOMAIN}.localhost.lan" | sudo tee -a /etc/hosts</code>
   
# Create the development user
+
<code># Create the development user</code><br>
sh script/create_dev_user</code>
+
<code>sh script/create_dev_user</code>
   
Run the remaining processes:
+
Run the remaining processes:<br>
<code>bundle exec script/resque
+
<code>bundle exec script/resque</code><br>
bundle exec thin start --threaded -p 3000 --threadpool-size 5
+
<code>bundle exec thin start --threaded -p 3000 --threadpool-size 5</code><br>
cd cartodb-sql-api && node app.js
+
<code>cd cartodb-sql-api && node app.js</code><br>
cd windshaft-cartodb && node app.js</code>
+
<code>cd windshaft-cartodb && node app.js</code>
   
That's it!
+
That's it!<br>
 
Go to http://<mysubdomain>.localhost.lan:3000
 
Go to http://<mysubdomain>.localhost.lan:3000
 
And enter your login created using the password entered above.
 
And enter your login created using the password entered above.
   
 
== User Interface ==
 
== User Interface ==
  +
Now that CartoDB is intstalled there are several ways to begin using it.<br>
  +
The three most popular ones that many people use are directly throught the web builder interface online, through Jupiter Notebook or with the PythonSDK.
   
===How set up to a personal server?===
+
===How do you upload data?===
 
Importing data is seamless and easy. The 'new table' icon with the plus sign will open a dialog box with options to:
  +
* paste a url or select a file (e.x. shapefiles),
  +
* Online cloud storage such as dropbox or google drive data,
  +
* or create a new table from scratch or other database services
   
 
===How do you explore added data?===
Digital Ocean is the chosen host of the private server.
 
 
Each dataset is known as a table that corresponds to a spreadsheet that can be visualized on a map. Examination or modification of data are performed by SQL statements or using the GUI.
   
  +
===How do you map data? How do you visualize data? A short note on styling, the visualization wizard, base maps, and labels.===
===How to upload data?===
 
 
Cartodb provides full control over styling of map using SQL, Python and CSS. Visualization wizards provide options to visualize data with simple, choropleth, category, bubble, intensity, density and a animated data categorization called torque. Torque, simple and category are used in this project. Column labels are toggled on for each row to provide the user information about name and location for incubators and bixi docks with the addition of how many bikes are available at each location. There are support from base maps from Google, CartoDB, Mapbox, WMS, XYZ and more. CartoDB Dark is used in this project.
Importing data is seamless and easy. The 'new table' icon with the plus sign will open a dialog box with options to 1) paste a url or select a file, 2) dropbox, 3) google drive data as well as a star 4) a new table from scratch.
 
   
  +
Many great examples of [https://carto.com/spatial-data-catalog/ projects] created with CartoDB and further [https://vimeo.com/channels/carto video tutorials] or [https://carto.com/help/ help centers] exist online.
===How to explore data?===
 
Each dataset is known as a table that corresponds to a spreadsheet that can be visualized on a map. Examination or modification of data are performed by SQL statements.
 
 
===How to map data? Styling, Visualization Wizard, Base maps, Labels===
 
Cartodb provides full control over styling of map using SQL and CSS. Visualization wizards provide options to visualize data with simple, choropleth, category, bubble, intensity, density and a animated data categorization called torque. Torque, simple and category are used in this project. Column labels are toggled on for each row to provide the user information about name and location for incubators and bixi docks with the addition of how many bikes are available at each location. There are support from base maps from Google, CartoDB, Mapbox, WMS, XYZ and more. CartoDB Dark is used in this project.
 
 
Category, Density, Torque
 
   
 
===How to share the map?===
 
===How to share the map?===
Line 187: Line 189:
   
 
== Data Analysis ==
 
== Data Analysis ==
Cartodb has full support for both vector and raster data.
+
Cartodb has full support for both vector and raster data. The example below takes advantage of a short SQL script.
   
 
Example: SQL statement is derived from Andrew Hill: https://gist.github.com/andrewxhill/5979532.
 
Example: SQL statement is derived from Andrew Hill: https://gist.github.com/andrewxhill/5979532.
Line 216: Line 218:
   
 
== References ==
 
== References ==
  +
CartoDB Developers Page: https://carto.com/developers/<br>
 
  +
CartoDB Official Documentation: https://cartodb.readthedocs.io/en/latest/install.html<br>
 
VirtualBox https://www.virtualbox.org/ <br>
 
VirtualBox https://www.virtualbox.org/ <br>
 
Old Ubuntu Releases http://old-releases.ubuntu.com/releases/ <br>
 
Old Ubuntu Releases http://old-releases.ubuntu.com/releases/ <br>

Latest revision as of 15:17, 19 October 2020

Introduction to CartoDB

For many general GIS users CartoDB may seem complex and very daunting. But the great thing with CartoDB is that being a master at SQL, CSS and databases is not always required for using CartoDB. For more basic use cases it can be used with limited coding experience. It is extremely powerful and versatile with a wide array of use cases, namely data visualization, spatial analysis, and other geospatial applications.

This tutorial will provide a basic run down of how to set up CartoDB and provide some information on performing spatial analysis in CartoDB.

Install Guide for Ubuntu 20.04 LTS x64

This tutorial will guide you through the basic process of installing CartoDB on Ubuntu 20.04. Most will likely install CartoDB inside a VM on a local PC but other commercial options exist. The creators of CartoDB have a basic, 12 month free trial and paid options. Alternative hosting options are also available from Digital Ocean.

CartoDB has several dependencies which it uses to run:

  • PostgreSQL
  • PostGIS
  • Redis
  • CARTO PostgreSQL extensions
  • CARTO Builder
  • Maps API
  • SQL API

Install GIT

This tutorial requires the use of git for the installation of some packages.

sudo apt-get install git

Install PostgreSQL

Add the custom PostgreSQL repository for CartoDB. The reason for the custom package can be found here.

sudo add-apt-repository ppa:cartodb/postgresql-10 && sudo apt-get update

Then install PostgreSQL

sudo apt-get install postgresql-10 \
postgresql-plpython-10 \
postgresql-server-dev-10

PostgreSQL Configuration

To make this installation easier, and because we are installing within a local environment we can make authentication less secure.

Simply run:

sudo nano etc/postgresql/10/main/pg_hba.conf

And make sure that the lines appear as below:

local all postgres trust
local all all trust
host all all 127.0.0.1/32 trust

Once done restart PostgreSQL:

sudo systemctl restart postgresql

Then add the required CARTO users to PostgreSQL:

sudo createuser publicuser --no-createrole --no-createdb --no-superuser -U postgres
sudo createuser tileuser --no-createrole --no-createdb --no-superuser -U postgres

Install PostgreSQL Helper Extensions

Finally install some PostgreSQL extensions that expand upon PostgreSQL to work better with the other dependencies.

git clone https://github.com/CartoDB/cartodb-postgresql.git
cd cartodb-postgresql
git checkout <LATEST cartodb-postgresql tag>
sudo make all install

Install GDAL and PostGIS

sudo add-apt-repository ppa:cartodb/gis && sudo apt-get update
sudo apt-get install gdal-bin libgdal-dev postgis

Configure PostGIS Database:
sudo createdb -T template0 -O postgres -U postgres -E UTF8 template_postgis
psql -U postgres template_postgis -c 'CREATE EXTENSION postgis;CREATE EXTENSION postgis_topology;'
sudo ldconfig

Install Redis

sudo add-apt-repository ppa:cartodb/redis-next && sudo apt-get update
sudo apt-get install redis

Install Node.js

curl -sL https://deb.nodesource.com/setup_10.x | sudo -E bash -
sudo apt-get install -y nodejs

Some more dependencies:
sudo apt-get install libpixman-1-0 libpixman-1-dev libcairo2-dev libjpeg-dev libgif-dev libpango1.0-dev

I promise we're getting close to the end.

Install SQL API

git clone git://github.com/CartoDB/CartoDB-SQL-API.git
cd CartoDB-SQL-API
npm install

Configure our local development environment and then Start Node.js:
cp config/environments/development.js.example config/environments/development.js
node app.js development

Install MAPS API

git clone git://github.com/CartoDB/Windshaft-cartodb.git
cd Windshaft-cartodb
npm install

Then configure the API and start it:
cp config/environments/development.js.example config/environments/development.js
mkdir logs
node app.js development

Install Ruby

  • No not the mineral type

sudo apt-add-repository ppa:brightbox/ruby-ng && sudo apt-get update
sudo apt-get install ruby2.4 ruby2.4-dev ruby-bundler
sudo gem install compass

And Finally Install Builder and CartoDB

Download and install CartoDB with additonal dependencies:
git clone --recursive https://github.com/CartoDB/cartodb.git
cd cartodb
sudo apt-get install python-pip imagemagick unp zip libicu-dev
RAILS_ENV=development bundle install
sudo pip install --no-use-wheel -r python_requirements.txt

Finish up:
npm install
npm run carto-node && npm run build:static

Configure CartoDB:
cp config/app_config.yml.sample config/app_config.yml
cp config/database.yml.sample config/database.yml

Start remaining services and initialize database:
sudo systemctl start redis-server
RAILS_ENV=development bundle exec rake db:create
RAILS_ENV=development bundle exec rake db:migrate
RAILS_ENV=development bundle exec rails server

And finally close that console and then in a new one run:
RAILS_ENV=development bundle exec ./script/resque

First Time CartoDB Run

Create user account and development environment:
cd cartodb
export SUBDOMAIN=development

# Add entries to /etc/hosts in development
echo "127.0.0.1 ${SUBDOMAIN}.localhost.lan" | sudo tee -a /etc/hosts

# Create the development user
sh script/create_dev_user

Run the remaining processes:
bundle exec script/resque
bundle exec thin start --threaded -p 3000 --threadpool-size 5
cd cartodb-sql-api && node app.js
cd windshaft-cartodb && node app.js

That's it!
Go to http://<mysubdomain>.localhost.lan:3000 And enter your login created using the password entered above.

User Interface

Now that CartoDB is intstalled there are several ways to begin using it.
The three most popular ones that many people use are directly throught the web builder interface online, through Jupiter Notebook or with the PythonSDK.

How do you upload data?

Importing data is seamless and easy. The 'new table' icon with the plus sign will open a dialog box with options to:

  • paste a url or select a file (e.x. shapefiles),
  • Online cloud storage such as dropbox or google drive data,
  • or create a new table from scratch or other database services

How do you explore added data?

Each dataset is known as a table that corresponds to a spreadsheet that can be visualized on a map. Examination or modification of data are performed by SQL statements or using the GUI.

How do you map data? How do you visualize data? A short note on styling, the visualization wizard, base maps, and labels.

Cartodb provides full control over styling of map using SQL, Python and CSS. Visualization wizards provide options to visualize data with simple, choropleth, category, bubble, intensity, density and a animated data categorization called torque. Torque, simple and category are used in this project. Column labels are toggled on for each row to provide the user information about name and location for incubators and bixi docks with the addition of how many bikes are available at each location. There are support from base maps from Google, CartoDB, Mapbox, WMS, XYZ and more. CartoDB Dark is used in this project.

Many great examples of projects created with CartoDB and further video tutorials or help centers exist online.

How to share the map?

Maps are shared from the options menu in the top right, by link, embed or as an api.

Dataset

Bicycle Stations (Bixi) and Business Incubators from the City of Toronto open data initiative are the two data sets utilized in this project. As of this writing, business incubators is no longer offered. Bixi dataset had been preprocessed to include the latitude and longitude derived from address.

Data Analysis

Cartodb has full support for both vector and raster data. The example below takes advantage of a short SQL script.

Example: SQL statement is derived from Andrew Hill: https://gist.github.com/andrewxhill/5979532.

 SELECT
 ST_MakeLine( --This function can take two or more points and make a line
  the_geom_webmercator, --We select the_geom_webmercator, since CartoDB will need it to draw your maps
  ( --This is a nested query that will run for every row in our outer query
    SELECT the_geom_webmercator FROM plout10 -- Here we select the geometry from our second dataset
    ORDER BY the_geom <-> c.the_geom -- We then order it by its distance to the geometry in the first dataset (c.the_geom)
    LIMIT 1  -- And limit it to just 1, i.e. we find just the closest
    )
    ) the_geom_webmercator -- Here we alias the result to a column we call, 'the_geom_webmercator', so that CartoDB will draw it
  FROM citibike_stations c -- Here we alias our table to 'c' so we can type it nicely above :)


The following SQL statement is used to visualize the closest Biki locations to an incubator. These statement utilizes PostGIS and PostgresSQL functions.

 SELECT 
 ST_MakeLine(the_geom_webmercator, 
   ( 
     SELECT the_geom_webmercator FROM business_incubators_toronto 
     ORDER BY the_geom <-> c.the_geom
     LIMIT 1  
   )
  ) the_geom_webmercator
  FROM station c

References

CartoDB Developers Page: https://carto.com/developers/
CartoDB Official Documentation: https://cartodb.readthedocs.io/en/latest/install.html
VirtualBox https://www.virtualbox.org/
Old Ubuntu Releases http://old-releases.ubuntu.com/releases/
Digital Ocean https://www.digitalocean.com/
CartoDB Repository https://github.com/CartoDB/cartodb
PostGIS Reference http://postgis.net/docs/reference.html
PostgreSQL Tutorial http://www.postgresql.org/docs/9.1/static/tutorial.html
Lord Linus's RVM tutorial https://github.com/lordlinus/cartodb
Michael Schmid's suggestions https://groups.google.com/forum/#!topic/cartodb/o5_cVk-owe0
Andrew Hill's SQL query https://gist.github.com/andrewxhill/5979532
Open Data Toronto http://www1.toronto.ca/wps/portal/contentonly?vgnextoid=1a66e03bb8d1e310VgnVCM10000071d60f89RCRD
Bixi http://www1.toronto.ca/wps/portal/contentonly?vgnextoid=ad3cb6b6ae92b310VgnVCM10000071d60f89RCRD&vgnextchannel=1a66e03bb8d1e310VgnVCM10000071d60f89RCRD