Learning R for Geospatial Analysis

0关注
6粉丝

已卖：1556份资源

副教授

31%

还不是VIP/贵宾

-

TA的文库 其他...

Complex Data Analysis

东西方金融数据分析

eBook with Data and Code

0%

威望: 0 级
论坛币: 11779 个
通用积分: 2.3950
学术水平: 119 点
热心指数: 115 点
信用等级: 114 点
经验: 8940 点
帖子: 173
精华: 10
在线时间: 30 小时
注册时间: 2006-9-19
最后登录: 2022-11-3

楼主

NewOccidental 发表于 2015-9-4 09:09:18 |AI写论文

是否 +2 论坛币

k人参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群

赵安豆老师微信：zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

立即领取

感谢您参与论坛问题回答

经管之家送您两个论坛币！

+2 论坛币

Book Description
R is a simple, effective, and comprehensive programming language and environment that is gaining ever-increasing popularity among data analysts.
This book provides you with the necessary skills to successfully carry out complete geospatial data analyses, from data import to presentation of results.
Learning R for Geospatial Analysis is composed of step-by-step tutorials, starting with the language basics before proceeding to cover the main GIS operations and data types. Visualization of spatial data is vital either during the various analysis steps and/or as the final product, and this book shows you how to get the most out of R's visualization capabilities. The book culminates with examples of cutting-edge applications utilizing R's strengths as a statistical and graphical tool.
Book Details
Publisher: Packt Publishing
By: Michael Dorman
ISBN: 978-1-78398-436-7
Year: 2014
Pages: 364
Language: English
File size: 10.2 MB
File format: PDF

复制代码

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

分享0 收藏1 回帖

关键词：Learning Analysis Spatial Analysi earning popularity necessary effective provides starting

本帖被以下文库推荐

· 東西方精品圖書|主题: 896, 订阅: 110
· R资源总汇 |主题: 1226, 订阅: 404
· Spatial Data Analysis|主题: 299, 订阅: 64

沙发

ReneeBK 发表于 2015-9-4 09:13:52

Creating a table from separate vectors
The data.frame class is the basic class to represent tabular data in R. A data.frame object is essentially a collection of vectors, all with the same length. However, the vectors do not have to be of the same type. They may also include one-dimensional objects that are not strictly vectors, such as Date or factor objects (see the previous chapter). Therefore, data.frame objects are particularly suitable to represent data with different variables in columns and different cases in rows. Thus, variables may be of different types; for example, a table storing climatic data may have one character variable to store meteorological station names, another Date variable to represent measurement dates, and a third numeric variable to represent the measured values such as rainfall amounts or temperatures.
One way to create a data.frame object is to combine several vectors that are already present in the R environment. This can be achieved with the data.frame function with the arguments being the names of the vector objects we would like to combine. Let's take a look at the following examples:
> num = 1:4
> lower = c("a","b","c","d")
> upper = c("A","B","C","D")
> df = data.frame(num, lower, upper)
> df
num lower upper
1 1 a A
2 2 b B
3 3 c C
4 4 d D
Here, we created a data.frame object named df by combining the vectors num, lower, and upper. The previously independent vectors now comprise columns in df. As we can see, the names of the columns appear on the first line of the printed output of a data.frame object. These are the names of the original vectors, num, lower, and upper. Rows have names as well; these are automatically assigned with the characters 1, 2, 3, and 4 (as it appears to the left of the first column in the printed output).

复制代码

藤椅

ReneeBK 发表于 2015-9-4 09:14:54

Creating a table from a CSV file
Another common method to create a data.frame object is to read tabular data from the disk. For example, we can read a CSV file using the read.csv function (which was briefly mentioned earlier). The first parameter of this function, and the one with no defaults, is a file indicating the path to the CSV file. For example, the following expression reads the contents of the 343452.csv file and assigns it to a data.frame object called dat (remember that directories should be separated with \\ or /):
> dat = read.csv("C:\\Data\\343452.csv")
The 343452.csv file contains monthly records of precipitation, minimum temperature, and maximum temperature from Spain for a period of 30 years. It was downloaded from the NOAA climatic archive and provided as is. Since we will use data from this file in several of our examples, in this and the upcoming chapters, let's examine its contents. Because the table is very large, to see what it looks like, we can print only the first several rows with the head function, as follows (similarly, with the tail function, we can print the several last rows):
> head(dat)

复制代码

板凳

ReneeBK 发表于 2015-9-4 09:16:37

Examining the structure of a data.frame object
We can get the number of rows and columns in our data.frame object using the nrow and ncol functions, respectively. For example, our small table df has four rows and three columns, while dat (containing the monthly climatic data) has 28,536 rows and nine columns:
> nrow(df)
[1] 4
> ncol(df)
[1] 3
> nrow(dat)
[1] 28536
> ncol(dat)
[1] 9
We can, if the table is not too long, print the table's contents and see how many columns (or rows) are there, according to the row names. However, it is generally advisable to get the properties of an object using functions (such as ncol), rather than typing a specific number manually (such as 9). This way, our code is going to be transferable to an analysis of any object and not just the specific object we are currently working on.
We can get the lengths of both row and column dimensions using the dim function. If our argument is a data.frame object (we will see later that the dim function works with other classes as well; such a function is called a generic function in R terminology), a vector of length 2 is returned with the first element being the number of rows and the second being the number of columns, as follows:
> dim(dat)
[1] 28536 9
We can also get the names of the rows and columns (getting column names is often more useful) as a character vector using the functions rownames and colnames, as shown in the following example:
> colnames(dat)
[1] "STATION" "STATION_NAME" "ELEVATION" "LATITUDE"
[5] "LONGITUDE" "DATE" "TPCP" "MMXT"
[9] "MMNT"
Assignment into column names can be made to replace the existing names with new ones. For example, to change the name of the third column from ELEVATION to Elev, we can use the colnames(dat)[3]="Elev" expression. Similarly, we can convert all column names of the data.frame object from uppercase to lowercase using the tolower function so that it will be easier to type:
> colnames(dat) = tolower(colnames(dat))
> colnames(dat)
[1] "station" "station_name" "elevation" "latitude"
[5] "longitude" "date" "tpcp" "mmxt"
[9] "mmnt"
It is frequently useful to examine the structure of a given object using the str function. This function (which is also generic) prints the structure of its argument showing the data types of its components and the relations between them. In the case of a data.frame object, a list of the column names and types is printed, along with the table dimensions, and the first several values (or all values, if the table is very short). For example, the output for the small table df shows that we have a table with three columns (the variables) and four rows (the observations). It also shows that the first column is numeric and the last two are characters. Here is how the output looks like:
> str(df)
'data.frame': 4 obs. of 3 variables:
$ num : int 1 2 3 4
$ lower: chr "a" "b" "c" "d"
$ upper: chr "A" "B" "C" "D"

复制代码

报纸

ReneeBK 发表于 2015-9-4 09:19:09

Subsetting data.frame objects
There are two principal ways to create a subset of a data.frame object. The first involves accessing separate columns, using the column names, with the $ operator. The second involves providing the two vectors of indices, names or logical values, with the [ operator.
Using the $ operator, we can gain access to separate columns in a data.frame object. To do this, we simply insert the name of the data.frame to the left of the $ operator and the name of the required column to the right, as follows:
> df$num
[1] 1 2 3 4
> df$lower
[1] "a" "b" "c" "d"
> df$upper
[1] "A" "B" "C" "D"
Since the columns of a data.frame object are basically vectors, we can employ all the previously presented vector methods in columns of a data.frame object the same way we would in independent vectors. For example, we can replace the -9999 values (which mark the missing data) with NA, for each of the three measured variables in dat, as follows:
> dat$tpcp[dat$tpcp == -9999] = NA
> dat$mmxt[dat$mmxt == -9999] = NA
> dat$mmnt[dat$mmnt == -9999] = NA
The only difference from how we did this operation in the previous chapter is the dat$ part. This means that we refer to columns of the data.frame object (dat), rather than independent vectors. Now, let's convert the tpcp values to mm units and mmxt and mmnt values to degree Celsius units by dividing each value in the respective columns by 10, as follows:
> dat$tpcp = dat$tpcp / 10
> dat$mmxt = dat$mmxt/ 10
> dat$mmnt = dat$mmnt / 10

复制代码

地板

ReneeBK 发表于 2015-9-4 09:26:44

Calculating new data fields
As previously shown, we can assign new values to a column of a table (or to a subset of a column) using the $ operator. If the column name we assign does not exist in the table, a new column will be created to accommodate the data. Let's take a look at the following examples:
> df
num lower upper
1 1 a A
2 2 b B
3 3 c C
4 4 d D
> df$word[df$num == 2] = "Two"
> df
num lower upper word
1 1 a A <NA>
2 2 b B Two
3 3 c C <NA>
4 4 d D <NA>

复制代码

7楼

ReneeBK 发表于 2015-9-4 09:35:11

Writing a data.frame object to a CSV file
A data.frame object can be written to a CSV file with the write.csv function. The two first (and most important) parameters for this function indicate the name of the data.frame object, which we would like to save, and the path to the new file (including the new filename). These parameters have no defaults, so we need to specify them. For example, the following expression writes the data.frame object df to the df.csv file in the C:\Data directory:
> write.csv(df, "C:\\Data\\df.csv")

复制代码

8楼

ReneeBK 发表于 2015-9-4 09:37:26

Representing two-dimensional data with a matrix
A matrix object is a two-dimensional collection of elements, all of the same type (as opposed to a data.frame object; see the previous chapter), where the number of elements in all rows (and, naturally, all columns) is identical. Matrix objects have many uses in R. For example, certain functions take matrices as their arguments (such as the focal function to filter rasters) or return matrices (such as the extract function to extract raster values; we will meet both these functions in the subsequent chapters).
A matrix object can be created with the matrix function by specifying its values (in the form of a vector) and dimensions as follows:
> matrix(1:6, ncol = 3)
[,1] [,2] [,3]
[1,] 1 3 5
[2,] 2 4 6
The first four parameters of the matrix function are as follows:
data: The vector of values for the matrix (for example, 1:6)
nrow: The number of rows
ncol: The number of columns (for example, 3)
byrow: Whether the matrix is filled column by column (FALSE, which is the default value) or row by row (TRUE)
The nrow and ncol parameters determine the number of rows and columns, respectively. We can specify either one of these parameters, and the other will be calculated taking into account the overall number of elements. Let's take a look at the following example:
> matrix(1:6, nrow = 3)
[,1] [,2]
[1,] 1 4
[2,] 2 5
[3,] 3 6
> matrix(1:6, nrow = 2)
[,1] [,2] [,3]
[1,] 1 3 5
[2,] 2 4 6

复制代码

9楼

lonestone

发表于 2016-7-27 09:51:18

谢谢楼主

10楼

ReneeBK 发表于 2017-8-8 07:58:47

Learning R for Geospatial Analysis [推广有奖]

经管之家送您一份

经管之家联合CDA

感谢您参与论坛问题回答

扫码加我拉你入群

相关帖子

本帖被以下文库推荐

浏览过的帖子

浏览过的版块

本版微信群

Learning R for Geospatial Analysis [推广有奖]

经管之家送您一份

经管之家联合CDA

感谢您参与论坛问题回答

扫码加我 拉你入群

相关帖子

本帖被以下文库推荐

浏览过的帖子

浏览过的版块

本版微信群

扫码加我拉你入群