_ _ _(_)_ | A fresh approach to technical computing
(_) | (_) (_) | Documentation: http://docs.julialang.org
_ _ _| |_ __ _ | Type "help()" to list help topics
| | | | | | |/ _` | |
| | |_| | | | (_| | | Version 0.3.0-prerelease+490 (2013-12-15 07:16 UTC)
_/ |\__'_|_|_|\__'_| | Commit f8f3190* (0 days old master)
|__/ | x86_64-linux-gnu
julia> # load the package
julia> using DataFrames
julia> # read a txt file into dataframe
julia> df1 = readtable("credit_count.txt");
julia> # subset the dataframe
julia> df2 = df1[:(CARDHLDR .== 1), ["DEFAULT", "MAJORDRG", "MINORDRG"]];
julia> # aggregate the data
julia> df3 = by(df2, "DEFAULT", :(MAJOR_DRG = mean(_DF["MAJORDRG"])))
2x2 DataFrame:
DEFAULT MAJOR_DRG
[1,] 0 0.139851
[2,] 1 0.175703
julia> df4 = by(df2, "DEFAULT", :(MINOR_DRG = mean(_DF["MINORDRG"])))
2x2 DataFrame:
DEFAULT MINOR_DRG
[1,] 0 0.213196
[2,] 1 0.292169
julia> # join two dataframes
julia> df5 = join(df3, df4, on = "DEFAULT", kind = :inner)
2x3 DataFrame:
DEFAULT MAJOR_DRG MINOR_DRG
[1,] 0 0.139851 0.213196
[2,] 1 0.175703 0.292169


雷达卡



京公网安备 11010802022788号







