
Tag: 统计 (Statistics)

Related posts

A remark on statistical models
西门高 2017-6-13 18:25
"All models are wrong, but some are useful." Box (1979)
16 views | 0 comments
Statistics 1
西门高 2017-4-25 19:33
"As there are many possible forms of nonlinearity it is likely that no one test will be powerful against them all, so several tests may be needed." (Teräsvirta, Tjøstheim and Granger, 1992)
16 views | 0 comments
Statistical arbitrage on Shanghai-Hong Kong Stock Connect stocks: a BP neural network approach
accumulation 2016-12-16 15:53
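1. Base strategy: statistical arbitrage

Statistical arbitrage uses statistics on historical data to guide investment. It is a model-based, short-to-medium-term strategy that relies on quantitative analysis to find opportunities. Pairs trading, one form of statistical arbitrage, earns its return from the relative price spread between the two legs of a pair. When the market is relatively stable the spread is, in theory, stable as well, so most of the risk from market-wide trends can be hedged.

The usual steps are: first, use correlation coefficients and stationarity tests to pick from the stock pool the pairs that may have a stable price relationship; second, use Engle-Granger (EG) cointegration regression to confirm which pairs do have one; finally, among the pairs with a stable relationship, select those whose spread currently deviates.

Returns of pairs trading: let Yt and Xt be two price series that have both passed the correlation and stationarity screens. The corresponding cointegration equation is: [formula not preserved in the source]. Rearranging: [formula missing]. At investment time i, if the standardised residual deviates positively (that is, beyond the opening threshold that has been set), the residual (the spread) is expected to revert, so one sells short 1 unit of Yt at price Yi and buys [coefficient missing] units of Xt at price Xi, which forms the pair. At closing time j the profit (or loss) is: [formula missing]. If the deviation is negative, the opposite trades are made.

With this approach, the time and the price range needed for the spread to revert are unknown. If the spread series could be forecast at the time of investment, the arbitrage profit could be fixed in advance. In addition, the cointegration residual series (the spread) is autocorrelated. This article therefore takes the autocorrelation of the spread series as the theoretical basis for forecasting and uses a neural network to forecast the spread, thereby improving the base strategy.

2. Optimized strategy: BP neural network algorithm

Artificial neural networks (ANNs) are used in a very wide range of fields. They are an important part of intelligence science and computational intelligence. Building on results from brain science and cognitive neuroscience, they extend the methods of intelligent information processing and offer an effective route to solving complex problems and to intelligent control. Put simply, a neural network is a mathematical model containing many information processors (network layers and neurons); as input information arrives, the model's weights and thresholds are adjusted dynamically until an output is produced.

The mathematical formula underlying the neural network model is: [formula missing], whose terms are the connection weights, the connection thresholds and the activation function.

The BP (back-propagation) neural network is the most typical and most widely used ANN model. It adds a feedback process to a typical network structure [figure not preserved in the source; Xm and Ym denote the inputs and outputs].

A BP network can fit a time series because, through error feedback and weight correction, it adjusts the connection weights between the neurons of adjacent layers so that the network as a whole approximates whatever functional form may be present (in theory it can approximate arbitrarily closely even complex functions that traditional econometrics cannot express).

There are many ways to choose the parameters of a BP model. The network parameters here are largely fixed: the training function is traingdx, the minimum gradient is 0, the transfer function is tansig, the maximum number of iterations is 2000, the minimum variance (the error goal) is 0.001, and there is 1 hidden layer. The numbers of input and output neurons follow the numbers of input and output variables, and the number of hidden neurons is set by a commonly used formula: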
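The formulas of the base strategy did not survive in this excerpt, so the following is only a minimal sketch of the pairs-trading step described above, under stated assumptions. It assumes the regression has the usual form Yt = alpha + beta*Xt + et, that the residual et is the spread, and that a position is opened when the standardised residual crosses a threshold. The names `fit_spread`, `zscores`, `signal`, `pair_pnl` and the toy prices are hypothetical, and no stationarity or cointegration test is performed here.

```python
from statistics import mean, stdev


def fit_spread(y, x):
    """Regress y on x by ordinary least squares (with intercept).

    Returns (alpha, beta, residuals); the residuals play the role of the spread.
    Assumed model: y_t = alpha + beta * x_t + e_t.
    """
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    beta = sxy / sxx
    alpha = my - beta * mx
    resid = [b - alpha - beta * a for a, b in zip(x, y)]
    return alpha, beta, resid


def zscores(resid):
    """Standardise the residual series with its own sample mean and std."""
    m, s = mean(resid), stdev(resid)
    return [(e - m) / s for e in resid]


def signal(z, threshold):
    """-1: short 1 unit of Y and buy beta units of X; +1: the reverse; 0: no trade."""
    if z > threshold:
        return -1
    if z < -threshold:
        return 1
    return 0


def pair_pnl(y, x, beta, i, j, side):
    """Profit (or loss) of a pair opened at index i and closed at index j."""
    return side * ((y[j] - y[i]) - beta * (x[j] - x[i]))


# Toy prices, invented for illustration only.
x = [10.0, 11.0, 12.0, 13.0, 14.0]
y = [17.5, 17.5, 21.0, 20.5, 23.5]
alpha, beta, resid = fit_spread(y, x)
z = zscores(resid)
side = signal(z[2], threshold=1.0)   # spread is stretched upward at index 2
print(side, pair_pnl(y, x, beta, 2, 3, side))
```

With `side = -1` the profit equals the fall in the residual between opening and closing, which is why a forecast of the spread, as the post proposes, would let the trader estimate the profit in advance.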
Category: Financial engineering | 0 comments
Statistical arbitrage in stock index futures calendar spreads: estimation based on cointegration
accumulation 2016-12-16 15:49
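I. Introduction to statistical arbitrage

Statistical arbitrage is an investment process based on quantitative models. Without relying on economic interpretation, it uses quantitative methods to build a portfolio: actual security prices are compared with the theoretical values predicted by the model, and long and short positions are constructed accordingly, so as to avoid market risk and earn a stable alpha. Unlike risk-free arbitrage, statistical arbitrage exploits historical statistical regularities in security prices. It is a form of risk arbitrage, and the risk lies in whether those regularities will continue to hold over the coming period.

Methodologically, statistical arbitrage falls into two classes. One models the cointegration relationship between price series and is called the cointegration strategy. The other models return series, aiming to earn alpha while keeping the portfolio's beta equal to zero, and is called the neutral strategy; it is mainly applied to arbitrage trades that use margin trading and securities lending.

II. Calendar spread arbitrage

Calendar spread arbitrage is also known as carrying-charge arbitrage or old-crop/new-crop spread trading. In general it trades stock index futures on the same underlying but with different delivery months. Futures on the same index normally have several contracts with different delivery months trading at the same time; China's CSI 300 index futures currently list 4 contracts at once (the current month, the next month and the following two quarter months), which provides the basis for calendar spread arbitrage. Because all of these contracts are based on the same index, the spread between contracts with different delivery dates should be stable when market expectations are relatively stable, and an arbitrage opportunity arises once the spread changes. Given the uncertainty in how the spread moves, investors have to forecast the spread between contracts of different maturities and its path, so this form of arbitrage is not risk-free. If the spread moves in the forecast direction the trade makes money; otherwise it loses. However, because the two contracts held rise and fall together while the positions are in opposite directions, most of the risk from trend moves is hedged. This makes the risk of the arbitrage trade far smaller than that of pure speculation (an outright long or short position).

Calendar spread arbitrage in stock index futures mainly comprises bear spreads and bull spreads.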
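The hedging argument above can be checked with a little arithmetic. The sketch below is only an illustration under assumptions: the function name `calendar_spread_pnl`, the `multiplier` parameter and the quotes are hypothetical and are not taken from the post. It computes the profit of one long/short pair of contracts with different delivery months.

```python
def calendar_spread_pnl(near_open, far_open, near_close, far_close,
                        long_near=True, multiplier=1.0):
    """P&L of one long/short pair of contracts with different delivery months.

    long_near=True  : long the near-month contract, short the far-month one.
    long_near=False : the reverse.
    multiplier      : currency value of one index point (hypothetical parameter).
    """
    near_move = near_close - near_open
    far_move = far_close - far_open
    points = near_move - far_move if long_near else far_move - near_move
    return points * multiplier


# Invented quotes: both contracts rise by roughly 100 points,
# while the far-minus-near spread narrows from 20 to 10.
print(calendar_spread_pnl(3000.0, 3020.0, 3100.0, 3110.0))
print(calendar_spread_pnl(3000.0, 3020.0, 3100.0, 3110.0, long_near=False))
```

The outcome depends only on the change in the spread, not on the 100-point move in the index itself, which is the sense in which most trend risk is hedged and also why a wrong forecast of the spread still produces a loss.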
Category: Financial engineering | 0 comments
Probability and mathematical statistics: recommended classic books and textbooks
hylpy1 2016-10-23 11:08
For non-mathematics majors
1. 概率论与数理统计 (4th ed.), 盛骤、谢式千、潘承毅, 高等教育出版社
2. 概率论与数理统计, 陈希孺, 中国科学技术大学出版社
3. 概率论与数理统计, 周纪芗、茆诗松, 中国统计出版社
4. 概率论基础教程 (8th ed.), 罗斯 (Sheldon M. Ross) 著、郑忠国译, 人民邮电出版社
5. 概率论与数理统计 (adapted 3rd ed.), 德格奥特、谢尔维斯, 高等教育出版社
6. 概率统计 (English edition, 4th ed.), DeGroot, M. H. and Schervish, M. J., 机械工业出版社 (a classic probability and statistics textbook, a steady bestseller for many years, adopted by CMU, Harvard and many other leading universities)
7. 概率论 (English edition), 皮特曼 (Pitman, J.), 世界图书出版公司
8. 应用随机过程:概率模型导论 (10th ed.), 罗斯 (Sheldon M. Ross) 著、龚光鲁译, 人民邮电出版社

-------------------------------------------------------------------------------

For mathematics majors

People's understanding of probability theory can generally be divided into three levels:
1. Classical: anyone without relevant training belongs here and can only understand some discrete (classical) probability models.
2. Early modern: usually those who have studied introductory probability and understand continuous distributions and the numerical characteristics of probability models through calculus.
3. Modern: an abstract understanding from the standpoint of measure theory and real analysis; probability built on measure theory is what is usually called advanced probability.

Probability: elementary to intermediate
1. Mathematical Methods of Statistics, Harald Cramér
2. Kendall's Advanced Theory of Statistics, vol. 1 (mainly distribution theory)
3. 概率论基础 (3rd ed.), 李贤平, 高等教育出版社 (study guide available)
4. 初等概率论 (4th ed., English edition), 钟开莱 (Kai Lai Chung), 世界图书出版公司
5. 概率论及其应用 第1卷 (3rd ed.), 威廉·费勒 (study guide available)
6. 概率论 (2nd ed.), 苏淳
7. 概率论及数理统计(上册) (4th ed.), 邓集贤、杨维权、司徒荣 等, 高等教育出版社 (study guide available)

Probability: advanced
1. 概率论教程 (English edition, 3rd ed.), 钟开莱 (Kai Lai Chung), 机械工业出版社
2. 概率论及其应用 第2卷 (3rd ed.), 威廉·费勒
3. 概率论 (vols. 1-2, 4th ed.), M. Loève, 世界图书出版公司
4. 俄罗斯数学教材选译:概率 (vols. 1-2, revised and enlarged 3rd ed.), 施利亚耶夫, 高等教育出版社
5. Kallenberg, Foundations of Modern Probability
6. David Williams, Probability with Martingales
7. Y. S. Chow, Probability Theory

Probability: study guides
概率论基础学习指导书, 李贤平、陈子毅, 高等教育出版社
概率论题解1000例 (English edition), G. 格里梅特 (Geoffrey Grimmett)、D. 斯特扎克 (David Stirzaker), 世界图书出版社
概率论习题集, 施利亚耶夫著、苏淳译, 高等教育出版社
概率论题解 (English edition), T. M. Mills
Exercises in Probability: A Guided Tour from Measure Theory to Random Processes, via Conditioning, L. Chaumont and M. Yor, Cambridge University Press
概率论及其应用题解 (solutions to W. Feller, vol. 1), 陈希孺

Mathematical statistics: elementary to intermediate
1. Mathematical Methods of Statistics, Harald Cramér
2. 数理统计学教程, 陈希孺
3. 数理统计学讲义, 陈家鼎 (exercise solutions take up half the book)
4. Peter J. Bickel, Mathematical Statistics: Basic Ideas and Selected Topics (a Chinese study guide exists; the 2nd edition is split into two volumes, of which the first has been published and the second has not)
5. Kendall's Advanced Theory of Statistics, vols. 2-3 (vol. 2 covers statistical inference, vol. 3 time series)
6. V. K. Rohatgi, An Introduction to Probability Theory and Mathematical Statistics
7. 数理统计学导论 (English edition, 7th ed.), 霍格 (Robert V. Hogg)、Joseph W. McKean、Allen T. Craig, 机械工业出版社 (study guide available for the 4th ed.)
8. 高等统计学, 郑忠国、童行伟、赵慧, 北京大学出版社
9. 数理统计, 韦来生, 科学出版社
10. Wiebe R. Pestman, Mathematical Statistics (2009) (study guide available)
11. 概率论及数理统计(下册) (4th ed.), 邓集贤、杨维权、司徒荣 等, 高等教育出版社 (study guide available)

Mathematical statistics: advanced (measure-theoretic)
1. Jun Shao, Mathematical Statistics (2nd ed.)
2. 数理统计引论, 陈希孺
3. 高等数理统计, 陈希孺
4. 高等数理统计 (2nd ed.), 茆诗松、王静龙、濮晓龙, 高等教育出版社
5. Mathematical Theory of Statistics: Statistical Experiments and Asymptotic Decision Theory (goes as far as Hilbert spaces)

Mathematical statistics: study guides
1. 数理统计习题教程(上下), 李泽慧、荆炳义、李效虎, 兰州大学出版社 (solutions to the first edition of Bickel)
2. Mathematical Statistics: Exercises and Solutions, Jun Shao
3. Mathematical Statistics: Problems and Detailed Solutions, Wiebe R. Pestman and Ivo B. Alberink
4. 数理统计学导论习题详解, R. V. 霍格 http://www.docin.com/p-287034823.html
5. 概率论及数理统计(上下)习题解答, 许刘俊、杨维权

Statistical software textbooks

Matlab
MATLAB统计分析与应用:40个案例分析, 谢中华, 北京航空航天大学出版社
MATLAB概率与数理统计分析, 张德丰 等, 机械工业出版社

SAS
The Little SAS Book: A Primer, Fourth Edition, Lora Delwiche and Susan Slaughter
SAS编程技术教程, 朱世武, 清华大学出版社

R
The Art of R Programming: A Tour of Statistical Software Design, Norman S. Matloff
A Beginner's Guide to R (Use R!), Alain F. Zuur, Elena N. Ieno and Erik Meesters

Note: electronic versions of these books can generally be downloaded from 人大经济论坛, 新浪爱问 and 豆丁网. For more landmark classics, see Wikipedia's List of important publications in statistics: http://en.wikipedia.org/wiki/List_of_important_publications_in_statistics
https://bbs.pinggu.org/forum.php?mod=viewthread&tid=1530074&page=1
97 views | 0 comments
[Repost] Probability, mathematical statistics, stochastic processes, stochastic finance: classic textbooks and monographs
Popularity 1 | hylpy1 2016-9-2 18:22
For non-mathematics majors (undergraduate): probability, statistics and stochastic processes
概率论与数理统计 (4th ed.), 盛骤 (essential for the postgraduate entrance exam)
概率论与数理统计教程 (2nd ed.), 茆诗松
概率论与数理统计, 陈希孺
概率论基础教程 (8th ed.), 罗斯著、郑忠国译 (a 9th edition is already out and is also the last edition); answers for the 7th ed.: http://www.docin.com/p-109941348.html
概率论与数理统计 (adapted 3rd ed.), 德格奥特、谢尔维斯
概率统计 (English edition, 4th ed.), 德格鲁特、舍维什
概率与统计 (English edition), Ronald E. Walpole, Raymond H. Myers, Sharon L. Myers, Keying Ye
概率论 (English edition), 皮特曼
应用随机过程:概率模型导论 (10th ed.), 罗斯著、龚光鲁译
概率、统计与随机过程 (4th ed., English edition), 亨利·斯塔克 (Henry Stark)
Schaum's Outlines: Probability, Random Variables and Random Processes
Schaum's Easy Outline of Probability and Statistics
Schaum's Outline of Beginning Statistics, 2nd Edition
Schaum's Outlines: Elements of Statistics I: Descriptive Statistics and Probability
Schaum's Outlines: Elements of Statistics II: Inferential Statistics
Applied Multivariate Statistical Analysis (6th ed.), Richard A. Johnson
Multivariate Data Analysis (7th ed.), Joseph F. Hair, William C. Black, Barry J. Babin, Rolph E. Anderson
A Modern Introduction to Probability and Statistics: Understanding Why and How, Dekking

Study guides
概率论与数理统计教程:习题与解答 (2nd ed.), 茆诗松
概率论与数理统计习题全解指南 (浙大, 4th ed.), 盛骤
Schaum's Outline of Theory and Problems of Probability and Statistics

Statistics
统计学, David Freedman et al., translated by 魏宗舒、施锡铨 et al., 中国统计出版社 (said to be the best book on statistical thinking; I read some chapters and learned a great deal. The book has almost no formulas, yet it gets to the essence of statistical thinking.)
Mind on Statistics (English edition), 机械工业出版社 (needs only high-school mathematics; a primer on statistics. One sentence left a deep impression: "Mathematics as to statistics is something like hammer, nails, wood as to a house, it's just the material and tools but not the house itself.")
数理统计与数据分析 (translation of the 3rd ed.), 机械工业出版社 (clearly different from Chinese mathematical statistics books. The approach is good and it covers many new things, presenting the popular bootstrap method alongside traditional statistics. There are reviews on Amazon.)
Business Statistics: A Decision-Making Approach (reprint edition), 中国统计出版社 (very practical material, although teachers of mathematical statistics often look down on it)
Understanding Statistics in the Behavioral Science (reprint edition), 中国统计出版社 (same series as the book above; books by foreign authors are all quite interesting)
探索性数据分析, 中国统计出版社 (same series as the first book. Do read the preface by 陈希孺, which can be seen as a reflection on mathematical statistics in China.)
商务与经济统计 (translation of the 11th ed.), 安德森 (Anderson, D. R.) et al. (readable with algebra alone; the best-selling business statistics text in the United States)
统计学 (translation of the 5th ed.), 门登霍尔 (William Mendenhall)、辛塞奇 (Terry Sincich)
统计模型:理论和实践 (translation of the 2nd ed.), 弗里曼 (David A. Freedman)
Elements of Statistics, 6th ed., Arthur L. Bowley (the world's first statistics textbook)
Introduction to the Theory of Statistics, 14th ed., George Udny Yule and Sir Maurice Kendall
Introduction to the Theory of Statistics, 3rd Edition, Alexander M. Mood
Introductory Statistics, Third Edition, Sheldon Ross

-------------------------------------------------------------------------------

For mathematics majors (undergraduate and graduate)

People's understanding of probability theory can generally be divided into three levels:
1. Classical: anyone without relevant training belongs here and can only understand some discrete (classical) probability models.
2. Early modern: usually those who have studied introductory probability and understand continuous distributions and the numerical characteristics of probability models through calculus.
3. Modern: an abstract understanding from the standpoint of measure theory and real analysis; probability built on measure theory is what is usually called advanced probability.
Mathematical statistics can be divided in a similar way.

Probability: elementary to intermediate
Théorie analytique des probabilités, Laplace, 1812
Calcul des probabilités, Henri Poincaré, 2nd ed., 1912
Introduction to Mathematical Probability, Uspensky, 1937
Random Variables and Probability Distributions, 3rd ed., Harald Cramér
The Elements of Probability Theory and Some of Its Applications, 2nd ed., Harald Cramér
Mathematical Methods of Statistics, Harald Cramér
概率论导引, (苏) 柯尔莫戈洛夫 等著; 周概容、肖慧敏译
概率论基础 (3rd ed.), 李贤平 (study guide available)
初等概率论 (4th ed., English edition), 钟开莱
概率论及其应用 第1卷 (3rd ed.), 威廉·费勒 (study guide available)
概率论 (2nd ed.), 苏淳
概率论及数理统计(上) (4th ed.), 邓集贤 (study guide available)
Probability and Random Processes, 3rd ed., G. Grimmett, D. Stirzaker (study guide available)
Theory of Probability, 2nd ed., Gnedenko
概率与信息, (苏) 雅格洛姆 (А.М.Яглом)
Theory of Probability, 3rd ed., H. Jeffreys
Stochastics: Introduction to Probability and Statistics, Hans-Otto Georgii
Probability via Expectation, Whittle
An Intermediate Course in Probability, Allan Gut
Applied Probability (2nd ed., Springer, 2010), K. Lange

Probability: advanced
Foundations of the Theory of Probability, 2nd ed., A. N. Kolmogorov
概率论教程 (English edition, 3rd ed.), 钟开莱
概率论及其应用 第2卷 (3rd ed.), 威廉·费勒
概率论 (vols. 1-2, 4th ed.), M. Loève
概率论, (日) 伊藤清著, 刘璋温译
概率 (vols. 1-2, revised and enlarged 3rd ed.), 施利亚耶夫
高等概率论及其应用, 胡迪鹤
测度与概率 (2nd ed.), 严士健
测度论与概率论基础, 程士宏
Kallenberg, Foundations of Modern Probability, 2nd ed.
David Williams, Probability with Martingales
Y. S. Chow (周元燊, a Chinese mathematician), Probability Theory: Independence, Interchangeability, Martingales, 3rd ed.
Sheldon M. Ross, A Second Course in Probability
R. Durrett, Probability: Theory and Examples, 4th edition
Billingsley, Probability and Measure, 3rd Edition
Athreya, Measure Theory and Probability Theory
Erhan Çınlar, Probability and Stochastics (GTM 261)
P. Malliavin, Integration and Probability (GTM 157)
A. Rényi, Foundations of Probability
J. Jacod, Probability Essentials (a French textbook of only 200-odd pages, particularly suited to economics students; answers are available online)
Heinz Bauer, Probability Theory (1996)
Galen R. Shorack, Probability for Statisticians
Fristedt, A Modern Approach to Probability Theory, 1997
Klenke, Probability Theory: A Comprehensive Course, 2nd ed., 2013
A Basic Course in Probability Theory (an advanced probability text of under 200 pages, short and to the point)
J. C. Taylor (1997), An Introduction to Measure and Probability, Springer
L. Breiman (1992), Probability, SIAM, Philadelphia
Borovkov, Alexandr A., Probability Theory (Universitext), Springer, 2013
Allan Gut, Probability: A Graduate Course, Springer, 2013
Davar Khoshnevisan, Probability (GSM 80)
Dudley, Real Analysis and Probability
Leonid B. Koralov, Yakov G. Sinai, Theory of Probability and Random Processes
DasGupta, Probability for Statistics and Machine Learning: Fundamentals and Advanced Topics
DasGupta, Fundamentals of Probability: A First Course
Pollard, A User's Guide to Measure Theoretic Probability
John B. Walsh, Knowing the Odds: An Introduction to Probability
Daniel W. Stroock, Mathematics of Probability (Graduate Studies in Mathematics)
Venkatesh, The Theory of Probability: Explorations and Applications
Advanced Probability Theory (荆炳义, lecture notes on advanced probability)

Probability: study guides
Schaum's Outlines: Probability, Random Variables and Random Processes
概率论基础学习指导书, 李贤平、陈子毅
概率论题解1000例 (English edition), G. 格里梅特、D. 斯特扎克
概率论习题集, 施利亚耶夫著、苏淳译
概率论题解 (English edition), T. M. Mills, Problems in Probability
Capinski, Probability Through Problems
Exercises in Probability: A Guided Tour from Measure Theory to Random Processes, L. Chaumont and M. Yor
Problems in Probability Theory, Mathematical Statistics and Theory of Random Functions, A. A. Sveshnikov
Theoretical Exercises in Probability and Statistics, 2nd Edition
概率论及其应用题解 (solutions to W. Feller, vol. 1), 陈希孺
王道益, 分析概率论与随机过程习题解析 (includes solutions to the exercises in 胡迪鹤's books on analytic probability and stochastic processes)
概率论教程题解, 北京化工学院数学教研室 (solutions to the exercises in Gnedenko's probability text)
概率论习题集, (苏) Л.Д.梅沙尔金著; 盛骤等译
概率论习题集, (苏) 特罗高夫切夫等著; 何声武等译

Mathematical statistics: elementary to intermediate
Mathematical Methods of Statistics, Harald Cramér
Mathematical Statistics, B. L. van der Waerden
数理统计学教程, 陈希孺
数理统计学讲义, 陈家鼎; solutions: http://www.docin.com/p-437425905.html
Mathematical Statistics: Basic Ideas and Selected Topics (a Chinese study guide exists for the 1st edition; the 2nd edition is split into two volumes, of which the first has been published and the second, according to the author, is due in 2013)
Kendall's Advanced Theory of Statistics, vols. 2-3 (vol. 2 covers statistical inference, vol. 3 experimental design and time series; study guide available for the 3rd ed.)
V. K. Rohatgi, An Introduction to Probability Theory and Mathematical Statistics
Stochastics: Introduction to Probability and Statistics, Hans-Otto Georgii
数理统计学导论 (English edition, 7th ed.), 霍格 (study guide available for the 4th ed.)
高等统计学, 郑忠国、童行伟、赵慧
数理统计, 韦来生
Wiebe R. Pestman, Mathematical Statistics (study guide available)
概率论及数理统计(下册) (4th ed.), 邓集贤 (study guide available)
Samuel S. Wilks, Mathematical Statistics
C. R. Rao, Linear Statistical Inference and Its Applications, 2nd ed. (倪国熙 and 陈希孺 wrote reference solutions to part of this book)
Theoretical Statistics, D. R. Cox, D. V. Hinkley (study guide available)
Boos, Essential Statistical Inference: Theory and Methods, 2013
Statistical Inference, 2nd edition, George Casella
Modern Mathematical Statistics with Applications, Devore, Berk, 2nd ed., 2012

Mathematical statistics: advanced (measure-theoretic)
Jun Shao, Mathematical Statistics (2nd ed.)
数理统计引论, 陈希孺
高等数理统计, 陈希孺 (exercise solutions take up half the book)
高等数理统计 (2nd ed.), 茆诗松
Mathematical Theory of Statistics: Statistical Experiments and Asymptotic Decision Theory
Theory of Statistics, 2nd ed., Mark J. Schervish
Advanced Statistics, Volume 1: Description of Populations, Shelby J. Haberman
Abstract Inference, U. Grenander
Mathematical Statistics, Borovkov, Alexandr A.
Theoretical Statistics: Topics for a Core Course, Robert W. Keener (solutions at the back; suitable for graduate statistics courses for mathematics majors; published by Springer in 2010. The author is a professor at the University of Michigan and a fellow of the Institute of Mathematical Statistics.)
Korostelev, Mathematical Statistics: Asymptotic Minimax Theory
Lehmann, Testing Statistical Hypotheses, 3rd ed.
Lehmann, Theory of Point Estimation, 2nd ed.
Lehmann, Elements of Large-Sample Theory
A. W. van der Vaart, Asymptotic Statistics
Le Cam, Asymptotic Methods in Statistical Decision Theory
Advanced Mathematical Statistics I (荆炳义, lecture notes on advanced statistics)

Mathematical statistics: study guides
数理统计习题教程(上下), 李泽慧 (solutions to the first edition of Bickel)
Mathematical Statistics: Exercises and Solutions, Jun Shao
Mathematical Statistics: Problems and Detailed Solutions, Wiebe R. Pestman, Ivo B. Alberink
数理统计学导论习题详解, R. V. 霍格 http://www.docin.com/p-287034823.html
概率论及数理统计(上下)习题解答, 许刘俊
A. A. Sveshnikov, Problems in Probability Theory, Mathematical Statistics and Theory of Random Functions
倪国熙、陈希孺著, 线性统计与线性代数 参考资料
Problems and Solutions in Theoretical Statistics, David Roxbee Cox
Exercises in Theoretical Statistics: With Answers and Hints on Solutions, Sir Maurice Kendall
Problems in Mathematical Statistics, G. Ivchenko
Basics of Modern Mathematical Statistics: Exercises and Solutions (Springer Texts in Statistics), Härdle
Solutions Manual to Mathematical Statistics: Asymptotic Minimax Theory http://www.docin.com/p-706903193.html

Multivariate statistics
An Introduction to Multivariate Statistical Analysis, 3rd ed., T. W. Anderson
Aspects of Multivariate Statistical Theory, 2nd ed., Robb J. Muirhead
Methods of Multivariate Analysis, 2nd ed., Alvin C. Rencher
Applied Multivariate Statistical Analysis, 3rd ed., Wolfgang Karl Härdle, Léopold Simar (study guide available)
Multivariate Statistical Analysis, 2nd ed., revised and expanded, Giri, Marcel Dekker, 2004 (study guide available)
Theory of Multivariate Statistics, M. Bilodeau, D. Brenner
Multivariate Analysis, 2nd ed., M. Kendall
离散多元分析:理论与实践, Yvonne M. M. Bishop, Stephen E. Fienberg, Paul W. Holland
多元统计分析引论, 张尧庭、方开泰
应用多元统计分析, 高惠璇
Topics in Multivariate Approximation and Interpolation, K. Jetter (Elsevier, 2006)
Multivariate Statistics with R, Paul J. Hewson, 2009
Applied Multivariate Statistics with SAS Software, Second Edition, SAS Institute
Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning, Alan Julian Izenman
Multivariate Statistics: High-Dimensional and Large-Sample Approximations, Fujikoshi

Multivariate statistics: study guides
Multivariate Statistics: Exercises and Solutions, Härdle, Springer, 2007
多元统计分析习题选解, 吉林大学数学系编写组 (solutions to part of Giri)

Distribution theory (using probability and statistics to study the properties of distribution functions)
Kendall's Advanced Theory of Statistics, vol. 1 (mainly distribution theory; study guide available for the 3rd ed.)
Norman L. Johnson, Univariate Discrete Distributions, 3rd ed.
Kocherlakota, Bivariate Discrete Distributions
Norman L. Johnson, Discrete Multivariate Distributions
Norman L. Johnson, Continuous Univariate Distributions, vols. 1-2
N. Balakrishnan, Continuous Bivariate Distributions
Norman L. Johnson, Continuous Multivariate Distributions, Volume 1: Models and Applications, 2nd Edition
N. Balakrishnan, A Primer on Statistical Distributions (2003)
A. K. Gupta, Matrix Variate Distributions
N. Balakrishnan, Advances in Distribution Theory, Order Statistics, and Inference
统计分布, 方开泰
Charalambos A. Charalambides, Combinatorial Methods in Discrete Distributions
Johnson N. L., Kotz S., Urn Models and Their Application (Wiley, 1977) (probability theory written in terms of urn models)
Christian, Statistical Size Distributions in Economics and Actuarial Sciences
Hogg, Klugman, Loss Distributions
Krishnamoorthy, Handbook of Statistical Distributions with Applications (Taylor and Francis, 2006)

Stochastic processes
J. L. Doob (1953), Stochastic Processes (2nd ed.), John Wiley & Sons
Stochastic Processes, Emanuel Parzen
A First Course in Stochastic Processes, Karlin
A Second Course in Stochastic Processes, Karlin
Diffusions, Markov Processes, and Martingales, vols. 1-2, David Williams
Essentials of Stochastic Processes, Durrett
Introduction to Probability Models, 10th Edition, Sheldon M. Ross
Stochastic Processes, 2nd ed., Sheldon Ross
Stochastic Processes for Insurance and Finance, Tomasz Rolski
R. Bhattacharya, Stochastic Processes with Applications, John Wiley & Sons, New York
S. Resnick (1992), Adventures in Stochastic Processes, Birkhäuser, Boston
Richard Serfozo, Basics of Applied Stochastic Processes, Springer, 2010
Bass, Stochastic Processes, 2011
Kiyosi Itô, Stochastic Processes: Lectures Given at Aarhus University
Durrett, Essentials of Stochastic Processes, 2012
随机过程, 伊藤清
随机过程论:基础、理论、应用, 胡迪鹤
随机过程导论, Edward P. C. Kao
随机过程论, 布林斯基、施利亚耶夫
随机过程论 第1-3卷, И.И.基赫曼 等
随机过程通论 第1-2卷, 王梓坤
应用随机过程, 林元烈
应用随机过程, 张波、张景肖
随机模型概论 (English edition, 4th ed.), Mark A. Pinsky, Samuel Karlin

Stochastic processes: study guides
Stochastic Processes: Problems and Solutions, L. Takács
Theory of Stochastic Processes: With Applications to Financial Mathematics and Risk Theory (Problem Books in Mathematics)
Exercises in Probability: A Guided Tour from Measure Theory to Random Processes, via Conditioning, 2nd ed., Chaumont
Solution Manual of Introduction to Probability Models, 10th ed.
随机过程疑难分析与解题方法, 孙昊、孙清华
随机过程习题解析 (2nd ed.), 陆传赉
随机过程及应用习题集, 张晓军

Stochastic finance
Karatzas, Shreve, Brownian Motion and Stochastic Calculus, 1991 (cited 8131 times)
Karatzas, Shreve, Methods of Mathematical Finance, 1998 (cited 2110 times)
Mikosch, Elementary Stochastic Calculus with Finance in View, 1988 (cited 289 times)
Shiryaev, Essentials of Stochastic Finance, 2000 (cited 857 times)
Steele, Stochastic Calculus and Financial Applications, 2001 (cited 385 times)
Shreve, Stochastic Calculus for Finance I: The Binomial Asset Pricing Model, 2004 (cited 1172 times)
Shreve, Stochastic Calculus for Finance II: Continuous-Time Models, 2004 (cited 1172 times)
Benth, Option Theory with Stochastic Analysis: An Introduction to Mathematical Finance, 2004 (cited 49 times)
Elliott, Mathematics of Financial Markets, Second Edition, 2005 (cited 472 times)
史树中, 金融学中的数学, 2006
Lin, Introductory Stochastic Analysis for Finance and Insurance, 2006 (cited 13 times)
Sondermann, Introduction to Stochastic Calculus for Finance: A New Didactic Approach, 2006 (cited 24 times; "The text is also useful for mathematicians interested in the methods of modern mathematical finance without prior knowledge of advanced stochastic analysis")
Lamberton, Introduction to Stochastic Calculus Applied to Finance, 2008 (cited 904 times)
Kwok, Mathematical Models of Financial Derivatives, Second Edition, 2008 (cited 589 times)
Kennedy, Stochastic Financial Models, 2010 (cited 9 times)
Ross, An Elementary Introduction to Mathematical Finance, 3rd ed., 2011 (cited 115 times)
Föllmer, Stochastic Finance: An Introduction in Discrete Time, 3rd ed., 2011 (cited 1235 times)
Capinski, Mathematics for Finance: An Introduction to Financial Engineering, 2nd ed., 2011 (cited 112 times; needs only calculus and basic probability and statistics, so it suits non-mathematics majors)
Večeř, Stochastic Finance: A Numeraire Approach, 2011 (cited 7 times)
严加安, 金融数学引论, 2012
Capiński, Stochastic Calculus for Finance, 2012
Janssen, Mathematical Finance: Deterministic and Stochastic Models, 2013 (cited 14 times)
McCauley, Stochastic Calculus and Differential Equations for Physics and Finance, 2013
Kijima, Stochastic Processes with Applications to Finance, 2013 (cited 106 times)
Michael Mastro, Financial Derivative and Energy Market Valuation: Theory and Implementation in MATLAB

Note: electronic versions can generally be downloaded from en.bookfi.org, www.mathsccnu.com, 人大经济论坛, 新浪爱问 and 豆丁网. For more landmark classics, see Wikipedia's List of important publications in statistics: http://en.wikipedia.org/wiki/List_of_important_publications_in_statistics

Appendix: top-tier and first-rate journals in statistics, econometrics, finance and actuarial science

Here is a rundown of the good journals in statistics and related disciplines. I hope everyone keeps an eye on them when doing research.

Statistics:
(1) Journal of the American Statistical Association (JASA)
(2) Journal of the Royal Statistical Society, Series B (JRSSB)
(3) Annals of Statistics
(4) Biometrika
These are the top-tier journals in statistics, known in academic circles as the "Big Four". JASA and JRSSB put great weight on innovation in statistical methods and theory, are essentially at the same level, and are the best of the four. Papers in Annals of Statistics, the third, are harder and their mathematical derivations more involved. Biometrika, the fourth, is a little behind the first three, but the community still counts it as a top-tier journal.
(5) Biometrics
(6) Statistica Sinica
(7) Scandinavian Journal of Statistics
(8) Bernoulli
These four are first-rate journals in statistics. Biometrics leans towards biostatistics; Statistica Sinica is close in style to Annals of Statistics and somewhat harder in its mathematical derivations; the Scandinavian Journal of Statistics sits between JRSSB and Annals of Statistics in style; Bernoulli covers both probability and statistics, with somewhat more probability papers.
Of course, besides these four there are other good first-rate journals, such as Biostatistics, Statistical Science, Technometrics and the Canadian Journal of Statistics. Some of them have higher impact factors, but in my view they are clearly a notch below the four above.

Econometrics and finance (journals that lean towards the humanities are not listed):
(1) Econometrica
(2) Journal of Finance (JF)
These two are top-tier journals. Econometrica's position as number one is unshakeable; in my view its level is clearly above that of JASA and JRSSB. An average article runs to about 60 pages, with new theory, new methods and complicated derivations. JF, the second, is also a very good top-tier journal, and almost nobody in China manages to publish in it.
(3) Journal of Econometrics (JE)
(4) Journal of Financial Econometrics (JFE)
(5) Econometric Theory (ET)
These are first-rate journals. Papers in JE are fairly hard and its level is somewhat above that of JFE; ET is a notch below JE and JFE but is still first-rate.

Actuarial science:
(1) Insurance: Mathematics and Economics
(2) ASTIN Bulletin
(3) Scandinavian Actuarial Journal
(4) North American Actuarial Journal
Actuarial science has no top-tier journals to speak of; it is, after all, a branch of statistics and a small field. The four above are its first-rate journals. The list here is of course not complete, and I will not add to it for now.

Taken together, the top-tier journals are Econometrica, JASA, JRSSB, Ann. Statist. and JF. Publish one paper in any of these, plus a certain number of papers in other ordinary journals, and winning a 国基面上项目 (NSFC General Program grant) is absolutely no problem. Publish one paper a year in these journals, plus a certain number of other papers, and 杰青 (the NSFC Distinguished Young Scholars award) is basically assured, especially if the papers are in Econometrica.

Source: http://user.qzone.qq.com/352693585/2
100 views | 1 comment
Resnick (1992), Adventures in stochastic processes. Birkhauser, Boston. Basics Of Applied Stochastic Processes (2010)Springer Richard Serfozo Bass,Stochastic Processes,2011 Stochastic Processes Lectures Given at Aarhus University,Kyosi Ito Durret,Essentials of stochastic processes,2012 随机过程 伊藤清 随机过程论—基础、理论、应用(胡迪鹤) 随机过程导论,Edward P.C.Kao 随机过程论【布林斯基,施利亚耶夫】 随机过程论 第1-3卷 И.И.基赫曼等 随机过程通论第1-2卷(王梓坤) 应用随机过程(林元烈) 应用随机过程(张波张景肖) 随机模型概论(英文版.第4版) Mark A.Pinsky;Samuel Karlin 辅导书 Stochastic processes. problems and solutions, Takacz L Theory of Stochastic Processes: With Applications to Financial Mathematics and Risk Theory (Problem Books in Mathematics) Exercises in Probability: A Guided Tour from Measure Theory to Random Processes, via Conditioning. 2ed, Chaumont. Solution Manual of Introduction to Probability Models 10ed 随机过程疑难分析与解题方法 孙昊、 孙清华 随机过程习题解析(第2版) 陆传赉 随机过程及应用习题集 张晓军 随机金融 Karatzas,Shreve,Brownian motion and stochastic calculus,1991 ( 被引用次数:8131 ) Karatzas,Shreve,Methods of mathematical finance,1998 ( 被引用次数:2110 ) Mikosch,Elementary Stochastic Calculus With Finance in View,1988 ( 被引用次数:289 ) Shiryaev,Essentials of Stochastic Finance,2000 ( 被引用次数:857 ) Steele,Stochastic Calculus and Financial Applications,2001 ( 被引用次数:385 ) Shreve,Stochastic calculus for finance I: The binomial asset pricing model,2004 ( 被引用次数:1172 ) Shreve,Stochastic Calculus for Finance II, Continuous Time Models,2004 ( 被引用次数:1172 ) Benth,Option theory with stochastic analysis: an introduction to mathematical finance,2004 ( 被引用次数:49 ) Elliott,Mathematics of Financial Markets Second Edition,2005 ( 被引用次数:472 ) 史树中 金融学中的数学 2006 Lin,Introductory Stochastic Analysis for Finance and Insurance,2006 ( 被引用次数:13 ) Sondermann,Introduction to Stochastic Calculus for Finance: A New Didactic Approach,2006 ( 被引用次数:24 The text is also useful for mathematicians interested in the methods of modern mathematical finance without prior knowledge of advanced stochastic analysis ) Lamberton,Introduction to stochastic calculus 
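applied to finance, 2008 (citations: 904)
Kwok, Mathematical Models of Financial Derivatives, Second Edition, 2008 (citations: 589)
Kennedy, Stochastic Financial Models, 2010 (citations: 9)
Ross, An Elementary Introduction to Mathematical Finance, 3ed, 2011 (citations: 115)
Föllmer, Stochastic Finance: An Introduction in Discrete Time, 3ed, 2011 (citations: 1235)
Capinski, Mathematics for Finance: An Introduction to Financial Engineering, 2ed, 2011 (citations: 112; needs only calculus and basic probability and statistics, suitable for non-mathematics majors)
Večeř, Stochastic Finance: A Numeraire Approach, 2011 (citations: 7)
严加安 (Yan Jia-An), 金融数学引论 (Introduction to Financial Mathematics), 2012
Capiński, Stochastic Calculus for Finance, 2012
Janssen, Mathematical Finance: Deterministic and Stochastic Models, 2013 (citations: 14)
McCauley, Stochastic Calculus and Differential Equations for Physics and Finance, 2013
Kijima, Stochastic Processes with Applications to Finance, 2013 (citations: 106)
Michael Mastro, Financial Derivative and Energy Market Valuation: Theory and Implementation in MATLAB

Note: electronic copies can usually be downloaded from en.bookfi.org, www.mathsccnu.com, 人大经济论坛, 新浪爱问 and 豆丁网. For more landmark classics, see Wikipedia's List of important publications in statistics: http://en.wikipedia.org/wiki/List_of_important_publications_in_statistics

Appendix: top and first-rate journals in statistics, econometrics, finance and actuarial science

Here is a list of good journals in statistics and related fields; I hope everyone keeps an eye on them when doing research.

Statistics:
(1) Journal of the American Statistical Association (JASA) (2) Journal of the Royal Statistical Society, Series B (JRSSB) (3) Annals of Statistics (4) Biometrika
These are the top journals in statistics, known in academic circles as the "big four". JASA and JRSSB put great weight on innovation in statistical methods and theory; they are at about the same level and are the best of the four. Third comes Annals of Statistics, whose papers are harder and carry more complicated mathematical derivations. Fourth is Biometrika, which is somewhat behind the first three, but the community still counts it as a top journal.
(5) Biometrics (6) Statistica Sinica (7) Scandinavian Journal of Statistics (8) Bernoulli
The four above are first-rate journals in statistics. Biometrics leans toward biostatistics; Statistica Sinica is close in style to Annals of Statistics and is on the harder side mathematically; Scandinavian Journal of Statistics sits between JRSSB and Annals of Statistics in style; Bernoulli covers both probability and statistics, with somewhat more probability papers.
Of course, besides these four there are other good first-rate journals, such as Biostatistics, Statistical Science, Technometrics and Canadian Journal of Statistics. Some of them have higher impact factors, but in my opinion they are clearly a notch below the four above.

Econometrics and finance (journals leaning toward the humanities are not listed)
(1) Econometrica (2) Journal of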
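Finance (JF)
The two above are top journals. Econometrica's first place is unshakable; in my view its level is clearly above that of JASA and JRSSB. An average paper runs to about 60 pages, with new theory, new methods and complicated derivations. JF, in second place, is also an excellent top journal, and almost nobody in China manages to publish in it.
(3) Journal of Econometrics (JE) (4) Journal of Financial Econometrics (JFE) (5) Econometric Theory (ET)
These are first-rate journals. JE papers are fairly difficult and its level is somewhat above JFE; ET is a notch below JE and JFE but is still first-rate.

Actuarial science:
(1) Insurance: Mathematics and Economics (2) ASTIN Bulletin (3) Scandinavian Actuarial Journal (4) North American Actuarial Journal
Actuarial science has no top journal to speak of; after all it is a branch of statistics and a small field. The four above are the first-rate journals in actuarial science. The list is of course not complete, and I will not extend it for now.

Taken together, the top journals are Econometrica, JASA, JRSSB, Ann Statist and JF. With one paper in any of these, plus a reasonable number of papers in ordinary journals, getting an NSFC General Program grant should be no problem at all; with one paper a year in these journals plus a reasonable number of other papers, the Distinguished Young Scholars fund is basically within reach (especially if the paper is in Econometrica).

Source: http://user.qzone.qq.com/352693585/2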
Share: Basic SAS statistics code
xulimei1986 2016-8-8 16:10
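data need;
  set sasuser.whmms;
run;

*Check data sets;
proc contents data=need;
run;

/***********************************
[Reports and graphics output]
Covers:
- TABULATE procedure: summary tables
- MEANS procedure: basic statistics
- UNIVARIATE procedure: basic statistics
- FREQ procedure: distribution of discrete variables
- CORR procedure: correlation between two variables
- GPLOT procedure: scatter plots and line plots
- GCHART procedure: bar charts, pie charts, 3D bar charts
***********************************/

*TABULATE: grouped summary;
proc tabulate data=need;
  class brand;
  var ts9 ts10;
  table brand all,(ts9 ts10)*(n sum mean);
  keylabel n="用户数" sum="发送总量" mean="人均发送";
  label brand="品牌" all="总计" ts9="9月" ts10="10月";
run;

*MEANS: simple statistics;
proc means data=need n sum mean;
  var ts9 ts10;
  label ts9="9月" ts10="10月";
run;

*UNIVARIATE: simple statistics;
proc univariate data=need;
  var ts10;
run;

*FREQ: distribution of discrete variables;
proc freq data=need;
  tables brand sex;
run;

*CORR: correlation between variables;
proc corr data=need;
  var ts10 ts9 fee10 fee9;
run;

*GPLOT: scatter plot (i=none), then line plot (i=join);
proc gplot data=need;
  symbol i=none v=* color=blue;
  plot ts10*ts9;
quit;
proc gplot data=need;
  symbol i=join v=* color=blue;
  plot ts10*ts9;
quit;

*Vertical bar chart;
goptions reset=goptions;
proc gchart data=need;
  vbar sex;
quit;

*3D vertical bar chart;
proc gchart data=need;
  vbar3d sex;
quit;

*Horizontal bar chart;
proc gchart data=need;
  hbar sex;
quit;

*3D horizontal bar chart;
proc gchart data=need;
  hbar3d sex;
quit;

*Pie chart;
proc gchart data=need;
  pie sex/type=percent; /*percentages*/
run;

*3D pie chart;
proc gchart data=need;
  pie3d sex/type=percent;
run;

*Donut chart;
proc gchart data=need;
  donut sex/type=percent;
run;

*Star chart;
proc gchart data=need;
  star sex/type=percent;
run;

*Block chart grouped by brand;
proc gchart data=need;
  block sex/group=brand;
quit;

*G3D draws a three-dimensional surface;
data test;
  do x=-3 to 3 by 0.1;
    do y=-3 to 3 by 0.1;
      z=x**2+y**2;
      output;
    end;
  end;
run;
proc g3d data=test;
  plot x*y=z;
run;

*GCONTOUR draws the contour plot of the surface;
proc gcontour data=test;
  plot x*y=z;
run;

/***********************************
[Basic statistical analysis]
Covers:
- Normality test
- One-sample test of the mean
- Two independent samples test of the mean
- Paired-sample test of the mean
- Regression analysis
- Analysis of variance
- Contingency table test
***********************************/

/*********************** Normality test
PROC UNIVARIATE DATA=dataset NORMAL;
  VAR variable;
  HISTOGRAM variable;
  PROBPLOT variable;
RUN;
*****************************/
proc univariate data=need;
  var ts10;
  histogram ts10/ normal(color=red w=10) /*color and width of the normal curve*/
    cframe=green /*background color of the histogram*/
    cfill=blue /*fill color of the bars*/
    cbarline=white; /*outline color of the bars*/
  probplot ts10; /*probability plot*/
  qqplot ts10; /*quantile-quantile (Q-Q) plot*/
run;

/*********************** One-sample test of the mean
PROC TTEST DATA=dataset H0=value;
  VAR variable;
RUN;
*****************************/
proc ttest data=need h0=55;
  var ts10;
run;

/********************** Two independent samples test of the mean
PROC TTEST DATA=dataset;
  CLASS grouping variable;
  VAR variable;
RUN;
*****************************/
proc ttest data=need;
  class sex;
  var ts10;
  where sex="male" or sex="female";
run;

/********************** The usual nonparametric alternative is the NPAR1WAY procedure *****************************/
proc npar1way data=need;
  class sex;
  var ts10;
  where sex="male" or sex="female";
run;

/********************** Paired-sample test of the mean
PROC TTEST DATA=dataset;
  VAR ADD;
RUN;
Create a new variable add = new - old, then run a t test on add
to see whether its mean differs from 0.
*****************************/

/********************** Regression analysis
Assumptions of linear regression:
(1) Normality: the error term is normally distributed;
(2) Equal variance: the errors have the same variance for all observations;
(3) Independence: the observations are independent of one another;
(4) No autocorrelation in the residuals: the error terms are uncorrelated with each other.

SAS offers many regression procedures, including REG (regression), RSREG (quadratic response surface regression), ORTHOREG (regression for ill-conditioned data), NLIN (nonlinear regression), TRANSREG (transformation regression), CALIS (linear structural equations and path analysis), GLM (general linear models) and GENMOD (generalized linear models).

General form of REG:
PROC REG DATA=dataset options;
  VAR variable list;
  MODEL dependent variable=independent variables/SELECTION=selection method;
  PRINT output;
  PLOT diagnostic plots;
RUN;

Model selection methods:
SELECTION=FORWARD is forward selection: independent variables from the full model are added one at a time until the best model is reached.
SELECTION=BACKWARD is backward elimination: independent variables are removed from the full model one at a time until the best model is reached.
SELECTION=STEPWISE is stepwise selection, a combination of the two.

NLIN procedure (nonlinear regression)

GLM procedure
GLM stands for general linear model and fits linear models by least squares. Besides regression, the GLM procedure can do analysis of variance, analysis of covariance, multivariate analysis of variance and partial correlation analysis.

ORTHOREG procedure
Regression for ill-conditioned data; its mathematical core is also least squares. On ill-conditioned data it can give considerably more accurate results than the other linear regression procedures (REG, GLM).
*****************************/
proc reg data=need;
  var ts10 ts9 fee10 fee9;
  model ts10=ts9 fee10 fee9;
run;

proc nlin data=need;
  model ts10=a*ts9+b*fee10+c*fee9+d;
  parms a=1 b=1 c=1 d=10;
run;

/********************** Analysis of variance
General form of the ANOVA procedure:
PROC ANOVA DATA=dataset;
  CLASS factor;
  MODEL response=factor;
RUN;
*****************************/
proc anova data=need;
  class brand;
  model ts10=brand;
  means brand/t;
run;

*Contingency table test;

/***********************************
[Multivariate statistical analysis]
Covers:
- Principal component analysis
- Factor analysis
- Cluster analysis
- Discriminant analysis
***********************************/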
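The paired-sample comment in the script describes the procedure without giving code: build add = new - old and test whether its mean differs from 0. Below is a minimal Python sketch of that calculation using only the standard library. The function name `paired_t_statistic` and the sample numbers are illustrative assumptions; they do not come from the `need` data set.

```python
import math
import statistics


def paired_t_statistic(old, new):
    """Return (t, degrees_of_freedom) for H0: mean(new - old) = 0."""
    if len(old) != len(new) or len(old) < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    # The new variable from the comment: add = new - old, one value per pair.
    add = [n - o for o, n in zip(old, new)]
    mean_add = statistics.mean(add)
    sd_add = statistics.stdev(add)  # sample standard deviation (divides by n - 1)
    if sd_add == 0:
        raise ValueError("all differences are equal; the t statistic is undefined")
    t = mean_add / (sd_add / math.sqrt(len(add)))
    return t, len(add) - 1


# Hypothetical before/after measurements for four subjects.
old = [10, 12, 9, 11]
new = [12, 15, 10, 14]
t, df = paired_t_statistic(old, new)
print(round(t, 3), df)
```

The t value is then compared with a t distribution on `df` degrees of freedom, which is what PROC TTEST reports as the p-value.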
Share: Linear regression ABC
xiaocai_82 2016-5-25 21:52
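Step 1: plot the variables as line graphs to see how they relate, and decide on the model.
Step 2: estimate by OLS; check whether the t statistics are significant and how good the fit is.
Step 3: use the correlation plot to check for multicollinearity; if present, remove it with stepwise regression.
Step 4: use the White test to check for heteroskedasticity; correct it with weighted least squares.
Step 5: use the DW (Durbin-Watson) test to check for autocorrelation; correct it with generalized differencing.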
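As a rough illustration of steps 2 and 5, the sketch below fits a one-regressor OLS line and computes R² and the Durbin-Watson statistic in plain Python. The function names and the data are assumptions made for the example; the White test, stepwise regression and the corrective steps are not shown.

```python
def ols_simple(x, y):
    """Least-squares fit of y = a + b*x; returns (a, b, residuals)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b = sxy / sxx
    a = mean_y - b * mean_x
    residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]
    return a, b, residuals


def r_squared(y, residuals):
    """Goodness of fit: 1 - SSE/SST."""
    mean_y = sum(y) / len(y)
    sst = sum((yi - mean_y) ** 2 for yi in y)
    sse = sum(e * e for e in residuals)
    return 1 - sse / sst


def durbin_watson(residuals):
    """DW = sum of squared successive differences / sum of squared residuals."""
    num = sum((residuals[t] - residuals[t - 1]) ** 2
              for t in range(1, len(residuals)))
    den = sum(e * e for e in residuals)
    return num / den


# Hypothetical data, roughly y = 2x.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
a, b, res = ols_simple(x, y)
r2 = r_squared(y, res)
dw = durbin_watson(res)
print(a, b, r2, dw)
```

The DW statistic lies between 0 and 4. Values near 2 suggest no first-order autocorrelation, values toward 0 suggest positive autocorrelation, and values toward 4 suggest negative autocorrelation.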
Share: A good read on statistics, recommended to everyone
cyjqw 2016-5-9 14:38
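A book has been selling very well lately, and it is also one that Vice Premier Wang Yang of the State Council recommended to Party and government leaders. Its title is 《对我们生活的误测:为什么GDP增长不等于社会进步》 (Mismeasuring Our Lives: Why GDP Growth Does Not Equal Social Progress). Anyone who is interested might want to buy a copy and have a look.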
Share: Sharing statistical yearbooks
13-经济-杨丽丽 2016-3-31 16:24
http://nianjian.******/tags.php?/%E5%8C%97%E4%BA%AC%E5%BB%B6%E5%BA%86%E5%B9%B4%E9%89%B4/1/13459412135/
Share: Hive statistics case 3_20160328
xulimei1986 2016-3-28 15:55
use db_user_g6591;
set date1=20160302;
set date2=2016-03-02;
-- /*********** Parameter settings **************/
set var=u_vip;
set login=g18.loginrole;
set prepaid=g18v.prepaid;
set game_name='********';
set result_name=g18_huizong;
-- /*************** Drop existing tables ******************/
drop table if exists vip;
drop table if exists vip1;
drop table if exists login;
drop table if exists cost;
drop table if exists addvip;
drop table if exists result1 ;
-- /****** VIP base accumulated for the product up to yesterday: prepaid ******/
create table if not exists vip as
select account_id,get_json_object(source,"$.cash") as cash,get_json_object(source,"$.${hiveconf:var}") as vip_grade,date
from ${hiveconf:prepaid}
where date=${hiveconf:date1} ;
create table if not exists vip1 as
select account_id,max(vip_grade) as vip_grade
from vip
group by account_id
having vip_grade>0;
-- /****** VIP users who logged in yesterday *****************/
create table if not exists login as
select a.*
from vip1 a
join (select distinct account_id from ${hiveconf:login} where date='${hiveconf:date1}')b
on a.account_id=b.account_id;
-- /****** VIP users who spent yesterday ****************/
create table if not exists cost as
select a.account_id,a.vip_grade,b.cost
from vip1 a
join (select account_id,sum(cash) as cost from vip where date='${hiveconf:date1}' group by account_id)b
on a.account_id=b.account_id;
-- /******** VIP users added yesterday ***************/
create table addvip as
select *
from (select account_id, max(get_json_object(source,"$.${hiveconf:var}")) as vip_grade, min(time) as first_cost
      from ${hiveconf:prepaid}
      where date=${hiveconf:date1}
      group by account_id) b
where b.vip_grade>0 and to_date(b.first_cost)='${hiveconf:date2}';
-- /***************** Summary table ****************/
create table result1 as
select ${hiveconf:game_name} as game, '${hiveconf:date1}' as date, m.*, n.vip_add
from (select c.*,d.cost_num,d.cost
      from (select a.vip_grade,a.vip_total,b.vip_login
            from (select vip_grade,count(*) as vip_total from vip1 group by vip_grade) a
            left join (select vip_grade,count(*) as vip_login from login group by vip_grade) b
            on a.vip_grade=b.vip_grade) c
      left join (select vip_grade,count(*) as cost_num,sum(cost) as cost from cost group by vip_grade) d
      on c.vip_grade = d.vip_grade) m
left join (select vip_grade,count(*) as vip_add from addvip group by vip_grade) n
on m.vip_grade = n.vip_grade;
drop table if exists ${hiveconf:result_name} ;
create table if not exists ${hiveconf:result_name} as
select *,
  case when vip_grade=1 then '[20:100)'
       when vip_grade=2 then '[100:200)'
       when vip_grade=3 then '[200:500)'
       when vip_grade=4 then '[500:1000)'
       when vip_grade=5 then '[1000:2000)'
       when vip_grade=6 then '[2000:5000)'
       when vip_grade=7 then '[5000:10000)'
       else '10000以上' end as cost_quyu
from result1;
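To make the shape of the summary easier to see, here is a small Python sketch of two pieces of the script: the per-grade counts joined with a left join, and the final CASE mapping from `vip_grade` to a spend range. The function `summarize` and the sample accounts are hypothetical, and grades are treated as integers here, whereas the Hive script reads them out of JSON.

```python
from collections import Counter

# Same mapping as the CASE expression; any other grade falls into '10000以上'.
COST_RANGE = {
    1: "[20:100)",
    2: "[100:200)",
    3: "[200:500)",
    4: "[500:1000)",
    5: "[1000:2000)",
    6: "[2000:5000)",
    7: "[5000:10000)",
}


def summarize(vip1, logged_in):
    """vip1 maps account_id -> vip_grade; logged_in is a set of account_ids."""
    vip_total = Counter(vip1.values())
    vip_login = Counter(g for acc, g in vip1.items() if acc in logged_in)
    rows = []
    for grade in sorted(vip_total):
        rows.append({
            "vip_grade": grade,
            "vip_total": vip_total[grade],
            # Like a left join: a grade with no logins yields None (NULL).
            "vip_login": vip_login.get(grade),
            "cost_quyu": COST_RANGE.get(grade, "10000以上"),
        })
    return rows


# Hypothetical accounts.
vip1 = {"a1": 1, "a2": 1, "a3": 3, "a4": 8}
rows = summarize(vip1, logged_in={"a1", "a3"})
for row in rows:
    print(row)
```

Grades that have VIP users but no logins keep their row with an empty login count, which is why the script uses left joins from the per-grade totals rather than inner joins.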
Share: Statistical sampling
西门高 2016-1-21 21:28
Simple random sampling, systematic (equal-interval) sampling, stratified sampling, cluster sampling
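The four designs can be sketched in a few lines of Python with the standard `random` module. Everything here is an illustrative assumption: the function names, the population of 100 numbered units, and the use of `x // 10` as both the stratum and the cluster label.

```python
import random


def simple_random(population, n, rng):
    """Every unit has the same chance; draw n without replacement."""
    return rng.sample(population, n)


def systematic(population, n, rng):
    """Pick a random start, then take every k-th unit (k = N // n)."""
    k = len(population) // n
    start = rng.randrange(k)
    return [population[start + i * k] for i in range(n)]


def group_by(population, key):
    groups = {}
    for item in population:
        groups.setdefault(key(item), []).append(item)
    return groups


def stratified(population, key, fraction, rng):
    """Draw the same fraction from every stratum (at least one unit each)."""
    sample = []
    for members in group_by(population, key).values():
        size = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, size))
    return sample


def cluster(population, key, n_clusters, rng):
    """Pick whole clusters at random and keep every unit inside them."""
    clusters = group_by(population, key)
    chosen = rng.sample(sorted(clusters), n_clusters)
    return [item for c in chosen for item in clusters[c]]


rng = random.Random(2016)            # fixed seed so the run is repeatable
population = list(range(100))        # hypothetical units 0..99
label = lambda x: x // 10            # ten groups of ten units

srs = simple_random(population, 10, rng)
sys_sample = systematic(population, 10, rng)
strat = stratified(population, label, 0.2, rng)
clus = cluster(population, label, 3, rng)
print(len(srs), len(sys_sample), len(strat), len(clus))
```

Stratified sampling draws from every group, whereas cluster sampling keeps only a few groups but takes all of their units; that is the main practical difference between the last two designs.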
Share: Hive statistics case 2
xulimei1986 2016-1-15 15:44
set begin_date0=20150501;
set end_date0=20150831;
drop table if exists g2_1b_login;
drop table if exists g3_1b_login;
drop table if exists g4_1b_login;
drop table if exists g15_1b_login;
drop table if exists g18_1b_login;
drop table if exists h2_1b_login;
drop table if exists wscs_1b_login;
drop table if exists ma32_1b_login;
drop table if exists ma30_1b_login;
drop table if exists ma43_1b_login;
drop table if exists g2_1n_cost;
drop table if exists g3_1n_cost;
drop table if exists g4_1n_cost;
drop table if exists g15_1n_cost;
drop table if exists g18_1n_cost;
drop table if exists h2_1n_cost;
drop table if exists wscs_1n_cost;
drop table if exists ma32_1n_cost;
drop table if exists ma30_1n_cost;
drop table if exists ma43_1n_cost;
-- **********************************************************************
create table if not exists g2_1b_login as select account_id, to_date(time) as login_time from g2.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}';
create table if not exists g3_1b_login as select account_id, to_date(time) as login_time from g3.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}';
create table if not exists g4_1b_login as select account_id, to_date(time) as login_time from g4.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}';
create table if not exists g15_1b_login as select account_id, to_date(time) as login_time from g15.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}';
create table if not exists g18_1b_login as select account_id, to_date(time) as login_time from g18.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}';
create table if not exists h2_1b_login as select account_id, to_date(time) as login_time from h2.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}';
create table if not exists wscs_1b_login as select account_id, to_date(time) as login_time
from wscs.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists ma32_1b_login as select account_id, to_date(time) as login_time from ma32.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists ma30_1b_login as select account_id, to_date(time) as login_time from ma30.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists ma43_1b_login as select account_id, to_date(time) as login_time from ma43.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; -- ****************************************************************************************** create table if not exists g2_1n_cost as select b.account_id, to_date(a.time) as cost_time, cash from (select * from g2.mall where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}')a left join (select distinct role_id,server,account_id from g2.loginrole where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}') b on a.role_id=b.role_id and a.server=b.server; create table if not exists g3_1n_cost as select account_id, to_date(time) as cost_time, get_json_object(a.source,'$.cash') as cash from g3.prepaid a where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists g4_1n_cost as select account_id, to_date(time) as cost_time, get_json_object(a.source,'$.cash') as cash from g4.prepaid a where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists g15_1n_cost as select account_id, to_date(time) as cost_time, get_json_object(a.source,'$.cash') as cash from g15.prepaid a where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists g18_1n_cost as select account_id, to_date(time) as cost_time, get_json_object(a.source,'$.cash') as cash from g18.prepaid a where date between 
'${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists h2_1n_cost as select account_id, to_date(time) as cost_time, get_json_object(a.source,'$.cash') as cash from h2.prepaid a where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists wscs_1n_cost as select account_id, to_date(time) as cost_time, get_json_object(a.source,'$.cash') as cash from wscs.prepaid a where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists ma32_1n_cost as select account_id, to_date(time) as cost_time, get_json_object(a.source,'$.cash') as cash from ma32.prepaid a where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists ma30_1n_cost as select account_id, to_date(time) as cost_time, get_json_object(a.source,'$.cash') as cash from ma30.prepaid a where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; create table if not exists ma43_1n_cost as select account_id, to_date(time) as cost_time, get_json_object(a.source,'$.cash') as cash from ma43.prepaid a where date between '${hiveconf:begin_date0}' and '${hiveconf:end_date0}'; -- select min(login_time) as min1, max(login_time) as max1 from g2_1b_login; -- select min(login_time) as min2, max(login_time) as max2 from g4_1b_login; -- select min(login_time) as min3, max(login_time) as max3 from g15_1b_login; -- select min(login_time) as min4, max(login_time) as max4 from g18_1b_login; -- select min(login_time) as min5, max(login_time) as max5 from h2_1b_login; -- select min(cost_time) as min1, max(cost_time) as max1 from g2_1n_cost; -- select min(cost_time) as min2, max(cost_time) as max2 from g4_1n_cost; -- select min(cost_time) as min3, max(cost_time) as max3 from g15_1n_cost; -- select min(cost_time) as min4, max(cost_time) as max4 from g18_1n_cost; -- select min(cost_time) as min5, max(cost_time) as max5 from h2_1n_cost;
Share: Hive statistics case 1
xulimei1986 2016-1-15 15:40
use data_ana; -- ***************************************************************************************************** drop table if exists ma30_2a_sample; -- drop table if exists ma30_2b_login; drop table if exists ma30_2c_login1; drop table if exists ma30_2d_login2; drop table if exists ma30_2e_login3; drop table if exists ma30_2f_sample1; drop table if exists ma30_2g_churn_result; -- drop table if exists ma30_2h_cost; drop table if exists ma30_2i_cost1; drop table if exists ma30_2j_cost2; drop table if exists ma30_2k_result; drop table if exists ma30_2l_result1; drop table if exists ma30_2m_result2; drop table if exists ma30_2m_result2a; drop table if exists ma30_2m_result2e; drop table if exists ma30_2m_result3; drop table if exists ma30_2y_retention_acc; drop table if exists ma30_2y_retention_acc1; -- drop table if exists ma30_2y_retention_acc2; drop table if exists ma30_2z_output1; drop table if exists ma30_2z_output2; drop table if exists ma30_2z_output3; drop table if exists ma30_2z_output4; -- ***************************************************************************************************** -- 导入操作样本集 drop table if exists ma30_2a_sample; create external table if not exists ma30_2a_sample( perior string, account_id string, start_time string, survey_time string, settle_time string, sample_type string, channel string, survey_type string, survey_type2 string, employee string ) partitioned by (date string) row format delimited fields terminated by '\t' lines terminated by '\n' location '/user/g8123/ma30_retention'; set game_name=魔天记; set perior=4; set begin_date=20150801; set end_date=20150831; set begin_date1=2015-07-31; set end_date1=2015-09-01; -- ***************************************************************************************************** alter table ma30_2a_sample add if not exists partition(date='${hiveconf:end_date}'); -- ***************************************************************************************************** -- 从ma30_loginrole 
-- Keep only the required columns from ma30.loginrole
-- drop table if exists ma30_2b_login;
create table if not exists ma30_2b_login as
select account_id, to_date(time) as login_time
from ma30.loginrole
where date between '${hiveconf:begin_date}' and '${hiveconf:end_date}';
-- Keep only the required columns from ma30.prepaid
-- drop table if exists ma30_2h_cost;
create table if not exists ma30_2h_cost as
select account_id, to_date(time) as cost_time, get_json_object(source,'$.cash') as cash
from ma30.prepaid
where date between '${hiveconf:begin_date}' and '${hiveconf:end_date}';
-- Select the sample to analyse
drop table if exists ma30_2f_sample1;
create table if not exists ma30_2f_sample1 as
select * from ma30_2a_sample
where date='${hiveconf:end_date}' and length(trim(perior)) = 2;
-- *****************************************************************************************************
-- Deduplicate login data by account and login date
drop table if exists ma30_2c_login1;
create table if not exists ma30_2c_login1 as
select distinct account_id, login_time from ma30_2b_login;
-- *****************************************************************************************************
-- Add one record per account with the login date set to begin_date1 (the day before the period starts)
insert into table ma30_2c_login1
select account_id, '${hiveconf:begin_date1}'
from (select distinct account_id from ma30_2f_sample1
      where split(sample_type,'年')[1] <> '8月成功' and split(sample_type,'年')[1] <> '8月失败') as tmp;
-- For the August success/failure samples, use the survey date as the starting record instead
insert into table ma30_2c_login1
select distinct account_id, survey_time as login_time
from ma30_2f_sample1
where split(sample_type,'年')[1] = '8月成功' or split(sample_type,'年')[1] = '8月失败';
-- Add one record per account with the login date set to end_date1 (the day after the period ends)
insert into table ma30_2c_login1
select account_id, '${hiveconf:end_date1}'
from (select distinct account_id from ma30_2f_sample1 ) as tmp;
-- Compute the gap between every two consecutive records of each account
drop table if exists ma30_2d_login2;
create table if not exists ma30_2d_login2 as
select *, row_number() over (distribute by
account_id sort by login_time ) row_num
from ma30_2c_login1
where account_id is not null
sort by account_id,login_time;
drop table if exists ma30_2e_login3;
create table if not exists ma30_2e_login3 as
select account_id, max(intervel) as daydif
from ( select a.account_id, datediff(b.login_time,a.login_time) as intervel
       from ma30_2d_login2 a
       join ma30_2d_login2 b
       on a.account_id = b.account_id and b.row_num = a.row_num +1) m
group by account_id;
-- *****************************************************************************************************
-- Join back to the sample table and set the churn flag (0 = longest gap under 8 days, 1 = churned)
drop table if exists ma30_2g_churn_result;
create table if not exists ma30_2g_churn_result as
select *, case when daydif<8 then 0 else 1 end as flag
from (select a.*, b.daydif
      from ma30_2f_sample1 a
      left join ma30_2e_login3 b
      on a.account_id=b.account_id) m;
-- select sample_type,survey_type,flag,count(account_id) as countc
-- from ma30_2g_churn_result
-- group by sample_type,survey_type,flag;
-- *****************************************************************************************************
-- Sum spending by account and spending date
drop table if exists ma30_2i_cost1;
create table if not exists ma30_2i_cost1 as
select *
from (select a.perior, a.account_id, a.survey_time, b.cost_time, cast(b.cash as int) as cash
      from ma30_2f_sample1 a
      left join ma30_2h_cost b
      on a.account_id=b.account_id) as tmp
where cost_time is not null;
drop table if exists ma30_2j_cost2;
create table if not exists ma30_2j_cost2 as
select perior, account_id, sum(cash) as cash
from ma30_2i_cost1
where cost_time=survey_time
group by perior,account_id;
-- *****************************************************************************************************
-- Master table
drop table if exists ma30_2k_result;
create table if not exists ma30_2k_result as
select *,
  case when cash>0 then 1 else 0 end as cost_flag,
  case when cash is null then 0 else cash end as cost
from (select a.*, b.cash
      from ma30_2g_churn_result a
      left join ma30_2j_cost2 b
      on a.account_id=b.account_id and
a.perior=b.perior)m; drop table if exists ma30_2l_result1; create table if not exists ma30_2l_result1 as select *, case when channel='官网IOS用户' then cost*0.7 when channel='渠道用户' then cost*0.5 when channel='官网安卓用户' then cost end as shouyi, concat(split(trim(sample_type),'年') ,'年',lpad(split(substr(trim(sample_type),6),'月') ,2,'0'),'月') as sample_type1 from ma30_2k_result; drop table if exists ma30_2m_result2; create table if not exists ma30_2m_result2 as select '魔天记' as game_name, a.perior, a.sample_type1, a.wx_num+b.wx_num as oper_num, a.wx_num as wx_num_suc, a.churn_num as churn_num_suc, a.churn_ratio as churn_ratio_suc, a.cost_num as cost_num_suc, a.cost as cost_suc, b.wx_num as wx_num_fail, b.churn_num as churn_num_fail, b.churn_ratio as churn_ratio_fail, b.cost_num as cost_num_fail, b.cost as cost_fail, b.churn_ratio-a.churn_ratio as ratio_dif, round(a.wx_num*(b.churn_ratio-a.churn_ratio)) as back_num, a.wx_num*(b.churn_ratio-a.churn_ratio)*a.cost/(a.wx_num-a.churn_num) as profit from (select perior, sample_type1, sample_type, count(account_id)as wx_num, sum(flag)as churn_num, sum(flag)/count(account_id) as churn_ratio, sum(case when flag=0 then cost_flag else 0 end) as cost_num, sum(case when flag=0 then shouyi else 0 end) as cost from ma30_2l_result1 where split(trim(sample_type),'月') ='成功' group by perior,sample_type1,sample_type) a, (select perior, sample_type1, sample_type, count(account_id)as wx_num, sum(flag)as churn_num, sum(flag)/count(account_id) as churn_ratio, sum(case when flag=0 then cost_flag else 0 end) as cost_num, sum(case when flag=0 then shouyi else 0 end) as cost from ma30_2l_result1 where split(trim(sample_type),'月') ='失败' group by perior,sample_type1,sample_type) b where a.perior=b.perior and a.sample_type1=b.sample_type1; drop table if exists ma30_2m_result2a; create table if not exists ma30_2m_result2a as select '魔天记' as game_name, perior, '合计' as sample_type1, sum(oper_num) as oper_num, sum(wx_num_suc) as wx_num_suc, sum(churn_num_suc) 
as churn_num_suc, sum(churn_num_suc)/sum(wx_num_suc) as churn_ratio_suc, sum(cost_num_suc) as cost_num_suc, sum(cost_suc) as cost_suc, sum(wx_num_fail) as wx_num_fail, sum(churn_num_fail) as churn_num_fail, sum(churn_num_fail)/sum(wx_num_fail) as churn_ratio_fail, sum(cost_num_fail) as cost_num_fail, sum(cost_fail) as cost_fail, sum(churn_num_fail)/sum(wx_num_fail)-sum(churn_num_suc)/sum(wx_num_suc) as ratio_dif, sum(back_num) as back_num, sum(profit) as profit from ma30_2m_result2 group by perior; drop table if exists ma30_2m_result3; create table if not exists ma30_2m_result3 as select * from (select * from ma30_2m_result2a union all select * from ma30_2m_result2) m sort by sample_type1; -- 统计每个员工绩效 drop table if exists ma30_2m_result2e; create table if not exists ma30_2m_result2e as select '魔天记' as game_name, a.perior, a.employee, a.sample_type1, a.wx_num as wx_num_suc, a.churn_num as churn_num_suc, a.churn_ratio as churn_ratio_suc, a.cost_num as cost_num_suc, a.cost as cost_suc, b.wx_num as wx_num_fail, b.churn_num as churn_num_fail, b.churn_ratio as churn_ratio_fail, b.cost_num as cost_num_fail, b.cost as cost_fail, b.churn_ratio-a.churn_ratio as ratio_dif, round(a.wx_num*(b.churn_ratio-a.churn_ratio)) as back_num, a.wx_num*(b.churn_ratio-a.churn_ratio)*a.cost/(a.wx_num-a.churn_num) as profit from (select perior, employee, sample_type1, count(account_id)as wx_num, sum(flag)as churn_num, sum(flag)/count(account_id) as churn_ratio, sum(case when flag=0 then cost_flag else 0 end) as cost_num, sum(case when flag=0 then shouyi else 0 end) as cost from ma30_2l_result1 where split(trim(sample_type),'月') ='成功' group by perior,employee,sample_type1) a, (select perior, sample_type1, count(account_id)as wx_num, sum(flag)as churn_num, sum(flag)/count(account_id) as churn_ratio, sum(case when flag=0 then cost_flag else 0 end) as cost_num, sum(case when flag=0 then shouyi else 0 end) as cost from ma30_2l_result1 where split(trim(sample_type),'月') ='失败' group by 
     perior,sample_type1) b
where a.perior=b.perior and a.sample_type1=b.sample_type1;

-- *****************************************************************************************************
-- Compute cumulative data
-- Copy the previous period's table
-- (assumes a strict less-than comparison, so only earlier periods are kept;
--  the '合计' row casts to NULL and is filtered out as well)
drop table if exists ma30_2y_retention_acc;
create table if not exists ma30_2y_retention_acc as
select game_name,perior,oper_num,
  wx_num_suc,churn_num_suc,churn_ratio_suc,
  cost_suc,
  wx_num_fail,churn_num_fail,churn_ratio_fail,
  cost_fail,
  ratio_dif,back_num,profit
from ma30_2y_retention_acc2
where cast(perior as int) < cast('${hiveconf:perior}' as int);

-- Insert the new period's totals
insert into table ma30_2y_retention_acc
select game_name,perior,oper_num,
  wx_num_suc,churn_num_suc,churn_ratio_suc,
  cost_suc,
  wx_num_fail,churn_num_fail,churn_ratio_fail,
  cost_fail,
  ratio_dif,back_num,profit
from ma30_2m_result2a;

-- Merge the per-period rows with the overall total
drop table if exists ma30_2y_retention_acc2;
create table if not exists ma30_2y_retention_acc2 as
select * from
  (select * from ma30_2y_retention_acc
   union all
   select game_name,
     '合计' as perior,
     sum(oper_num) as oper_num,
     sum(wx_num_suc) as wx_num_suc,
     sum(churn_num_suc) as churn_num_suc,
     sum(churn_num_suc)/sum(wx_num_suc) as churn_ratio_suc,
     sum(cost_suc) as cost_suc,
     sum(wx_num_fail) as wx_num_fail,
     sum(churn_num_fail) as churn_num_fail,
     sum(churn_num_fail)/sum(wx_num_fail) as churn_ratio_fail,
     sum(cost_fail) as cost_fail,
     sum(churn_num_fail)/sum(wx_num_fail)-sum(churn_num_suc)/sum(wx_num_suc) as ratio_dif,
     sum(back_num) as back_num,
     sum(profit) as profit
   from ma30_2y_retention_acc
   group by game_name) m;

-- ****************************************************************************************************
-- Adjust the output format
-- Effect data newly added this period
drop table if exists ma30_2z_output1;
create table if not exists ma30_2z_output1 as
select
  coalesce(cast(substr(game_name,1,1) as bigint),100) as game_name0,
  coalesce(cast(substr(sample_type1,1,1) as bigint),100) as sample_type0,
  *
from
  (select * from z_retention_output1_vars
   union all
   select game_name, perior, sample_type1,
     format_number(round(coalesce(oper_num,0)),0) as oper_num,
     format_number(round(coalesce(wx_num_suc,0)),0) as wx_num_suc,
     format_number(round(coalesce(churn_num_suc,0)),0) as churn_num_suc,
     concat(round(coalesce(churn_ratio_suc,0)*100,2),'%') as churn_ratio_suc,
     format_number(round(coalesce(cost_num_suc,0)),0) as cost_num_suc,
     format_number(round(coalesce(cost_suc,0)),0) as cost_suc,
     format_number(round(coalesce(wx_num_fail,0)),0) as wx_num_fail,
     format_number(round(coalesce(churn_num_fail,0)),0) as churn_num_fail,
     concat(round(coalesce(churn_ratio_fail,0)*100,2),'%') as churn_ratio_fail,
     format_number(round(coalesce(cost_num_fail,0)),0) as cost_num_fail,
     format_number(round(coalesce(cost_fail,0)),0) as cost_fail,
     concat(round(coalesce(ratio_dif,0)*100,2),'%') as ratio_dif,
     format_number(round(coalesce(back_num,0)),0) as back_num,
     format_number(round(coalesce(profit,0)),0) as profit
   from ma30_2m_result3) m
sort by game_name0,sample_type1;

-- Cumulative effect data through this period
drop table if exists ma30_2z_output2;
create table if not exists ma30_2z_output2 as
select
  coalesce(cast(substr(game_name,1,1) as bigint),100) as game_name0,
  coalesce(cast(perior as bigint),100) as perior0,
  *
from
  (select * from z_retention_output2_vars
   union all
   select game_name, perior,
     format_number(round(coalesce(oper_num,0)),0) as oper_num,
     format_number(round(coalesce(wx_num_suc,0)),0) as wx_num_suc,
     format_number(round(coalesce(churn_num_suc,0)),0) as churn_num_suc,
     concat(round(coalesce(churn_ratio_suc,0)*100,2),'%') as churn_ratio_suc,
     format_number(round(coalesce(cost_suc,0)),0) as cost_suc,
     format_number(round(coalesce(wx_num_fail,0)),0) as wx_num_fail,
     format_number(round(coalesce(churn_num_fail,0)),0) as churn_num_fail,
     concat(round(coalesce(churn_ratio_fail,0)*100,2),'%') as churn_ratio_fail,
     format_number(round(coalesce(cost_fail,0)),0) as cost_fail,
     concat(round(coalesce(ratio_dif,0)*100,2),'%') as ratio_dif,
     format_number(round(coalesce(back_num,0)),0) as back_num,
     format_number(round(coalesce(profit,0)),0) as profit
   from ma30_2y_retention_acc2) m
sort by game_name0,perior0;

-- Detail data
drop table if exists ma30_2z_output3;
create table if not exists ma30_2z_output3 as
select
  coalesce(cast(perior as bigint),0) as perior0,
  *
from
  (select * from z_retention_output3_vars
   union all
   select perior, account_id, start_time, survey_time, settle_time,
     sample_type, channel, survey_type, survey_type2,
     daydif, flag, cash, cost_flag, cost,
     cast(shouyi as string) as shouyi,
     employee
   from ma30_2l_result1) m
sort by perior0;

-- Employee performance
drop table if exists ma30_2z_output4;
create table if not exists ma30_2z_output4 as
select
  coalesce(cast(perior as bigint),0) as perior0,
  *
from
  (select * from z_retention_output4_vars
   union all
   select game_name, perior, employee, sample_type1,
     format_number(round(coalesce(wx_num_suc,0)),0) as wx_num_suc,
     format_number(round(coalesce(churn_num_suc,0)),0) as churn_num_suc,
     concat(round(coalesce(churn_ratio_suc,0)*100,2),'%') as churn_ratio_suc,
     format_number(round(coalesce(cost_num_suc,0)),0) as cost_num_suc,
     format_number(round(coalesce(cost_suc,0)),0) as cost_suc,
     format_number(round(coalesce(wx_num_fail,0)),0) as wx_num_fail,
     format_number(round(coalesce(churn_num_fail,0)),0) as churn_num_fail,
     concat(round(coalesce(churn_ratio_fail,0)*100,2),'%') as churn_ratio_fail,
     format_number(round(coalesce(cost_num_fail,0)),0) as cost_num_fail,
     format_number(round(coalesce(cost_fail,0)),0) as cost_fail,
     concat(round(coalesce(ratio_dif,0)*100,2),'%') as ratio_dif,
     format_number(round(coalesce(back_num,0)),0) as back_num,
     format_number(round(coalesce(profit,0)),0) as profit
   from ma30_2m_result2e) m
sort by perior0,employee,sample_type1;

-- ****************************************************************************************************
-- Export data
insert overwrite directory '/user/g8123/result/retention/20150831/ma30/output1'
select game_name,sample_type1,oper_num,
  wx_num_suc,churn_num_suc,churn_ratio_suc,cost_num_suc,cost_suc,
  wx_num_fail,churn_num_fail,churn_ratio_fail,cost_num_fail,cost_fail,
  ratio_dif,back_num,profit
from ma30_2z_output1;

insert overwrite directory '/user/g8123/result/retention/20150831/ma30/output2'
select game_name,perior,oper_num,
  wx_num_suc,churn_num_suc,churn_ratio_suc,cost_suc,
  wx_num_fail,churn_num_fail,churn_ratio_fail,
  cost_fail,ratio_dif,back_num,profit
from ma30_2z_output2;

insert overwrite directory '/user/g8123/result/retention/20150831/ma30/output3'
select perior,account_id,start_time,survey_time,settle_time,sample_type,channel,survey_type,survey_type2,
  daydif,flag,cash,cost_flag,cost,shouyi,employee
from ma30_2z_output3;

insert overwrite directory '/user/g8123/result/retention/20150831/ma30/output4'
select game_name,perior,employee,sample_type1,
  wx_num_suc,churn_num_suc,churn_ratio_suc,cost_num_suc,cost_suc,
  wx_num_fail,churn_num_fail,churn_ratio_fail,cost_num_fail,cost_fail,
  ratio_dif,back_num,profit
from ma30_2z_output4;
0 comments
Shared: A few concepts in statistics
西门高 2016-1-12 21:42
Population, sample, random variable, distribution, estimation, hypothesis testing
13 views | 0 comments