6557 26

Free : Bioinformatics Programming Using Python [推广有奖]

贵宾

学术权威

10%

还不是VIP/贵宾

-

TA的文库  其他...

Data Mining

威望
3
论坛币
84003 个
通用积分
16348.6373
学术水平
669 点
热心指数
648 点
信用等级
501 点
经验
8914 点
帖子
3383
精华
20
在线时间
2864 小时
注册时间
2012-11-11
最后登录
2024-4-1

初级学术勋章 初级热心勋章 初级信用勋章 中级学术勋章 中级热心勋章 中级信用勋章 高级学术勋章 高级热心勋章 高级信用勋章

相似文件 换一批

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
Unnamed QQ Screenshot20140919104120.png

Comparing to Perl, Python has a quite lagged adoption as the scripting language of choice in the field of bioinformatics, although it is getting some moment recently. If you read job descriptions for bioinformatics engineer or scientist positions a few year back, you barely saw Python mentioned, even as "nice to have optional skill". One of the reasons is probably lacking of good introductory level bioinformatics books in Python so there are, in general, less people thinking Python as a good choice for bioinformatics. The book "Beginning Perl for Bioinformatics" from O Reilly was published in 2001. Almost one decade later, we finally get the book "Bioinformatics Programming Using Python" from Mitchell Model to fill the gap. When I first skimmed the book "Bioinformatics Programming Using Python", I got the impression that this book was more like "learning python using bioinformatics as examples" and felt a little bit disappointed as I was hoping for more advanced content. However, once I went through the book, reading the preface and everything else chapter by chapter, I understood the main target audiences that author had in mind and I thought the author did a great job in fulfilling the main purpose. In modern biological research, scientists can easily generate large amount of data where Excel spreadsheets that most bench scientists use to process limiting amount of data is no longer an option. I personally believe that the new generation of biologists will have to learn how to process and manage large amount inhomogeneous data to make new discovery out of it. This requires general computational skill beyond just knowing how to use some special purpose applications that some software vendor can provide. The book gives good introduction about practical computational skills using Python to process bioinformatics data. The book is very well organized for a newbie who just wants to start to process the raw data their own and get into a process of learning-by-doing to become a Python programmer. The book starts with an introduction on the primitive data types in Python and moves toward the flow controls and collection data type with emphasis on, not surprisingly, string processing and file parsing, two of most common tasks in bioinformatics. Then, the author introduces the object-oriented programming in Python. I think a beginner will also like those code templates for different patterns of data processing task in Chapter 4. They summarize the usual flow structure for common tasks very well. After giving the basic concept of programming with Python, the author focuses on other utilities which are very useful for day-to-day work for gathering, extracting, and processing data from different data sources. For example, the author discusses about how to explore and organize files with Python in the OS level, using regular expression for extracting complicated text data file, XML processing, web programming for fetching online biological data and sharing data with a simple web server, and, of course, how to program Python to interact with a database. The deep knowledge of all of these topics might deserve their own books. The author does a good job to cover all these topics in a concise way. This will help people to know what can be done very easily with Python and, if they want, to learn any of those topic more from other resources. The final touch of the book is on structured graphics. This is very wise choice since the destiny of most of bioinformatics data is very likely to be some graphs used in presentations and for publishing. Again, there are many other Python packages can help scientists to generate nice graph, but the author focuses on one or two of them to show the readers how to do general some graphs with them and the reader might be able to learn something else from there. One thing I hope the author can also cover, at least at a beginner level, is the numerical and statistical aspect in bioinformatics computing with Python. For example, Numpy or Scipy are very useful for processing large amount of data, generating statistics and evaluating significance of the results. They are very useful especially for processing large amount data where the native Python objects are no longer efficient enough. The numerical computation aspect in bioinformatics is basically lacking in the book. The other thing that might be desirable for such a book is to show that Python is a great tool for prototyping some algorithms in bioinformatics. This is probably my own personal bias, but I do think it is nice to show some basic bioinformatics algorithm implementations in python. This will help the readers to understand a little bit more about some of the common algorithms used in the field and to get a taste on a little bit more advanced programming. Overall, I will not hesitate to recommend this book to any one who will like to start to process biological data on their own with Python. Moreover, it can actually serve as a good introductory book to Python regardless the main focus on bioinformatics examples. The book covers most day-to-day basic bioinformatics tasks and shows Python is a great tool for those tasks. I think a little more advanced topics, especially on basic numerical and statistical computation in the book, will also help the target audiences. Unfortunately, none of that topic is mentioned in the book. That has been said, even if you are an experienced python programmer in bioinformatics, the book's focus on Python 3 and a lot of useful templates might serve well as a quick reference if you are looking for something you do not have direct experience before.

本帖隐藏的内容

222D73B448A43F33FA9FF7C1FBE2FD93.pdf (4.09 MB)



Unnamed QQ Screenshot20140919104147.png Unnamed QQ Screenshot20140919104214.png

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Programming Informatics informat informa Program recently although general getting people

本帖被以下文库推荐

If you want something,work for it!
沙发
狂热的爱好者 学生认证  发表于 2014-9-19 13:52:18 |只看作者 |坛友微信交流群

使用道具

藤椅
dxystata 发表于 2014-9-19 15:07:38 |只看作者 |坛友微信交流群
谢谢分享!

使用道具

板凳
aasa11 发表于 2014-9-19 21:44:45 |只看作者 |坛友微信交流群
多谢楼主分享

使用道具

报纸
minixi 发表于 2014-9-19 23:47:44 |只看作者 |坛友微信交流群
谢谢分享。

使用道具

地板
Nicolle 学生认证  发表于 2014-9-20 07:50:32 |只看作者 |坛友微信交流群
提示: 作者被禁止或删除 内容自动屏蔽

使用道具

7
oink-oink 发表于 2014-9-20 08:30:30 |只看作者 |坛友微信交流群

使用道具

8
sigmund 在职认证  发表于 2014-9-20 12:25:13 |只看作者 |坛友微信交流群
kan yi ikan

使用道具

9
狂热的爱好者 学生认证  发表于 2014-9-20 23:43:50 |只看作者 |坛友微信交流群

使用道具

10
kutuomonk 发表于 2014-9-24 14:04:45 |只看作者 |坛友微信交流群
谢谢分享

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-4-25 20:55