楼主: ReneeBK
1711 8

Graph-based Natural Language Processing and Information Retrieval [推广有奖]

  • 1关注
  • 62粉丝

VIP

学术权威

14%

还不是VIP/贵宾

-

TA的文库  其他...

R资源总汇

Panel Data Analysis

Experimental Design

威望
1
论坛币
49517 个
通用积分
53.5804
学术水平
370 点
热心指数
273 点
信用等级
335 点
经验
57815 点
帖子
4006
精华
21
在线时间
582 小时
注册时间
2005-5-8
最后登录
2023-11-26

相似文件 换一批

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

Graph-based Natural Language Processing and Information Retrieval

By: Rada Mihalcea; Dragomir Radev

Publisher: Cambridge University Press

Pub. Date: April 11, 2011

Print ISBN: 978-0-521-89613-9

Pages in Print Edition: 202

Subscriber Rating: [0 Ratings]


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:information Informatio Processing Retrieval formation Natural

回帖推荐

zgq.8026 发表于7楼  查看完整内容

有点儿贵,不好意思,最近想要赚一些论坛币,你看着办,想要下载就下载吧,绝对是你要的书,我检测过了

本帖被以下文库推荐

沙发
ReneeBK 发表于 2015-6-13 12:13:03 |只看作者 |坛友微信交流群

Depth-First Graph Traversal


Figure 2.2. Depth-first traversal of a graph

使用道具

藤椅
ReneeBK 发表于 2015-6-13 12:14:37 |只看作者 |坛友微信交流群

Breadth-First Graph Traversal


Figure 2.4. Breadth-first traversal of a graph.

Similar to the depth-first traversal algorithm, the running time of the breadth-first traversal is O(|E| + |V|).

The depth-first and breadth-first traversal strategies also can be used to search for a node in the graph. These strategies are referred to as depth-first search andbreadth-first search algorithms, respectively. Basically, the traversal algorithm is used until the target node is reached, at which point the algorithm stops.


使用道具

板凳
ReneeBK 发表于 2015-6-13 12:16:11 |只看作者 |坛友微信交流群

Minimum Spanning Trees

Minimum spanning trees have the following properties: (1) As shown in Figure 2.5, it is not unusual for a graph to have several minimum spanning trees. However, even if a graph admits several minimum spanning trees, all of the trees will have the same total weight. (2) For every edge not included in the minimum spanning tree, adding the edge to the tree results in the construction of a cycle. Moreover, the newly added edge will be a maximum-weight edge in that cycle. (3) The number of edges in a minimum spanning tree is equal to the number of nodes in the graph minus 1.

Several algorithms can be applied to identify the minimum spanning trees of a graph, including Prim’s algorithm. Briefly, the algorithm starts with an arbitrary node (referred to as the “root”). Next, at each iteration, it branches out from the tree constructed so far by choosing an edge that has the minimum weight among all the edges that could be attached. The algorithm adds to the tree the vertex that is associated with that edge. Thus, vertices in the graph can be divided into three disjoint categories: (1) tree vertices, which are the vertices belonging to the tree constructed so far; (2) fringe vertices, which are the vertices that are not part of the tree but are adjacent to some vertex in the tree; and (3) unseen vertices, which are all of the other vertices in the graph. Prim’s algorithm is described in the pseudocode in Figure 2.6.


使用道具

报纸
ReneeBK 发表于 2015-6-13 12:18:47 |只看作者 |坛友微信交流群
Shortest-Path Algorithms
Given a graph G and a source node N in the graph, the shortest-path problem is the problem of finding the shortest path from N to any other vertex in the graph. In the case of unweighted graphs, the length of a path is calculated as the number of edges on the path. In the case of weighted graphs, the length is calculated as the sum of the weights of all edges on the path. The algorithm used for the case of weighted graphs also is referred to as Dijkstra’s algorithm.

Figure 2.7. Shortest paths starting with the source node C.

The pseudocode for finding the shortest paths from a source node to all other nodes in an unweighted graph is shown in Figure 2.8.

Figure 2.8. Pseudo-code algorithm for finding the shortest path in unweighted graphs.

When the shortest path must be calculated between each pair of vertices in the graph, other algorithms such as the Floyd–Warshall algorithm can be used, which minimizes the number of path comparisons by using dynamic programming.

The shortest-path algorithm also can be applied to weighted graphs, with small modifications. Primarily, the path is calculated as the sum of weights of the edges; the length of the shortest path to a node can be updated if a new shorter path is found.


使用道具

地板
ReneeBK 发表于 2015-6-13 12:22:28 |只看作者 |坛友微信交流群
Random Walks

Imagine a person, Paul, standing on the corner of 43rd Street and Madison Avenue in Midtown Manhattan during a blackout (Figures 2.19 and 2.20). Paul is a few blocks away from his home on the corner of 47th Street and Madison Avenue. However, in the dark, he cannot see anything and wanders aimlessly from one street corner to another, going up and down Madison Avenue. At each step, he has a 50 percent chance of going up the avenue and a 50 percent chance of going down. His goal is to get home safely rather than fall in the open manhole located at the corner of 42nd street and Madison Avenue. We assume that the random walk will stop when one of two things happens: Paul either ends up in the manhole or he arrives home safely. It can be proven mathematically that for a given probability p, he eventually will end in one of those two places with a probability no larger than p.

Figure 2.19. Map of Midtown Manhattan.

Figure 2.20. Random walk with 6 steps (ϕ to N = 5). For simplicity, node labels starting at zero are used.

If he is unlucky, Paul’s very first step will take him to the manhole. So, the probability of this event (let us call it M) is at least 0.5. If he is lucky, Paul will move away from the manhole in the first step; however, if after that he makes two more steps toward the manhole, he will fall into it. The probability of this happening is 0.5 ∗ 0.5 ∗ 0.5 = 0.53 = 0.125. Obviously, there are many other sequences of steps that lead to the event M. As can be seen, the probability of the event H = 1 – M (i.e., arriving home safely) seems rather small. How can we compute it?

Clearly, the value of pH is a function of the starting point. In one extreme case, Paul would start at 47th Street. In that case, he is already safely at home, so the probability pH(47) = 1. In the other extreme case, he starts at 42nd Street and falls immediately into the hole, resulting in pH(42) = 0. How can we find the value ofPH(i) for an arbitrary street number i? This is an example of a random-walk model in which a walker takes random steps on the graph G, with the walk being modeled as a Markov process – that is, the decision on what edge to follow is solely based on the vertex where the walker is currently located. Under certain conditions, this model converges to a stationary distribution of probabilities, associated with vertices in the graph. Chapter 5 provides more details on random-walk algorithms, and specifically on the PageRank algorithm used in information retrieval.


使用道具

7
zgq.8026 发表于 2015-6-13 12:57:26 |只看作者 |坛友微信交流群
有点儿贵,不好意思,最近想要赚一些论坛币,你看着办,想要下载就下载吧,绝对是你要的书,我检测过了

Graph-based Natural Language Processing and Information Retrieval.pdf

1.54 MB

需要: 200 个论坛币  [购买]

有点儿贵

已有 1 人评分论坛币 收起 理由
Nicolle + 100 奖励积极上传好的资料

总评分: 论坛币 + 100   查看全部评分

使用道具

8
RiverSide 发表于 2015-10-5 15:08:46 |只看作者 |坛友微信交流群
你收入了多少?  

使用道具

9
bbslover 在职认证  发表于 2016-4-30 02:06:23 |只看作者 |坛友微信交流群
搜寻论坛有个20币币的

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-11-5 20:25