楼主: qililaimend
1254 0

[数据挖掘理论与案例] 基于序列标引的药物以及其属性的抽取方法 [推广有奖]

  • 0关注
  • 0粉丝

已卖:3份资源

初中生

85%

还不是VIP/贵宾

-

威望
0
论坛币
3 个
通用积分
0
学术水平
0 点
热心指数
0 点
信用等级
0 点
经验
170 点
帖子
15
精华
0
在线时间
12 小时
注册时间
2013-7-20
最后登录
2020-4-7

楼主
qililaimend 发表于 2015-6-14 09:48:30 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
有做医疗数据管理的吗, 交流一下呀
先发一个我的文章
A sequence labeling approach to link medications and their attributes in clinical notes and clinical trial announcements for information extraction J Am Med Inform Assoc-2013-Li-915-21

基于序列标引的药物以及其属性的抽取方法

ABSTRACT
Objective The goal of this work was to evaluate
machine learning methods, binary classification and
sequence labeling, for medication–attribute linkage
detection in two clinical corpora.
Data and methods We double annotated 3000
clinical trial announcements (CTA) and 1655 clinical
notes (CN) for medication named entities and their
attributes. A binary support vector machine (SVM)
classification method with parsimonious feature sets,
and a conditional random fields (CRF)-based multilayered
sequence labeling (MLSL) model were proposed
to identify the linkages between the entities and their
corresponding attributes. We evaluated the system’s
performance against the human-generated gold
standard.
Results The experiments showed that the two machine
learning approaches performed statistically significantly
better than the baseline rule-based approach. The binary
SVM classification achieved 0.94 F-measure with
individual tokens as features. The SVM model trained on
a parsimonious feature set achieved 0.81 F-measure for
CN and 0.87 for CTA. The CRF MLSL method achieved
0.80 F-measure on both corpora.
Discussion and conclusions We compared the novel
MLSL method with a binary classification and a rulebased
method. The MLSL method performed statistically
significantly better than the rule-based method.
However, the SVM-based binary classification method
was statistically significantly better than the MLSL
method for both the CTA and CN corpora. Using
parsimonious feature sets both the SVM-based binary
classification and CRF-based MLSL methods achieved
high performance in detecting medication name and
attribute linkages in CTA and CN.

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:parsimonious announcement Statistical significant performance 健康 信息

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2026-1-11 17:34