
TheanoLM - An Extensible Toolkit for Neural Network Language Modeling


#1 (OP)
ReneeBK, posted 2017-9-11 03:06:23

Abstract

We present a new tool for training neural network language models (NNLMs), scoring sentences, and generating text. The tool has been written using the Python library Theano, which allows researchers to easily extend it and tune any aspect of the training process. Despite this flexibility, Theano is able to generate extremely fast native code that can utilize a GPU or multiple CPU cores in order to parallelize the heavy numerical computations.

The tool has been evaluated in difficult Finnish and English conversational speech recognition tasks, and significant improvement was obtained over our best back-off n-gram models. The results that we obtained in the Finnish task were compared to those from the existing RNNLM and RWTHLM toolkits, and found to be as good or better, while training times were an order of magnitude shorter.
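The speed claim in the abstract comes from Theano's compilation model: you build a symbolic expression graph and theano.function() compiles it to optimized native code (CUDA kernels if a GPU is configured). Below is a minimal, self-contained sketch of that workflow; the shapes and variable names are illustrative and not taken from TheanoLM itself.

import numpy as np
import theano
import theano.tensor as T

# Declare symbolic inputs: a weight matrix and a batch of input column vectors.
W = T.matrix("W", dtype=theano.config.floatX)
x = T.matrix("x", dtype=theano.config.floatX)

# Build a symbolic expression; nothing is computed yet.
y = T.nnet.softmax(T.dot(W, x).T)

# Compile the graph into native code (runs on GPU or multiple CPU cores
# depending on the Theano configuration).
f = theano.function(inputs=[W, x], outputs=y)

# Evaluate the compiled function on concrete NumPy arrays (illustrative sizes).
W_val = np.random.randn(10, 136).astype(theano.config.floatX)
x_val = np.random.randn(136, 128).astype(theano.config.floatX)
print(f(W_val, x_val).shape)  # (128, 10)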

Hidden content of this post (attachment):

TheanoLM — An Extensible Toolkit for Neural Network Language Modeling.pdf (96.17 KB)


Keywords: Modeling, Language, network, Toolkit, Theano

#2
ReneeBK, posted 2017-9-11 03:16:53
from __future__ import print_function
import theano
import theano.tensor as T
from theano.tensor.shared_randomstreams import RandomStreams
import sys
import time
import numpy as np
import pprint

sys.path.append("../..")
import DeepLearningStack
from DeepLearningStack import RecurrentNet


# create a recurrent deep network defined in RNNArch.xml
if __name__ == "__main__":

    # create deep net
    num_timesteps = 2
    print("Creating Recurrent Net for %d time steps" % num_timesteps)
    config = "RNNArch.xml"
    # random number generator used to initialize the weights
    rng = RandomStreams(seed=int(time.time()))
    # for each time step, create a non-recurrent input
    nonrcrnt_ins = []
    # size of the input layer is 136x128
    for t in range(num_timesteps):
        # the input is the concatenation of action history and beliefs
        in_matrix = T.matrix("data-step-%d" % t, dtype=theano.config.floatX)
        # for each input, provide a symbolic variable and its size
        nonrcrnt_ins.append({"input": [in_matrix, (136, 128)]})
    # create the recurrent inputs for time step 0
    # sizes of the recurrent inputs according to RNNArch.xml (batch size is 128, each size is dim x batch_size)
    # rct1: 128x128
    # rct2: 128x128
    # fc3 : 10x128
    # fc1_step0 = T.matrix("fc1-step-0", dtype=theano.config.floatX)
    # initial value of the recurrent fc2 layer at time step 0
    fc2_step0 = T.matrix("fc2-step-0", dtype=theano.config.floatX)
    # rcrnt_ins = {"fc1": [fc1_step0, (128, 128)], "fc2": [fc2_step0, (128, 128)]}
    rcrnt_ins = {"fc2": [fc2_step0, (128, 128)]}

    # create the graph structure
    rnn = RecurrentNet.RecurrentNet(rng, nonrcrnt_ins, rcrnt_ins, config, unrolled_len=num_timesteps)
    print("RNN params:")
    pprint.pprint(rnn.params)

    # create a function for the RNN:
    # inputs to the graph consist of the recurrent inputs for time step 0 and the non-recurrent inputs for all time steps
    inputs = [nonrcrnt_ins[k]["input"][0] for k in range(len(nonrcrnt_ins))]
    for k in rcrnt_ins.keys():
        inputs.append(rcrnt_ins[k][0])
    # outputs of the network include the fc2 outputs for each time step
    outputs = [rnn.name2layer[i]["fc2"].output for i in rnn.name2layer.keys()]
    print("compiling the function")
    f = theano.function(inputs=inputs, outputs=outputs)
    # draw the RNN
    graph_img_name = "RNN.png"
    print("creating graph picture in:", graph_img_name)
    theano.printing.pydotprint(f, outfile=graph_img_name, var_with_name_simple=True)
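Continuing the script above, the compiled function f can then be evaluated on concrete NumPy arrays whose shapes match the declared input sizes: 136x128 for each time step's non-recurrent data and 128x128 for the initial recurrent fc2 state. The snippet below is a minimal sketch only; the random input data and the zero initial state are assumptions, not part of the DeepLearningStack example.

    # Illustrative usage (continues the script above): evaluate the compiled graph
    # on random data of the declared shapes.
    floatX = theano.config.floatX
    data = [np.random.randn(136, 128).astype(floatX) for _ in range(num_timesteps)]
    fc2_init = np.zeros((128, 128), dtype=floatX)

    # Argument order follows the `inputs` list built above:
    # non-recurrent data for each time step, then the initial recurrent fc2 state.
    step_outputs = f(*(data + [fc2_init]))
    for idx, out in enumerate(step_outputs):
        print("output %d has shape %s" % (idx, out.shape))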

#3
fengyg (verified business account), posted 2017-9-11 06:40:57
Just taking a look.

#4
MouJack007, posted 2017-9-11 06:49:52
Thanks to the OP for sharing!

#5
MouJack007, posted 2017-9-11 06:50:10
