ND4J is an Apache 2.0-licensed, open-source scientific computing library for the JVM. It is meant to be used in production environments rather than as a research tool, which means its routines are designed to run fast with minimal RAM requirements.
Please search for the latest version on search.maven.org.
Since there are many cases where ND4J is convenient on its own, let's briefly see how to use ND4J before moving on to the explanation of DL4J. If you would like to use ND4J alone, create a new Maven project and then add the following to pom.xml:
<properties>
    <nd4j.version>0.4-rc3.6</nd4j.version>
</properties>

<dependencies>
    <dependency>
        <groupId>org.nd4j</groupId>
        <artifactId>nd4j-jblas</artifactId>
        <version>${nd4j.version}</version>
    </dependency>
    <dependency>
        <groupId>org.nd4j</groupId>
        <artifactId>nd4j-perf</artifactId>
        <version>${nd4j.version}</version>
    </dependency>
</dependencies>
Here, <nd4j.version> specifies the version of ND4J; please check for the latest version when you actually implement the code. Also, switching from CPU to GPU is easy while working with ND4J. If you have CUDA version 7.0 installed, then all you do is define the artifactId as follows:
<dependency>
    <groupId>org.nd4j</groupId>
    <artifactId>nd4j-jcublas-7.0</artifactId>
    <version>${nd4j.version}</version>
</dependency>
You can replace the version number in <artifactId> depending on your CUDA configuration.
Let's look at a simple example of the calculations that are possible with ND4J. The central type in ND4J is INDArray, an n-dimensional array type. We begin by importing the following classes:
import org.nd4j.linalg.api.ndarray.INDArray;
import org.nd4j.linalg.factory.Nd4j;
Then, we define an INDArray as follows:
INDArray x = Nd4j.create(new double[]{1, 2, 3, 4, 5, 6}, new int[]{3, 2});
System.out.println(x);
Nd4j.create takes two arguments: the former defines the actual values within the INDArray, and the latter defines the shape of the vector or matrix. By running this code, you get the following result:
[[1.00,2.00]
[3.00,4.00]
[5.00,6.00]]
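To see how the flat value array maps onto the 3 x 2 shape, you can mirror the row-major ('c') ordering, ND4J's default, in plain Java. The RowMajorDemo class below is a hypothetical sketch introduced only for illustration, not part of ND4J:

```java
public class RowMajorDemo {
    // index into a flat array as if it were a (rows x cols) matrix in row-major order
    static double get(double[] values, int cols, int row, int col) {
        return values[row * cols + col];
    }

    public static void main(String[] args) {
        double[] values = {1, 2, 3, 4, 5, 6};
        int rows = 3, cols = 2;
        // reproduces the 3 x 2 layout printed by ND4J above
        for (int r = 0; r < rows; r++) {
            StringBuilder sb = new StringBuilder();
            for (int c = 0; c < cols; c++) {
                sb.append(get(values, cols, r, c));
                if (c < cols - 1) sb.append(",");
            }
            System.out.println(sb);  // prints 1.0,2.0 then 3.0,4.0 then 5.0,6.0
        }
    }
}
```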
Since INDArray can output its values with System.out.println, it's easy to debug. Calculation with a scalar can also be done with ease. Note that add does not modify x itself; it returns a new INDArray holding the result. Add 1 to x and print the result as shown here:
System.out.println(x.add(1));
Then, you will get the following output:
[[2.00,3.00]
[4.00,5.00]
[6.00,7.00]]
Also, calculations between INDArrays can be done easily, as shown in the following example:
INDArray y = Nd4j.create(new double[]{6, 5, 4, 3, 2, 1}, new int[]{3, 2});
Then, the basic arithmetic operations can be represented as follows. Each method returns a new INDArray, so we print the results directly:
System.out.println(x.add(y));
System.out.println(x.sub(y));
System.out.println(x.mul(y));
System.out.println(x.div(y));
These will print the following results:
[[7.00,7.00]
[7.00,7.00]
[7.00,7.00]]
[[-5.00,-3.00]
[-1.00,1.00]
[3.00,5.00]]
[[6.00,10.00]
[12.00,12.00]
[10.00,6.00]]
[[0.17,0.40]
[0.75,1.33]
[2.50,6.00]]
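The four operations above are all elementwise, which you can mirror in plain Java to see exactly what each INDArray method computes. The ElementwiseDemo class below is a hypothetical sketch working on flat arrays, not an ND4J API:

```java
public class ElementwiseDemo {
    // plain-Java equivalents of INDArray's elementwise add/sub/mul/div
    static double[] apply(double[] a, double[] b, char op) {
        double[] out = new double[a.length];
        for (int i = 0; i < a.length; i++) {
            switch (op) {
                case '+': out[i] = a[i] + b[i]; break;
                case '-': out[i] = a[i] - b[i]; break;
                case '*': out[i] = a[i] * b[i]; break;
                default:  out[i] = a[i] / b[i]; break;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        double[] x = {1, 2, 3, 4, 5, 6};
        double[] y = {6, 5, 4, 3, 2, 1};
        // matches the x.add(y) and x.mul(y) results shown above
        System.out.println(java.util.Arrays.toString(apply(x, y, '+')));  // all 7.0
        System.out.println(java.util.Arrays.toString(apply(x, y, '*')));  // 6.0, 10.0, 12.0, 12.0, 10.0, 6.0
    }
}
```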
Also, ND4J has destructive (in-place) arithmetic operators. When you write x.addi(y), x changes its own values, so System.out.println(x); will return the following output:
[[7.00,7.00]
[7.00,7.00]
[7.00,7.00]]
Likewise, subi, muli, and divi are also destructive operators. There are also many other methods that can conveniently perform calculations between vectors or matrices. For more information, you can refer to http://nd4j.org/documentation.html, http://nd4j.org/doc/ and http://nd4j.org/apidocs/.
Let's look at one more example to see how machine learning algorithms can be written with ND4J. We'll implement the simplest example, the perceptron, based on the source code written in Chapter 2, Algorithms for Machine Learning – Preparing for Deep Learning. We set the package name to DLWJ.examples.ND4J and the file (class) name to Perceptrons.java.
First, let's add these two lines to import from ND4J:
import org.nd4j.linalg.api.ndarray.INDArray;
import org.nd4j.linalg.factory.Nd4j;
The model has two parameters: nIn, the number of units in the input layer, and w, the weight. The former doesn't change from the previous code; the latter, however, is now an INDArray instead of an array:
public int nIn; // dimensions of input data
public INDArray w;
You can see from the constructor that, since the weight of the perceptron is represented as a vector, the number of rows is set to the number of units in the input layer and the number of columns to 1. This definition is written here:
public Perceptrons(int nIn) {
    this.nIn = nIn;
    w = Nd4j.create(new double[nIn], new int[]{nIn, 1});
}
Then, because we define the model parameter as an INDArray, we also define the demo data (the training data and the test data) as INDArray. You can see these definitions at the beginning of the main method:
INDArray train_X = Nd4j.create(new double[train_N * nIn], new int[]{train_N, nIn}); // input data for training
INDArray train_T = Nd4j.create(new double[train_N], new int[]{train_N, 1}); // output data (label) for training
INDArray test_X = Nd4j.create(new double[test_N * nIn], new int[]{test_N, nIn}); // input data for test
INDArray test_T = Nd4j.create(new double[test_N], new int[]{test_N, 1}); // label of inputs
INDArray predicted_T = Nd4j.create(new double[test_N], new int[]{test_N, 1}); // output data predicted by the model
When we substitute a value into an INDArray, we use put. Be careful: the only values we can set with put are scalar-type values created with Nd4j.scalar:
train_X.put(i, 0, Nd4j.scalar(g1.random()));
train_X.put(i, 1, Nd4j.scalar(g2.random()));
train_T.put(i, Nd4j.scalar(1));
The flow of model building and training is the same as in the previous code:
    if (classified_ == train_N) break;  // when all the data are classified correctly

    epoch++;
    if (epoch > epochs) break;
}
Each piece of training data is passed to the train method with getRow(). First, let's see the entire content of the train method:
public int train(INDArray x, INDArray t, double learningRate) {

    int classified = 0;

    // check whether the data is classified correctly
    double c = x.mmul(w).getDouble(0) * t.getDouble(0);

    // apply the steepest descent method if the data is wrongly classified
    if (c > 0) {
        classified = 1;
    } else {
        w.addi(x.transpose().mul(t).mul(learningRate));
    }

    return classified;
}
We first focus our attention on the following code:
// check if the data is classified correctly
double c = x.mmul(w).getDouble(0) * t.getDouble(0);
This is the part that checks whether the data is classified correctly by the perceptron, as expressed in the following condition:

w^T x_n t_n > 0
You can see from the code that .mmul() is for the multiplication between vectors or matrices. We wrote this part of the calculation in Chapter 2, Algorithms for Machine Learning – Preparing for Deep Learning, as follows:
double c = 0.;

// check whether the data is classified correctly
for (int i = 0; i < nIn; i++) {
    c += w[i] * x[i] * t;
}
By comparing both codes, you can see that multiplication between vectors or matrices can be written easily with INDArray, and so you can implement the algorithm intuitively just by following the equations.
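To make the equivalence concrete, here is a minimal plain-Java sketch (no ND4J) of the quantity both versions compute: the dot product w^T x scaled by the label t. The DotCheck class and its dot helper are hypothetical, introduced only for illustration:

```java
public class DotCheck {
    // plain-Java dot product: the same quantity x.mmul(w) computes
    // for a (1 x n) row vector times an (n x 1) column vector
    static double dot(double[] w, double[] x) {
        double sum = 0.;
        for (int i = 0; i < w.length; i++) {
            sum += w[i] * x[i];
        }
        return sum;
    }

    public static void main(String[] args) {
        double[] w = {0.5, -0.25};
        double[] x = {2.0, 4.0};
        int t = 1;  // label, +1 or -1
        // c = (0.5 * 2.0 + (-0.25) * 4.0) * 1 = 0.0, so this sample is not yet classified correctly
        double c = dot(w, x) * t;
        System.out.println(c > 0 ? "correctly classified" : "misclassified");
    }
}
```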
The equation to update the model parameters is as follows:
w.addi(x.transpose().mul(t).mul(learningRate));
Here, again, you can implement the code just as you would write a math equation. The update rule is represented as follows:

w := w + η t_n x_n

where η is the learning rate.
The last time we implemented this part, we wrote it with a for loop:
for (int i = 0; i < nIn; i++) {
    w[i] += learningRate * x[i] * t;
}
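The loop form of the update can also be exercised on its own. The UpdateStep class below is a hypothetical plain-Java sketch of applying the rule w := w + η t x once:

```java
public class UpdateStep {
    // one steepest-descent update for a misclassified sample: w <- w + eta * t * x
    static void update(double[] w, double[] x, int t, double learningRate) {
        for (int i = 0; i < w.length; i++) {
            w[i] += learningRate * x[i] * t;
        }
    }

    public static void main(String[] args) {
        double[] w = {0.0, 0.0};
        // with learning rate 1.0 and label +1, the weight moves by exactly x
        update(w, new double[]{2.0, -1.0}, 1, 1.0);
        System.out.println(w[0] + ", " + w[1]);  // prints 2.0, -1.0
    }
}
```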
Furthermore, the prediction after the training is also the standard forward activation, shown in the following equation:

y = f(w^T x)

Here, f is the step function:

f(a) = 1 if a >= 0, and -1 otherwise
We can simply define the predict method with just a single line inside, as follows:
public int predict(INDArray x) {
    return step(x.mmul(w).getDouble(0));
}
When you run the program, you can see that its precision, accuracy, and recall are the same as those we got with the previous code.
Thus, it greatly helps to implement the algorithms in a form analogous to the mathematical equations. We only implemented the perceptron here, but please try other algorithms by yourself.
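As a recap, the whole algorithm fits in a short, self-contained plain-Java sketch. The TinyPerceptron class and its toy data are hypothetical, chosen so that the two classes are linearly separable:

```java
public class TinyPerceptron {
    double[] w;

    TinyPerceptron(int nIn) { w = new double[nIn]; }

    // step function: f(a) = 1 if a >= 0, -1 otherwise
    static int step(double a) { return a >= 0 ? 1 : -1; }

    double dot(double[] x) {
        double s = 0.;
        for (int i = 0; i < w.length; i++) s += w[i] * x[i];
        return s;
    }

    // returns 1 if x was already classified correctly, 0 otherwise
    int train(double[] x, int t, double learningRate) {
        if (dot(x) * t > 0) return 1;            // w^T x t > 0: correct
        for (int i = 0; i < w.length; i++)       // otherwise update: w <- w + eta * t * x
            w[i] += learningRate * x[i] * t;
        return 0;
    }

    int predict(double[] x) { return step(dot(x)); }

    public static void main(String[] args) {
        double[][] X = {{2.0, 2.0}, {3.0, 1.0}, {-2.0, -1.0}, {-1.0, -3.0}};
        int[] T = {1, 1, -1, -1};
        TinyPerceptron p = new TinyPerceptron(2);
        for (int epoch = 0; epoch < 100; epoch++) {
            int classified = 0;
            for (int i = 0; i < X.length; i++) classified += p.train(X[i], T[i], 1.0);
            if (classified == X.length) break;   // when all data are classified correctly
        }
        System.out.println(p.predict(new double[]{1.5, 2.5}));    // prints 1
        System.out.println(p.predict(new double[]{-2.5, -0.5}));  // prints -1
    }
}
```

The training loop mirrors the structure shown earlier: it stops as soon as every sample is classified correctly or the epoch limit is reached.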