In Aboleth we use function composition to compose machine learning models. These models are callable Python classes that, when called, return a TensorFlow computational graph (really a tf.Tensor). We can best demonstrate this with a few examples.
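Concretely, layers compose with the >> operator, and the composed network is itself callable. A minimal sketch of this idea (the layer names follow the Aboleth documentation, but treat the exact signatures as indicative, since they vary between releases):

import tensorflow as tf
import aboleth as ab

# >> chains callable layers into a new callable model.
net = (
    ab.InputLayer(name="X", n_samples=5) >>
    ab.DenseVariational(output_dim=1)
)

X_ = tf.placeholder(tf.float32, [None, 10])
f, kl = net(X=X_)  # f is a tf.Tensor; kl is the model's regulariser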
This is a simple demo that draws a random, non-linear function from a Gaussian process with a specified kernel and length scale. We then use Aboleth (in Gaussian process approximation mode) to try to learn this function given only a few noisy observations of it. This script also demonstrates how we can divide the data into mini-batches using utilities in the tf.train module, and how we can use tf.train.MonitoredTrainingSession to log the learning progress.
This demo can be used to generate figures of the learned predictive distribution (plotted with Bokeh).
#! /usr/bin/env python3
"""This demo uses Aboleth for approximate Gaussian process regression."""
import logging
import numpy as np
import bokeh.plotting as bk
import bokeh.palettes as bp
import tensorflow as tf
# from sklearn.gaussian_process.kernels import Matern as kern
from sklearn.gaussian_process.kernels import RBF as kern
import aboleth as ab
from aboleth.likelihoods import Normal
from aboleth.datasets import gp_draws
# Set up a Python logger so we can see the output of MonitoredTrainingSession
logger = logging.getLogger()
logger.setLevel(logging.INFO)
# Set up a consistent random seed in Aboleth so we get repeatable (but still
# random) results
RSEED = 666
ab.set_hyperseed(RSEED)
# Data settings
N = 1000 # Number of training points to generate
Ns = 400 # Number of testing points to generate
kernel = kern(length_scale=0.5) # Kernel to use for making a random GP draw
true_noise = 0.1 # Add noise to the GP draws, to make things a little harder
# Model settings
n_samples = 5 # Number of random samples to get from an Aboleth net
n_pred_samples = 10 # This will give n_samples by n_pred_samples predictions
n_epochs = 200 # How many times to see the data during training
batch_size = 10 # mini-batch size for stochastic gradients
config = tf.ConfigProto(device_count={'GPU': 0}) # Use the GPU? 0 = no
# Model initialisation
variance = tf.Variable(1.) # Likelihood variance initialisation; this is learned
reg = 1. # Initial weight prior variance; this is optimised later
# Random Fourier Features
# lenscale = tf.Variable(1.) # learn the length scale
# kern = ab.RBF(lenscale=ab.pos(lenscale)) # keep the length scale positive
# Variational Fourier Features -- the length-scale setting here is the "prior";
# we can choose to optimise this or not
lenscale = 1.
kern = ab.RBFVariational(lenscale=lenscale) # the VAR-FIXED kernel from
# Cutajar et al. 2017
# This is how we make the "latent function" of a Gaussian process, here
# n_features controls how many random basis functions we use in the
# approximation. The more of these, the more accurate, but more costly
# computationally. "full" indicates we want a full-covariance matrix Gaussian
# posterior of the model weights. This is optional, but it does greatly improve
# the expressiveness of the learned posterior.
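The full script continues from this point. The following is a rough sketch of the remaining steps (building the network, the ELBO loss, queue-based mini-batching with tf.train utilities, and the monitored training session); the gp_draws return values, the ab.elbo signature, and n_features=100 are assumptions that may differ from the actual demo and between Aboleth releases:

# Draw the noisy training and testing data (assumed return signature).
Xr, Yr, Xs, Ys = gp_draws(N, Ns, kern=kernel, noise=true_noise)

# Shuffled mini-batches built from the queue utilities in tf.train.
X_, Y_ = tf.train.shuffle_batch(
    tf.train.slice_input_producer([Xr.astype(np.float32),
                                   Yr.astype(np.float32)],
                                  num_epochs=n_epochs, seed=RSEED),
    batch_size=batch_size, capacity=100, min_after_dequeue=50)

# The approximate GP "latent function": random Fourier features feeding a
# variational dense layer with a full-covariance Gaussian posterior.
net = (
    ab.InputLayer(name="X", n_samples=n_samples) >>
    ab.RandomFourier(n_features=100, kernel=kern) >>
    ab.DenseVariational(output_dim=1, full=True)
)

f, kl = net(X=X_)
lkhood = Normal(variance=ab.pos(variance))  # keep the variance positive
loss = ab.elbo(f, Y_, N, kl, lkhood)        # signature varies by release

train = tf.train.AdamOptimizer().minimize(loss)
log = tf.train.LoggingTensorHook({'loss': loss}, every_n_iter=1000)

with tf.train.MonitoredTrainingSession(config=config, hooks=[log]) as sess:
    while not sess.should_stop():  # ends after n_epochs of mini-batches
        sess.run(train)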
Here we demonstrate a slightly different take on Bayesian deep learning. In his thesis and associated publications, Yarin Gal shows that regular neural networks with dropout can be viewed as a form of variational inference with particular prior and posterior distributions on the weights.
In this demo we implement this elegant idea using maximum a posteriori (MAP) weight layers and dropout layers in a classifier (see ab.layers). We leave these layers stochastic in the prediction step and draw samples from the network's predictive distribution, as we would with variational networks.
We test the classifier against a random forest classifier on the breast cancer dataset with 5-fold cross validation, and get quite good and robust performance.
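A hedged sketch of what such a classifier might look like using the MAP and dropout layers in ab.layers (LSAMPLES is defined in the script below; the layer widths, regularisers, and keep probability here are illustrative, not the demo's exact values):

# Dropout stays active at prediction time, so every forward pass is a
# draw from the approximate posterior over networks.
net = (
    ab.InputLayer(name="X", n_samples=LSAMPLES) >>
    ab.DenseMAP(output_dim=64, l2_reg=0.01, l1_reg=0.) >>
    ab.Activation(tf.nn.relu) >>
    ab.DropOut(keep_prob=0.9) >>
    ab.DenseMAP(output_dim=1, l2_reg=0.01, l1_reg=0.)
)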
#! /usr/bin/env python3
"""This script demonstrates an alternative way of making a Bayesian Neural Net.
This is based on Yarin Gal's work on interpreting dropout networks as a special
case of Bayesian neural nets, see http://mlg.eng.cam.ac.uk/yarin/blog_2248.html
"""
import tensorflow as tf
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import KFold
from sklearn.metrics import accuracy_score, log_loss
from sklearn.ensemble import RandomForestClassifier
from sklearn.preprocessing import StandardScaler
import aboleth as ab
FOLDS = 5
RSEED = 100
ab.set_hyperseed(RSEED)
# Optimization
NITER = 20000 # Training iterations per fold
BSIZE = 10 # mini-batch size
CONFIG = tf.ConfigProto(device_count={'GPU': 0}) # Use the GPU? 0 = no
LSAMPLES = 1 # Use only 1 dropout "sample" during learning, so training is
# more like a MAP network
PSAMPLES = 50 # This will give LSAMPLES * PSAMPLES predictions
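Prediction then keeps the dropout layers stochastic: every evaluation of the network re-samples the dropout masks, so stacking repeated runs yields LSAMPLES * PSAMPLES posterior draws to average. A hedged sketch, assuming a class-probability tensor prob of shape (LSAMPLES, n, 1), a placeholder X_, test data X_test, and an open tf.Session sess:

# Hypothetical prediction step: each sess.run re-samples the dropout
# masks, giving LSAMPLES network draws per run; PSAMPLES runs are
# stacked and both sample axes averaged out.
samples = np.stack([sess.run(prob, feed_dict={X_: X_test})
                    for _ in range(PSAMPLES)])
p_y = samples.mean(axis=(0, 1))  # predictive class probabilities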