PyTorch教程2.5之自动微分

2388812 2023-06-05 | pdf | 0.18 MB | 次下载 | 免费

资料介绍

回想一下2.4 节，计算导数是我们将用于训练深度网络的所有优化算法中的关键步骤。虽然计算很简单，但手工计算可能很乏味且容易出错，而且这个问题只会随着我们的模型变得更加复杂而增长。

幸运的是，所有现代深度学习框架都通过提供自动微分（通常简称为 autograd ）来解决我们的工作。当我们通过每个连续的函数传递数据时，该框架会构建一个计算图来跟踪每个值如何依赖于其他值。为了计算导数，自动微分通过应用链式法则通过该图向后工作。以这种方式应用链式法则的计算算法称为反向传播。

虽然 autograd 库在过去十年中成为热门话题，但它们的历史悠久。事实上，对 autograd 的最早引用可以追溯到半个多世纪以前（Wengert，1964 年）。现代反向传播背后的核心思想可以追溯到 1980 年的一篇博士论文 ( Speelpenning, 1980 )，并在 80 年代后期得到进一步发展 ( Griewank, 1989 )。虽然反向传播已成为计算梯度的默认方法，但它并不是唯一的选择。例如，Julia 编程语言采用前向传播（Revels等人，2016 年）. 在探索方法之前，我们先来掌握autograd这个包。

import torch

						from mxnet import autograd, np, npx

npx.set_np()

						from jax import numpy as jnp

						 

						import tensorflow as tf

						 

2.5.1. 一个简单的函数

假设我们有兴趣区分函数 y=2x⊤x关于列向量x. 首先，我们分配x一个初始值。

							x = torch.arange(4.0)
x

							tensor([0., 1., 2., 3.])

						

在我们计算梯度之前y关于 x，我们需要一个地方来存放它。通常，我们避免每次求导时都分配新内存，因为深度学习需要针对相同参数连续计算导数数千或数百万次，并且我们可能会面临内存耗尽的风险。请注意，标量值函数相对于向量的梯度x是向量值的并且具有相同的形状x.

							# Can also create x = torch.arange(4.0, requires_grad=True)
x.requires_grad_(True)
x.grad # The gradient is None by default

							 

							x = np.arange(4.0)
x

							array([0., 1., 2., 3.])

						

Before we calculate the gradient of y with respect to x, we need a place to store it. In general, we avoid allocating new memory every time we take a derivative because deep learning requires successively computing derivatives with respect to the same parameters thousands or millions of times, and we might risk running out of memory. Note that the gradient of a scalar-valued function with respect to a vector x is vector-valued and has the same shape as x.

							# We allocate memory for a tensor's gradient by invoking `attach_grad`
x.attach_grad()
# After we calculate a gradient taken with respect to `x`, we will be able to
# access it via the `grad` attribute, whose values are initialized with 0s
x.grad

							 

							array([0., 0., 0., 0.])

						

							x = jnp.arange(4.0)
x

							No GPU/TPU found, falling back to CPU. (Set TF_CPP_MIN_LOG_LEVEL=0 and rerun for more info.)

						

							Array([0., 1., 2., 3.], dtype=float32)

						

							x = tf.range(4, dtype=tf.float32)
x

							<tf.Tensor: shape=(4,), dtype=float32, numpy=array([0., 1., 2., 3.], dtype=float32)>

						

							x = tf.Variable(x)

							 

我们现在计算我们的函数x并将结果分配给y。

							y = 2 * torch.dot(x, x)
y

							tensor(28., grad_fn=<MulBackward0>)

						

我们现在可以通过调用它的方法来获取y关于的梯度。接下来，我们可以通过的属性访问渐变。xbackwardxgrad

							y.backward()
x.grad

							tensor([ 0., 4., 8., 12.])

						

							# Our code is inside an `autograd.record` scope to build the computational
# graph
with autograd.record():
  y = 2 * np.dot(x, x)
y

							 

							array(28.)

						

We can now take the gradient of y with respect to x by calling its backward method. Next, we can access the gradient via x’s grad attribute.

							y.backward()
x.grad

							[09:38:36] src/base.cc:49: GPU context requested, but no GPUs found.

						

							array([ 0., 4., 8., 12.])

						

							y = lambda x: 2 * jnp.dot(x, x)
y(x)

							Array(28., dtype=float32)

						

We can now take the gradient of y with respect to x by passing through the grad transform.

							from jax import grad

# The `grad` transform returns a Python function that
# computes the gradient of the original function
x_grad = grad(y)(x)
x_grad

							 

							Array([ 0., 4., 8., 12.], dtype=float32)

						

							# Record all computations onto a tape
with tf.GradientTape() as t:
  y = 2 * tf.tensordot(x, x, axes=1)
y

							 

							<tf.Tensor: shape=(), dtype=float32, numpy=28.0>

						

We can now calculate the gradient of y with respect to x by calling the gradient method.

							x_grad = t.gradient(y, x)
x_grad

							<tf.Tensor: shape=(4,), dtype=float32, numpy=array([ 
						

函数微分深度学习 pytorch

下载该资料的人也在下载下载该资料的人还在阅读

更多 >

利用Arm Kleidi技术实现PyTorch优化 239次阅读
使用PyTorch在英特尔独立显卡上训练模型 651次阅读
Pytorch深度学习训练的方法 240次阅读
PyTorch的介绍与使用案例 435次阅读
PyTorch的特性和使用方法 601次阅读
如何使用PyTorch建立网络模型 447次阅读
TorchFix:基于PyTorch的代码静态分析 1101次阅读
基于PyTorch AMD的解决方案 947次阅读
积分与微分电路原理详解 2350次阅读
PyTorch 的 Autograd 机制和使用 1132次阅读
一文解构PyTorch：深入了解PyTorch内部机制 4033次阅读
PyTorch官网教程PyTorch深度学习:60分钟快速入门中文翻译版 1w次阅读
基于图像的微分的：一阶微分和二阶微分（拉普拉斯算子） 3w次阅读
Github上Star过千的PyTorch NLP相关项目都在这儿了！ 7320次阅读
RC微分电路的作用_RC微分电路原理 11.1w次阅读

资料 -- | 积分 --

查看他上传的所有资料

+关注个人主页

上传资料赚积分

下载排行

本周

1山景DSP芯片AP8248A2数据手册
1.06 MB | 532次下载 | 免费
2RK3399完整板原理图（支持平板，盒子VR）
3.28 MB | 339次下载 | 免费
3TC358743XBG评估板参考手册
1.36 MB | 330次下载 | 免费
4DFM软件使用教程
0.84 MB | 295次下载 | 免费
5元宇宙深度解析—未来的未来-风口还是泡沫
6.40 MB | 227次下载 | 免费
6迪文DGUS开发指南
31.67 MB | 194次下载 | 免费
7元宇宙底层硬件系列报告
13.42 MB | 182次下载 | 免费
8FP5207XR-G1中文应用手册
1.09 MB | 178次下载 | 免费

本月

1OrCAD10.5下载OrCAD10.5中文版软件
0.00 MB | 234315次下载 | 免费
2555集成电路应用800例(新编版)
0.00 MB | 33566次下载 | 免费
3接口电路图大全
未知 | 30323次下载 | 免费
4开关电源设计实例指南
未知 | 21549次下载 | 免费
5电气工程师手册免费下载(新编第二版pdf电子书)
0.00 MB | 15349次下载 | 免费
6数字电路基础pdf(下载)
未知 | 13750次下载 | 免费
7电子制作实例集锦下载
未知 | 8113次下载 | 免费
8《LED驱动电路设计》温德尔著
0.00 MB | 6656次下载 | 免费

总榜

1matlab软件下载入口
未知 | 935054次下载 | 免费
2protel99se软件下载(可英文版转中文版)
78.1 MB | 537798次下载 | 免费
3MATLAB 7.1 下载 (含软件介绍)
未知 | 420027次下载 | 免费
4OrCAD10.5下载OrCAD10.5中文版软件
0.00 MB | 234315次下载 | 免费
5Altium DXP2002下载入口
未知 | 233046次下载 | 免费
6电路仿真软件multisim 10.0免费下载
340992 | 191187次下载 | 免费
7十天学会AVR单片机与C语言视频教程下载
158M | 183279次下载 | 免费
8proe5.0野火版下载(中文版免费下载)
未知 | 138040次下载 | 免费

搜索历史

PyTorch教程2.5之自动微分

资料介绍

2.5.1. 一个简单的函数

评论

下载排行

本周

本月

总榜