机器学习（六） — 评估模型

发布时间：2024年01月18日

Evaluate model

1 test set

split the training set into training set and a test set
the test set is used to evaluate the model

1. linear regression

compute test error

$J_{test}(\vec w, b) = \frac{1}{2m_{test}}\sum_{i=1}^{m_{test}} \left [ (f(x_{test}^{(i)}) - y_{test}^{(i)})^2 \right ]$

2. classification regression

compute test error

$J_{test}(\vec w, b) = -\frac{1}{m_{test}}\sum_{i=1}^{m_{test}} \left [ y_{test}^{(i)}log(f(x_{test}^{(i)})) + (1 - y_{test}^{(i)})log(1 - f(x_{test}^{(i)}) \right ]$

2 cross-validation set

split the training set into training set, cross-validation set and test set
the cross-validation set is used to automatically choose the better model, and the test set is used to evaluate the model that chosed

3 bias and variance

high bias: $J_{train}$ and $J_{cv}$ is both high
high variance: $J_{train}$ is low, but $J_{cv}$ is high

在这里插入图片描述

if high bias: get more training set is helpless
if high variance: get more training set is helpful

4 regularization

if $\lambda$ is too small, it will lead to overfitting(high variance)
if $\lambda$ is too large, it will lead to underfitting(high bias)

在这里插入图片描述

5 method

fix high variance:
get more training set
try smaller set of features
reduce some of the higher-order terms
increase $\lambda$

fix high bias:
get more addtional features
add polynomial features
decrease $\lambda$

6 neural network and bias variance

a bigger network means a more complex model, so it will solve the high bias
more data is helpful to solve high variance

在这里插入图片描述

it turns out that a bigger(may be overfitting) and well regularized neural network is better than a small neural network

文章来源:https://blog.csdn.net/m0_65591847/article/details/135641692
本文来自互联网用户投稿，该文观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若内容造成侵权/违法违规/事实不符，请联系我的编程经验分享网邮箱：chenni525@qq.com进行投诉反馈，一经查实，立即删除！