先检查training data loss看有没有学起来如果loss很大: 可能是模型太简单了(model bias)优化问题(optimization issue) training data loss大,testing data loss大: overfitting 更少的参数,共享参数更少特征early stopping正则化dropout