
Training does not converge on the Amazon Beauty dataset with the paper's hyperparameter setting #8

Open
lipingcoding opened this issue Nov 13, 2020 · 3 comments
Labels: question (Further information is requested)

Comments

@lipingcoding

No description provided.

pmixer (Owner) commented Nov 13, 2020

@lipingcoding thanks for reporting the issue. I also observed some problems and am still wondering what's wrong with the PyTorch version compared with the TF implementation; sorry to say I haven't figured it out yet. The only thing I can be sure of currently is that the original paper's hyperparameter setting cannot be directly used for this codebase, as I fixed a leaky-attention issue by using PyTorch's MHA. The parameter initialization also still needs to be examined, but I haven't done that yet.
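For context, here is a minimal sketch of the causal (left-to-right) masking involved when using torch.nn.MultiheadAttention; the shapes and names are illustrative assumptions, not this repo's actual code:

```python
import torch

seq_len, batch, dim, heads = 50, 2, 64, 1

# torch.nn.MultiheadAttention expects (seq_len, batch, embed_dim) inputs by default.
mha = torch.nn.MultiheadAttention(embed_dim=dim, num_heads=heads)

# Causal mask: True marks positions a timestep is NOT allowed to attend to,
# so position i only sees positions <= i and future items cannot "leak" in.
attn_mask = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1)

x = torch.rand(seq_len, batch, dim)
out, _ = mha(x, x, x, attn_mask=attn_mask)
print(out.shape)  # torch.Size([50, 2, 64])
```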

pmixer (Owner) commented Nov 13, 2020

btw @lipingcoding could you please provide a bit more information, such as logs, describing how the model does not converge? For example, uncommenting

# print("loss in epoch {} iteration {}: {}".format(epoch, step, loss.item())) # expected 0.4~0.6 after init few epochs

would print the loss for each iteration, which should be informative.
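For reference, a minimal sketch of where such a per-iteration print typically sits in a training loop; the model, optimizer, and data below are illustrative stand-ins, not this repo's code:

```python
import torch

# Hypothetical stand-ins so the loop runs self-contained.
model = torch.nn.Linear(10, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = torch.nn.BCEWithLogitsLoss()
data = [(torch.rand(8, 10), torch.randint(0, 2, (8, 1)).float()) for _ in range(5)]

for epoch in range(3):
    for step, (x, y) in enumerate(data):
        optimizer.zero_grad()
        loss = criterion(model(x), y)
        loss.backward()
        optimizer.step()
        # Per-iteration loss: a healthy run should trend downward,
        # not hover flat or blow up.
        print("loss in epoch {} iteration {}: {}".format(epoch, step, loss.item()))
```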

pmixer (Owner) commented Nov 13, 2020

@lipingcoding could you try the newly updated code?

@pmixer added the question (Further information is requested) label on Mar 28, 2021