Transformer: Attention Is All You Need
Paper: https://arxiv.org/abs/1706.03762
Code: https://github.com/tensorflow/tensor2tensor
Authors: Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan…
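Preface: I hand-wrote a simple binomial coefficient (combination count) routine, but because I had not written one in a long time, I got some of the details wrong. Problem link
#include<bits/stdc.h>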
using namespace std;
#define endl "\n"
#define int long long
const int N = (int)2e5 + 10;
const int Mod = (int)1e9 + 7;
int…
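The excerpt is cut off before the combination routine itself. As a rough sketch of how such a routine is commonly written, and not the original author's code, the block below precomputes factorials and inverse factorials modulo a prime. The names `MOD`, `power`, `BinomTable`, and `binom` are hypothetical, and the modulus 1000000007 is an assumption. The comments mark details that are easy to get wrong after a long break: 64-bit intermediate products, out-of-range `k`, and the requirement that the modulus be prime.

```cpp
#include <cstdint>
#include <vector>

// Assumed prime modulus (illustrative; not confirmed by the excerpt).
constexpr std::int64_t MOD = 1000000007;

// Binary exponentiation: returns base^exp mod MOD for exp >= 0.
std::int64_t power(std::int64_t base, std::int64_t exp) {
    std::int64_t result = 1;
    base %= MOD;
    while (exp > 0) {
        if (exp & 1) result = result * base % MOD;  // product fits in 64 bits
        base = base * base % MOD;
        exp >>= 1;
    }
    return result;
}

// Factorial and inverse-factorial tables for 0..n, with n < MOD.
struct BinomTable {
    std::vector<std::int64_t> fact, inv_fact;

    explicit BinomTable(int n) : fact(n + 1), inv_fact(n + 1) {
        fact[0] = 1;
        for (int i = 1; i <= n; ++i) fact[i] = fact[i - 1] * i % MOD;
        // Fermat's little theorem: a^(MOD-2) is the inverse of a when MOD is prime.
        inv_fact[n] = power(fact[n], MOD - 2);
        for (int i = n; i > 0; --i) inv_fact[i - 1] = inv_fact[i] * i % MOD;
    }

    // C(n, k) mod MOD; the caller must keep n within the table size.
    std::int64_t binom(int n, int k) const {
        if (k < 0 || k > n) return 0;  // out-of-range k contributes nothing
        return fact[n] * inv_fact[k] % MOD * inv_fact[n - k] % MOD;
    }
};
```

With a table built once as `BinomTable t(1000);`, each query such as `t.binom(20, 10)` is then answered in constant time.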