Issues: huggingface/transformers
[Quick poll] Give your opinion on the future of the Hugging F...
#20706
opened Dec 9, 2022 by
LysandreJik
Open
1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
the line 32 in convert_llama_weights_to_hf is LlamaTokenizer not LlamaForTokenizer
#22287
opened Mar 21, 2023 by
zhl5842
RuntimeError: "topk_cpu" not implemented for 'Half'
#22284
opened Mar 21, 2023 by
MarvinLong
2 of 4 tasks
deploy whisper by passing last transcribed sentences to decoder's past_key values
#22277
opened Mar 20, 2023 by
hannan72
run_summarization requires a dataset_name or train_file or validation_file in all cases
#22276
opened Mar 20, 2023 by
coreyfournier
2 of 4 tasks
Batch elements interfere with each other with int8
#22269
opened Mar 20, 2023 by
leonweber
2 of 4 tasks
How to load local code for model with
trust_remote_code=True
?
#22260
opened Mar 20, 2023 by
LZY-the-boys
Different outputs of the official LLaMA repo and transformers' implementation
#22259
opened Mar 20, 2023 by
yqy2001
2 of 4 tasks
Ernie-M for pretraining multilingual models
Feature request
Request for a new feature
New model
#22257
opened Mar 19, 2023 by
KnutJaegersberg
Trying to save a model with TFT5ForConditionalGeneration
#22254
opened Mar 19, 2023 by
erlichsefisalesforce
2 of 4 tasks
FlaxDataCollatorForT5MLM :ValueError: all input arrays must have the same shape
#22246
opened Mar 18, 2023 by
alexcpn
2 of 4 tasks
ImportError: cannot import name 'AlignModel' from 'transformers
#22245
opened Mar 18, 2023 by
swjtu-jason
4 tasks
How to get T5 decoded logits using TFT5ForConditionalGeneration from encoded outputs?
#22241
opened Mar 18, 2023 by
FrozenWolf-Cyber
1 of 4 tasks
Detect Accelerate's DeepSpeed level 3 Env Vars and warn if synced_gpus is False
#22231
opened Mar 17, 2023 by
JulesGM
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.HalfTensor [12, 8192, 1]], which is output 0 of AsStridedBackward0, is at version 1; expected version 0 instead. Hint: the backtrace further above shows the operation that failed to compute its gradient. The variable in question was changed in there or anywhere later.
#22225
opened Mar 17, 2023 by
Tanya-11
4 tasks
export clip to text encoder and image encoder two onnxs
New model
#22221
opened Mar 17, 2023 by
susht3
2 tasks done
Positinal Encoding for T5 family of models
Feature request
Request for a new feature
#22220
opened Mar 17, 2023 by
SreehariSankar
Previous Next
ProTip!
Updated in the last three days: updated:>2023-03-18.