haoranxu/ALMA-13B-R


haoranxu committed on Jan 19, 2024

Commit f0a3613 (verified) · 1 parent: ef79141

Update README.md

Files changed (1)

README.md +13 -1

@@ -12,6 +12,16 @@ license: mit
       primaryClass={cs.CL}
 }
 ```
+```
+@misc{xu2023paradigm,
+      title={A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models},
+      author={Haoran Xu and Young Jin Kim and Amr Sharaf and Hany Hassan Awadalla},
+      year={2023},
+      eprint={2309.11674},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
 # Download ALMA(-R) Models and Dataset 🚀
 We release six translation models presented in the paper:
@@ -60,4 +70,6 @@ with torch.no_grad():
     generated_ids = model.generate(input_ids=input_ids, num_beams=5, max_new_tokens=20, do_sample=True, temperature=0.6, top_p=0.9)
 outputs = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)
 print(outputs)
-```
+```
+Please find more details in our [GitHub repository](https://github.com/fe1ixxu/ALMA)