cleaned_context_all_sentences_storySeeker_StoryArg_SentenceLevel
This model is a fine-tuned version of google/bigbird-roberta-base on the None dataset. It achieves the following results on the evaluation set:
- Loss: 0.5372
- Accuracy: 0.774
- Auc: 0.812
- Precision: 0.856
- Recall: 0.506
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 64
- eval_batch_size: 64
- seed: 42
- optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: linear
- num_epochs: 10
Training results
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Auc | Precision | Recall |
|---|---|---|---|---|---|---|---|
| 0.6593 | 1.0 | 83 | 0.6393 | 0.61 | 0.789 | 1.0 | 0.002 |
| 0.6303 | 2.0 | 166 | 0.6166 | 0.721 | 0.774 | 0.907 | 0.319 |
| 0.6098 | 3.0 | 249 | 0.5955 | 0.738 | 0.786 | 0.89 | 0.375 |
| 0.5901 | 4.0 | 332 | 0.5781 | 0.72 | 0.799 | 0.951 | 0.297 |
| 0.576 | 5.0 | 415 | 0.5643 | 0.754 | 0.8 | 0.878 | 0.431 |
| 0.5659 | 6.0 | 498 | 0.5537 | 0.754 | 0.806 | 0.884 | 0.427 |
| 0.5597 | 7.0 | 581 | 0.5462 | 0.763 | 0.809 | 0.872 | 0.459 |
| 0.5534 | 8.0 | 664 | 0.5409 | 0.763 | 0.812 | 0.872 | 0.459 |
| 0.5484 | 9.0 | 747 | 0.5380 | 0.772 | 0.813 | 0.865 | 0.494 |
| 0.5461 | 10.0 | 830 | 0.5372 | 0.774 | 0.812 | 0.856 | 0.506 |
Framework versions
- Transformers 4.57.1
- Pytorch 2.8.0+cu126
- Datasets 4.0.0
- Tokenizers 0.22.1
- Downloads last month
- 15
Model tree for aayush7511/cleaned_context_all_sentences_storySeeker_StoryArg_SentenceLevel
Base model
google/bigbird-roberta-base