Update README.md
README.md
CHANGED
@@ -49,6 +49,7 @@ print(result)
 
 ## lm-eval benchmark:
 
+```
 | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
 |---------------------------------------|------:|------|-----:|----------|---|-----:|---|-----:|
 |arc_challenge | 1|none | 0|acc |↑ |0.6186|± |0.0142|
@@ -136,4 +137,5 @@ print(result)
 | - humanities | 1|none | |acc |↑ |0.7981|± |0.0057|
 | - other | 1|none | |acc |↑ |0.8304|± |0.0064|
 | - social sciences| 1|none | |acc |↑ |0.8736|± |0.0059|
-| - stem | 1|none | |acc |↑ |0.7456|± |0.0075|
+| - stem | 1|none | |acc |↑ |0.7456|± |0.0075|
+```