clane9
/

boldgpt_small_patch10.kmq

Model card Files Files and versions

clane9 commited on Oct 28, 2023

Commit

72efd6d

·

1 Parent(s): a91432e

Update README.md

Files changed (1) hide show

README.md +6 -17

README.md CHANGED Viewed

@@ -1,18 +1,16 @@
 ---
-license: mit
 ---
-# Model card for boldgpt_small_patch10
 ![Example training predictions](example.png)
-A Vision Transformer (ViT) model trained on BOLD activation maps from [NSD-Flat](https://huggingface.co/datasets/clane9/NSD-Flat). The training objective was to auto-regressively predict the next patch with shuffled patch order.
 ## Dependencies
 - [boldGPT](https://github.com/clane9/boldGPT)
-- [huggingface_hub](https://huggingface.co/docs/huggingface_hub/index)
-- [safetensors](https://huggingface.co/docs/safetensors/index)
 ## Usage
@@ -20,25 +18,16 @@ A Vision Transformer (ViT) model trained on BOLD activation maps from [NSD-Flat]
 from boldgpt.data import ActivityTransform
 from boldgpt.models import create_model
 from datasets import load_dataset
-from huggingface_hub import hf_hub_download
-from safetensors.torch import load_model
-model = create_model("boldgpt_small_patch10")
-load_model(
-    model,
-    hf_hub_download(
-        repo_id="clane9/boldgpt_small_patch10", filename="model.safetensors"
-    ),
-)
 dataset = load_dataset("clane9/NSD-Flat", split="train")
 dataset.set_format("torch")
-batch = dataset[:1]
 transform = ActivityTransform()
 batch["activity"] = transform(batch["activity"])
-# output: (B, N, K) predicted next token logits
 output, state = model(batch)
 ```

 ---
+license: cc-by-nc-4.0
 ---
+# Model card for `boldgpt_small_patch10.kmq`
 ![Example training predictions](example.png)
+A Vision Transformer (ViT) model trained on BOLD activation maps from [NSD-Flat](https://huggingface.co/datasets/clane9/NSD-Flat). Patches were quantized to discrete tokens using k-means (`KMeansTokenizer`). The training objective was to auto-regressively predict the next patch with shuffled patch order and cross-entropy loss.
 ## Dependencies
 - [boldGPT](https://github.com/clane9/boldGPT)
 ## Usage
 from boldgpt.data import ActivityTransform
 from boldgpt.models import create_model
 from datasets import load_dataset
+model = create_model("boldgpt_small_patch10.kmq", pretrained=True)
 dataset = load_dataset("clane9/NSD-Flat", split="train")
 dataset.set_format("torch")
 transform = ActivityTransform()
+batch = dataset[:1]
 batch["activity"] = transform(batch["activity"])
+# output: (B, N + 1, K) predicted next token logits
 output, state = model(batch)
 ```